This tool will convert special characters (including Chinese; Japanese hiragana, katakana, and kanji; tonal Hanyu Pinyin; and Cyrillic script) into the decimal (not hex) form of Unicode numerical character references (NCRs). These NCRs can then be used in Emails or Web pages, with greater ease than the Chinese characters themselves for those whose operating systems or other software don’t handle double-byte text well.
For example, 台北, the characters for “Taipei” (Táiběi), can also be written as &#21488;北. For this to work, the “charset” of the Web page should be set to Unicode:
<meta http-equiv="content-type" content="text/html; charset=utf-8" />
It’s generally better, however, to use the characters themselves rather than their Unicode NCRs in cases where a Web page has a lot of Chinese text, because Chinese characters take up less file space than their NCRs.
Sample text: 台北 Táiběi.