Introduction to Chinese Character Encodings

xiaoxiao2021-03-06  49

Common encodings used for Chinese text include single-byte encodings, GB2312-80, GB12345-90, GBK, the Unicode encoding, the Unicode character set, and BIG5. A brief introduction to several of them follows.

1. GB2312-80

The full name is GB2312-80, "Chinese Character Coded Character Set for Information Interchange: Basic Set". Published in 1980, it is the national standard for Chinese information processing. In mainland China and in overseas regions that use Simplified Chinese (such as Singapore), it is the only Chinese encoding whose use is mandatory. P-Windows 3.2 and Apple OS use GB2312 as their basic Chinese character encoding. Windows 95/98 uses GBK as its basic Chinese character encoding but remains compatible with GB2312.

Double-byte encoding range: A1A1 to FEFE
A1-A9: symbol area, containing 682 symbols
B0-F7: Chinese character area, containing 6,763 Chinese characters

2. GB12345-90

In 1990 the encoding standard for Traditional Chinese, GB12345-90, "Chinese Character Coded Character Set for Information Interchange: First Supplementary Set", was established to standardize the encoding of Traditional characters. The standard contains 6,866 Chinese characters in total (103 more than GB2312, which does not include these characters), of which about 2,200 are purely Traditional forms.

Double-byte encoding range: A1A1 to FEFE
A1-A9: symbol area, with symbols for vertical typesetting added
B0-F9: Chinese character area, containing 6,866 Chinese characters

3. Unicode encoding

In April 1984 the International Organization for Standardization set up the ISO/IEC JTC1/SC2/WG2 working group to develop a unified encoding for the world's scripts and symbols. In 1991, US multinational companies founded the Unicode Consortium, which reached an agreement with WG2 in October 1991 to adopt the same coded character set. As of version 2.0, Unicode is a 16-bit encoding system whose character set is identical to the BMP (Basic Multilingual Plane) of ISO 10646; later versions also assign characters outside the BMP. Unicode passed the DIS (Draft International Standard) stage in June 1992. Version 2.0, published in 1996, contains 6,811 symbols, 20,902 Chinese characters, 11,172 Hangul syllables, 6,400 private-use code points, and 20,249 reserved code points, for a total of 65,534.

4. GBK encoding

GBK is an extended Chinese encoding specification developed in mainland China, with a Chinese character repertoire that corresponds to that of UCS. The GBK working group began in October 1995 and completed the GBK specification in December of the same year. The encoding is compatible with GB2312. It contains 21,003 Chinese characters and 883 symbols, provides 1,894 code points for user-defined characters, and combines Simplified and Traditional characters in a single set. The surface encoding of the font library in the Simplified Chinese version of Windows 95/98 is GBK, which is linked to the underlying font library through a one-to-one code table between GBK and UCS.
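The double-byte ranges listed for GB2312-80 and the GB2312 compatibility of GBK can be checked with a few lines of Python. This is only a minimal sketch: the helper names `classify_gb2312` and `in_gb2312` are made up for illustration, and it assumes that Python's built-in `gb2312` and `gbk` codecs are an adequate stand-in for the standards described above.

```python
def classify_gb2312(pair: bytes) -> str:
    """Classify one double-byte GB2312 code by its lead byte (hypothetical helper)."""
    if len(pair) != 2:
        raise ValueError("expected exactly two bytes")
    lead, trail = pair
    # Both bytes must lie in A1..FE for the code to fall inside A1A1-FEFE.
    if not (0xA1 <= lead <= 0xFE and 0xA1 <= trail <= 0xFE):
        return "outside A1A1-FEFE"
    if 0xA1 <= lead <= 0xA9:
        return "symbol"   # A1-A9: symbol area
    if 0xB0 <= lead <= 0xF7:
        return "hanzi"    # B0-F7: Chinese character area
    return "other"        # lead byte not covered by the ranges listed above


def in_gb2312(ch: str) -> bool:
    """Return True if Python's gb2312 codec can encode the character (hypothetical helper)."""
    try:
        ch.encode("gb2312")
        return True
    except UnicodeEncodeError:
        return False


gb = "中文".encode("gb2312")
print(gb.hex())  # d6d0cec4: two bytes per character
print([classify_gb2312(gb[i:i + 2]) for i in range(0, len(gb), 2)])

# GBK is compatible with GB2312, so the same text yields the same bytes.
print("中文".encode("gbk") == gb)

# The Traditional form 龍 is outside GB2312 but can be encoded in GBK.
print(in_gb2312("龍"), len("龍".encode("gbk")))
```

The classification looks only at the lead byte because both listed areas are defined by lead-byte rows. The trail byte is checked only against the overall A1 to FE bounds.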

When reposting, please cite the original address: https://www.9cbs.com/read-85328.html
