About GB18030 Chinese character coding standard set
Http://tech.sina.com.cn 2001/07/26
CCID News - China Computer News
Linning
Master of Economic Management in Tsinghua University, Deputy Director of the Institute of Electronic Industry, Ministry of Information Industry, the deputy secretary-general of the National Information Technology Standardization Technical Committee, undertake a number of national projects, and published many books. National Standard GB18030-2000 "Exchange of the basic set of Chinese characters coded character set" is the most important Chinese character encoding standard after GB2312-1980 and GB13000-1993, is one of the basic standards that the computer system must follow in the future. In order to ensure the successful implementation of the standard, the State Quality Supervision Administration will first enforce inspection of a wide range of computer operating systems from September 1. Any product that does not meet the standard is not qualified. To this end, the National Information Technology Standardization Technical Committee will organize standard compliance testing on the main operating system products in the market according to relevant standards and norms. Detection requirements and standard development 1. Product range must be detected GB18030 is the basic standards that the information products must follow, and considering some objective practical, adopting a strategy from the foundation, step-by-step implementation. This time the scope of the product must be specified as follows: ● Personal computer operating system products must be standard compliance testing, other products do not require requirements; ● All official release before GB 18030 release (March 17, 2000) Or the factory product is deemed to have a historic product, not within the supervision and inspection of GB 18030; ● After the update version or upgrade version of historic products after March 17, 2000, it is treated as a new product; ● Any implementation of GB 18030 During the transition period (ie, from March 17, 2000 to August 31, 2001), it should meet the relevant requirements of GB 18030. Products that do not meet the standard requirements should adopt remedies to meet the requirements of standards. The remedial measures should be recognized by the National Information Technology Standardization Technical Committee; ● Any product officially released or factory-free after GB 18030 transition period (i.e., August 31, 2001) must comply with GB 18030. 2. About the standard compliance test In order to cooperate with the implementation of GB 18030, the Information Processing Product Standard Compliance Testing Center (located in the Electronic Industry Standardization Institute of Information Industry) has carried out the preparation of GB 18030 testing, and more domestic and foreign A product was tested. In order to guide the standard implementation, GB 18030 is implemented on the product as soon as possible, and the information processing product standard compliance test center has proposed "GB 18030 standard compliance detection specification" in November 2000. "Detection Specification" made a clear and detailed provision for the software and hardware environment, test requirements, testing steps, applications used by the test. The test general requirements are as follows: ● Word exchange integrity: The word distribution range of products should be all characters given in the national standard GB 18030; ● Systematic correctness: Products must be able to correctly identify and process text files encoded in accordance with national standard GB 18030 . It should be noted that the detection range does not include embedded systems, such as PDA, mobile phones; single-byte currency symbols are not within the detection range; the operating system is the near future. 3. Support for minority texts ● Products should have the ability to support GB 18030 stipulated in my country's ethnic minority text coding space; ● We sell products in ethnic minority areas in my country, encourage local minority fonts and input methods. 4. According to the international practice, the standard GB 18030 has been included in 2,7484 Chinese characters. The total coded space exceeds 1.5 million code bits. In order to solve the name of the name, the name of the name, the name of the name is provided, providing unified information for Chinese character research, ancient books, etc. Platform basis. At present, most of our computer systems are still encoded with GB 2312. The GB 18030 is associated with GB 2312, which is better to solve the problem of converting the new system to the new system, and the transformation cost is small.
From the perspective of my country's information technology and information industry development, considering solving the needs of my country and solves the compatibility of existing systems and support for multiple operating systems, using GB 18030 is currently better choices in my country, and GB 13000.1 More applicable to information exchange in the future. Considering the compatibility of GB 18030 and GB 13000, the standard drafting group has prepared the code mapping table of GB 18030 and GB 13000.1 such that the two encoding systems can be freely converted. At the same time, GB 18030 Basic Patient Base Library has also been developed. Many countries and regions of the world have developed corresponding coding standards and internal code systems from the perspective of facilitating their national and national applications, such as Japanese JIS X 0208 and JIS X 0212, South Korea's KS C 5601 and KS C 5657, etc. The access practice used internationally. The GB 18030 is also in line with international practice. It is fully compatible with GB 2312. It is compatible with GB 13000.1 on the word. It can make full use of existing resources to ensure compatibility between different systems, maximize resource, and huge for my country's software industry. Expansion capacity. It can believe that the implementation of GB 18030 will facilitate the development of domestic software and form a scale, making my country's Chinese information technology in a step. From the perspective of the New Standard in 1980, my country promulgated the first Chinese character encoding character set standard, that is, GB 2312-80 "Information exchange with Chinese characters coded character set." The standard has received 6,763 Chinese characters and common symbols, which laid the foundation of Chinese information processing. With the expansion of international exchanges and cooperation, information processing applications put forward multi-cultural, large-order, multi-purpose requirements for character sets. In 1993, the International Standardization Organization issued ISO / IEC 10646-1 "Information Technology General Multi-eight-bit coding character sets first partial architecture and basic multi-cultural plane". my country is equivalent to this standard to develop GB 13000.1-1993. This standard uses a new multi-cultural encoding system, including 20902 Chinese characters, Japan, and Korea, is the future direction of the coding system. Since its new coding system is not compatible with existing most operating systems and external devices, its implementation still needs to have a process, and it is still not fully solving the urgent needs of my country's current application. Considering that the GB 13000 is completely achieved, and the continuation of the GB 2312 encoding system and the effective utilization and transition of existing resources and systems, we chose to expand on the GB 2312 (GB 2311), and in the word Up to GB 13000.1 compatible solution, develop a new standard - Chinese character encoding base set, which further improves GB 2312 to meet the urgent needs of my country's postal, household government, finance, geographic information systems and other applications. This project industry has been included in the National Standards for 1998. In October 1998, the Information Industry Electronics, Peking University Computer Technology Research Institute, Peking University Group, Xintiandi Company, Siyong New Century Company, Chinese Academy Software, Great Wall Software Company, ChinaSoft Corporation, Jinshan Software Company and Lenovo The company's technical staff forms a standard drafting group. During the standard development process, the National Information Technology Standardization Technical Committee has collected several standard drafting groups and well-known companies to fully study the standards, and invited Microsoft, Hewlett-Packard, Sun and IBM companies to participate. opinion. The standard drafting group has been repeatedly constructed and verified, and the standard development principle is proposed. The factual internal code standards corresponding to the GB 2312 information processing exchange code are supported, and all of the Niki, Japan, Japan (CJK) of GB 13000.1 on the word. Unified Chinese characters and all CJK expand A characters, and determine the encoding system and 27484 Chinese characters, forming compatibility, scalability, and forward-looking programs. The Ministry of Information Industry and the former National Bureau of Quality and Technical Supervision jointly issued this standard in January 17, 2000, namely GB 18030-2000 "Information Technology Information Exchange with Chinese Character Coding Character Set Basic Set". This standard is implemented as a national mandatory standard from the date of issuance, and the transition period is from August 31, 2001.