Garbled full (5)

zhaozj2021-02-08  289

1. GB code and BIG5 code

GB code is a Chinese character encoding method used in countries and regions such as mainland China, Singapore. The BIG5 code is a Chinese character encoding method used in Taiwan Province. Their coding method is completely different, and the conversion between them can only be done by the "Characterization Method". Therefore, the method of conversion is simple, difficult is "table" generation. Many articles have been introduced here, I will not be detailed here. In my homepage, I have the source of "Chinese Characters Transcodent V1.0" I wrote, with these two "tables", which can be used directly.

2. Hz code

The Hz code is to enable the encoding defined by the mail server or gateway that can only transmit 7 bit information, and is also a common coding in Chinese. It and the quoted-printable code described above can only be encoded, ie the control character is ignored when encoding.

This coding is also very well recognized: there are many "~ {" and "~}", and always appear. The following is an example of the Hz code:

~ {! 6BRBKKC7 (4SH ! 7 ~}

~ {Wwu_ ~}: Mogao ~ {#, 0WTF; f: wu> # (~} telnet: //202.112.20.132: 23 ~ {) 3IT1! # ~}

~ {D * 8_hm <~ 9 $ wwjr #: ~} http://mogao.bentiun.net

Emailto: Mogao@371.net

***************************************************

* ~ {3} ak

***************************************************

You can open the "Antarctic Star" to see this text.

Its algorithm is simpler: read a character, if it is an 8-bit character, turn its highest bit clear. Enclose the output of the continuous 8-bit character to zero zero to "~ {" and "~}". When decoding: That is "1" "1" in the 8th position of the part of "~ {" and "~}".

The conversion between the three codes described above is often met. I wrote "Chinese character transcodent v1.0" can be easily converted between these three types, I am open to the source of netizens. Learn.

3. Other commonly used coding

Unicode

The most typical example in the Unicode application is: IE4 or more version of HTML is encoded. It can be said to be the only character set under Windows. But it is still very imperfect, and WIN95 and WIN98 have also very limited support, and it does not even have a complete set of standards. However, Microsoft's latest office2000 and the Windows 2000 to be introduced immediately will fully support Unicode. Unicode replaces other codes will be an inevitable trend. However, in the past two years Unicode does not dominate, it is after the dominant position, because the operating system is different, other codes will not die immediately. Its Chinese information can be found in the documentation in Office2000 and Windows2000, and its official website is: http://www.unicode.org/.

2. Binhex

Binhex encoding is a Macintosh computer (that is, commonly known as "Apple Computer") uses a coding method that printed / transmits binary files with printable characters. Its main use is the attach binaries in the email program. Most email programs do not support this format (Eudora support), but use WinZip to decode. Please check it out

Macintosh

Computer band related documentation.

转载请注明原文地址:https://www.9cbs.com/read-1245.html

New Post(0)