Software Technology Frontier: About XML and RSS (2) - Learning XML - "No-Nonsense XML"


Wednesday, January 28, 2004, 1:25:59

XML is a basket: anything can be thrown into it. I first came across the idea of XML on a Japanese forum, and it struck me as nothing special. Just another bubble: very pretty, but once blown it leaves only a puddle.

How a standard gets developed is the more interesting part: why it has to be developed at all, how competing interests are coordinated, how compatibility with existing systems is kept, and how extensibility is maintained.

In the evening I read "No-Nonsense XML", a book from Taiwan that explains deep material in simple terms and is fun to read. Even so, I still have no real feel for it. CSS is nothing new, and HTML even less so; what is there left to hype in this day and age?

Put simply, XML is a tag language similar to HTML, but with stricter syntax. To be well-formed, a document must follow these rules:

1. Tags must be closed, that is, they must come in pairs.
2. Tags cannot cross; they can only nest. (I have always hated crossed tags in HTML. Good, now there is finally a standard against them.)
3. All attribute values must be quoted. (This is the mistake I always make: I never write the quotes because it is a hassle.)
4. Turn on white.
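The first three rules above can be tried out with any XML parser. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; the helper name `is_well_formed` and the sample strings are made up for illustration and are not from the book.

```python
import xml.etree.ElementTree as ET


def is_well_formed(text: str) -> bool:
    """Return True if `text` parses as well-formed XML (hypothetical helper)."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# Illustrative samples, one per rule discussed above.
samples = {
    "paired and nested": '<a><b id="1">text</b></a>',
    "unclosed tag": "<a><b>text</a>",
    "crossed tags": "<a><b>text</a></b>",
    "unquoted attribute": "<a id=1>text</a>",
}

for label, text in samples.items():
    print(f"{label}: {is_well_formed(text)}")
```

Only the first sample parses; an HTML browser would typically tolerate all four, which is the strictness difference the rules describe.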

The purpose of pushing XML:

1. The most critical purpose: paving the way for mobile applications. (To think I worked out something this obvious on my own. Nice!) The evening news reported that Hong Kong has begun to offer 3G service. Flashy, messy HTML code and a wide variety of plugins need a very large browser program to interpret them. Embedded devices such as mobile phones, PDAs, and perhaps smart home appliances in the future have very limited resources, and would probably struggle to display these web pages as designed. But mobile applications happen to be the market with the most potential in the future, and the capitalists will definitely not let it go.

Character encodings leave me a little dizzy, so here is a summary.

ASCII: 1 byte. Nothing more to say.

ISO 8859-1: commonly known as Latin-1. Western European letters; its first 128 codes are the same as ASCII.

Unicode: 2 bytes (for the basic 16-bit range).

    The first 256 code points (high byte 0x00) start with ASCII.
    Unihan, the unified Chinese characters (China, Japan, and South Korea), are distributed in 0x3400-0x9FFF.
    The characters of Big5 and GB2312 fall in 0x4E00-0x9FFF.
    0xD800-0xDFFF, 2,048 positions in all, are reserved.
    0xE000-0xF8FF, 6,400 positions in all, are kept as a private use area.
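The position counts quoted above follow directly from the range bounds. Here is a small sketch that checks the arithmetic and looks up which range a character falls in; the range labels and the function names `size` and `classify` are made up for this example.

```python
# Ranges as quoted in the notes above (inclusive bounds); labels are illustrative.
RANGES = {
    "first 256 code points": (0x0000, 0x00FF),
    "CJK ideographs": (0x3400, 0x9FFF),
    "reserved (surrogates)": (0xD800, 0xDFFF),
    "private use": (0xE000, 0xF8FF),
}


def size(name: str) -> int:
    """Number of positions in a named range, counting both ends."""
    lo, hi = RANGES[name]
    return hi - lo + 1


def classify(cp: int) -> str:
    """Return the label of the first listed range containing code point `cp`."""
    for name, (lo, hi) in RANGES.items():
        if lo <= cp <= hi:
            return name
    return "other"


print(size("reserved (surrogates)"))  # 2048
print(size("private use"))            # 6400
print(classify(ord("A")))             # first 256 code points
print(classify(ord("中")))             # CJK ideographs
```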


UTF-8: variable length, 1-3 bytes for the 16-bit range (code points above 0xFFFF take 4 bytes). Unicode converts to UTF-8 as follows:

    0x0000-0x007F -> unchanged, maps directly to 0x00-0x7F (1 byte)
    0x0080-0x07FF -> 110xxxxx 10xxxxxx
    0x0800-0xFFFF -> 1110xxxx 10xxxxxx 10xxxxxx

UTF-8 effectively solves the "half a Chinese character" problem. Double-byte encodings must be scanned from the beginning to determine character boundaries, and once a reader is misaligned it has to wait for the next ASCII character to recover. In UTF-8, continuation bytes always have the form 10xxxxxx, so a character boundary can be recognized from any position in the stream.
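The conversion table above can be written out as bit operations. This is a minimal sketch limited to the 16-bit range covered by the table; `utf8_encode_bmp` and `next_boundary` are illustrative names, and Python's built-in `str.encode` is used only as a cross-check.

```python
def utf8_encode_bmp(cp: int) -> bytes:
    """Encode one code point in 0x0000-0xFFFF following the table above."""
    if not 0 <= cp <= 0xFFFF:
        raise ValueError("this sketch only covers 0x0000-0xFFFF")
    if 0xD800 <= cp <= 0xDFFF:
        raise ValueError("reserved surrogate positions are not characters")
    if cp <= 0x7F:
        return bytes([cp])                                   # 0xxxxxxx
    if cp <= 0x7FF:
        return bytes([0xC0 | (cp >> 6),                      # 110xxxxx
                      0x80 | (cp & 0x3F)])                   # 10xxxxxx
    return bytes([0xE0 | (cp >> 12),                         # 1110xxxx
                  0x80 | ((cp >> 6) & 0x3F),                 # 10xxxxxx
                  0x80 | (cp & 0x3F)])                       # 10xxxxxx


def next_boundary(data: bytes, i: int) -> int:
    """Skip continuation bytes (10xxxxxx) to reach the next character start."""
    while i < len(data) and (data[i] & 0xC0) == 0x80:
        i += 1
    return i


for ch in ("A", "é", "中"):
    encoded = utf8_encode_bmp(ord(ch))
    assert encoded == ch.encode("utf-8")   # cross-check against the built-in codec
    print(ch, encoded.hex())

data = "a中b".encode("utf-8")   # 61 e4 b8 ad 62
print(next_boundary(data, 2))  # index 2 is mid-character; the next start is 4
```

The second function shows the recovery property: landing in the middle of a character costs at most two skipped bytes in this range, with no need to rescan from the start.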

UTF-16: basically the same as Unicode, but it adds the concept of surrogate pairs. Unicode reserves 2,048 positions in 0xD800-0xDFFF. This area is divided into a high half and a low half: the first part (high) is 0xD800-0xDBFF and the second part (low) is 0xDC00-0xDFFF. A high and low value together (4 bytes) can represent more than one million additional characters (1024 × 1024 = 1,048,576). Together with the Unicode characters that do not need surrogates, this makes up UTF-16. The high and low ranges do not overlap, which again solves the multi-byte boundary problem.

Questions:

1. A long time ago (at least three years back) there were already files in .shtml format. Do they have anything to do with XHTML? To be checked.
2. Will files in the future be saved as XML as a general-purpose format?

Too weak; this is just a joke.

Posted by WindyWong on 2004-1-28 1:25:59
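The surrogate-pair arithmetic from the UTF-16 notes above can be sketched as follows. The 0x10000 offset comes from the UTF-16 definition rather than from the notes, and the function names are made up for this example.

```python
HIGH_START, LOW_START = 0xD800, 0xDC00   # starts of the high and low halves
OFFSET = 0x10000                         # first code point beyond the 16-bit range


def to_surrogates(cp: int) -> tuple[int, int]:
    """Split a code point in 0x10000-0x10FFFF into a (high, low) pair."""
    if not OFFSET <= cp <= 0x10FFFF:
        raise ValueError("only code points above 0xFFFF need a surrogate pair")
    v = cp - OFFSET                      # a 20-bit value: 10 bits per half
    return HIGH_START + (v >> 10), LOW_START + (v & 0x3FF)


def from_surrogates(high: int, low: int) -> int:
    """Recombine a (high, low) pair into the original code point."""
    return OFFSET + ((high - HIGH_START) << 10) + (low - LOW_START)


high, low = to_surrogates(0x20000)       # a CJK character outside the 16-bit range
print(hex(high), hex(low))               # 0xd840 0xdc00
print(hex(from_surrogates(high, low)))   # 0x20000
print(1024 * 1024)                       # 1048576 combinations of high and low
```

Because every high value lies in 0xD800-0xDBFF and every low value in 0xDC00-0xDFFF, a reader looking at a single 16-bit unit can tell which half of a pair it is holding.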

When reposting, please cite the original address: https://www.9cbs.com/read-18337.html
