Please don't reprint this article; please don't re-publish this article in any form; please delete it within 24 hours of downloading this article; it is forbidden to use this article for commercial purposes.
2 Lexical conventions [lex] 2.2 Character sets [lex.charset] 2 lexical Lexical Conventions [2.2] [lexical character set character set.} The basic source character set consists of 96 characters: the space character, the control characters representing horizontal tab, Vertical Tab, Form Feed, And New-line, Plus the Following 91 Graphical Characters: 15) Abcdefghijklmnopqrstu vwxyz Abcdefghijklmnopqrstu vwxyz 0 1 2 3 4 5 6 7 8 9_ {} [] # () <>%:;.? * - / ^ & | ~! =, / "The basic source character set consists of 96 characters: space characters, representation of horizontal table, vertical table, change page, wrap control character, plus the following 91 graphics characters: 15 Abcdefghijklmnopqrstu vwxyz abcdefghijklmnopqrstu vwxyz 0 1 2 3 4 5 6 7 8 9 _ {} [] # () <>%:;.? * - / ^ & | ~! =, / "'The universal-character-name Construct Provides A Way to Name Other Characters. Hex-quad: Hexadecimal-Digit Hexadecimal-Digit Hexadecimal-Digit Hexadecimal-Digit Universal-Character-Name: / U HEX-Quad / U HEX -quad hex-quad The character designated by the universal-character-name / UNNNNNNNN is character whose character short name in ISO / IEC 10646 is NNNNNNNN that; the character designated by the universal-character-name / uNNNN is that character whose character short name in ISO / IEC 10646 is 0000NNNN. If the hexadecimal value for a universal character name is less than 0x20 or in the range 0x7F-0x9F (inclusive), or if the universal character name designates a character in the basic source character set, then The program is ill-factoryd. The unified character name provides a configuration named for other characters.
HEX-four groups: hexadecimal digital hexadecimal digital hexadecimal number hexadecimal digital unified character name: / u HEX-four group / u HEX-four group HEX-four groups Unified Character Name / Unnnnnnnn The specified character is a character with short name nnnnnnnn in ISO / IEC 10646; the character specified by the Unified Character Name / Unnnn is a character in ISO / IEC 10646 with a short name 0000nnnn. If a hexadecimal value of a unified character name is less than 0x20 or between 0x7F-0x9f (included), or if a unified character name specified is in the basic source character set, the program is a pathological form. The basic execution character set and the basic execution wide-character set shall each contain all the members of the basic source character set, plus control characters representing alert, backspace, and carriage return, plus a null character (respectively, null wide character), whose representation has all zero bits. For each basic execution character set, the values of the members shall be non-negative and distinct from one another. The execution character set and the execution wide-character set are supersets of the basic execution character set and the basic execution wide-character set, respectively. The values of the members of the execution character sets are implementation-defined, and any additional members are locale-specific. basic execution character set and basic execution wide-character set should include all the basic source Members of the character set plus the control characters representing alert, back, and carriage return, plus an invalid character (invalid wide character) that behaves as a full zero bit. The value of any member of each basic execution character set should be a non-negative number and distinguishable. Performing a character set and performing a wide character set is a supercoming character set and basic execution wide character set, and the value value of the respective execution character set is defined by implementation, and any additional member is the site specified.