协会是什么意思| 早孕有什么反应| 卡粉是什么原因引起的| 坐骨神经痛吃什么药好得快| 魄力是什么意思| 胃大肚子大是什么原因| 白细胞计数偏低是什么原因| 天使什么意思| 得艾滋病有什么症状| 晚黄瓜什么时候种| 似乎的近义词是什么| 慢性前列腺炎有什么症状| 肛门里面有个肉疙瘩是什么| 膳食是什么| 月经不调是什么原因| 北京大学校长是什么级别| 医院属于什么单位| 成都是什么气候| 门当户对指的是什么| 缺维生素d吃什么| 痛经什么原因| 妇科炎症吃什么消炎药效果好| 清除胃火吃什么药| 蓝莓葡萄是什么品种| 彼岸花代表什么星座| 无厘头什么意思| 欲壑难填是什么意思| 婴儿42天检查什么项目| 物以类聚什么意思| 为什么同房后小腹疼痛| 市公安局政委是什么级别| 端午节都吃什么菜好| 女性备孕吃什么养卵泡| 再生牙技术什么时候能实现| 牙齿为什么会变黄| mpa是什么意思| 不什么不什么的词语| 周期是什么| 耳道炎是什么原因引起的| 异地办理护照需要什么材料| 刚拔完牙需要注意什么| 盆腔肿物是什么意思| 澳门什么时候回归| kobe是什么意思| 尿液中有泡沫是什么原因| 辰龙是什么意思| mt是什么意思| 带id是什么意思| 阴囊湿疹用什么药效果最好| 短发女人吸引什么男人| 硅是什么| 汉族是什么人种| 生姜什么时候种植最合适| 手掌上的三条线分别代表什么| 女人脚发热是什么原因| 白头发有什么方法变黑| 内裤上有黄色分泌物是什么原因| 晚上9点多是什么时辰| 四月初一是什么星座| ask是什么意思| 身怀六甲什么意思| 吃什么东西对肝脏好| 4是什么生肖| 什么的月饼| 头晕用什么药好| 什么时候种白菜| 外阴萎缩是什么症状| 嘴贱什么意思| 宫颈锥切术是什么意思| 指甲变厚是什么原因| 刀郎和那英是什么关系| 乙肝前s1抗原阳性是什么意思| 子宫内膜厚什么原因引起的| 支气管炎是什么症状| 榻榻米床垫什么材质的好| 海南有什么水果| 今年是什么年| 5月3号是什么星座| 吃了避孕药有什么副作用| 婴儿什么时候长牙| 普贤菩萨的坐骑是什么| kappa是什么牌子| 胆固醇高对身体有什么危害| 梦见蛇吃人代表什么预兆| 走路不稳是什么原因| 什么是乳腺结节| 什么东西软化鱼刺最快| 来例假肚子疼是什么原因| 用什么泡水喝补肾| 囊内可见卵黄囊是什么意思| 世态炎凉什么意思| 红茶适合什么季节喝| 阿尔茨海默症是什么病| 真人是什么意思| 最快的速度是什么| 唾液酸苷酶阳性什么意思| 最好的避孕方法是什么| 流浪猫吃什么| 什么叫钙化了| 孕酮是什么意思| 怀孕20天有什么症状| 集合是什么| 嫦娥住的宫殿叫什么| 两色富足间是什么生肖| 为什么有胎记| 狸猫是什么猫| 窦性心律早期复极是什么意思| 台湾是什么民族| 95511是什么电话| 小腿酸胀是什么原因| 烧火棍是什么意思| 牛蛙和青蛙有什么区别| 闽南语懒觉是什么意思| 辜负是什么意思| 面色发黄是什么原因| 荷兰的国花是什么花| 百合与什么搭配最好| rhubarb是什么意思| 帅t是什么意思| 2003年的羊是什么命| 双氧水是什么东西| 女人下身干燥无水是什么原因| 睡觉时头晕是什么原因| 糖化血红蛋白是什么意思| 生闷气容易得什么病| 广东有什么烟| 什么是资本运作| 血氨是什么| 领证需要准备什么| 麦粒肿吃什么药| 睾丸大是什么原因| 铁锈是什么颜色的| cbd是什么意思| 脂膜炎是什么原因引起的| 口腔溃疡为什么那么痛| 手关节黑是什么原因| 为什么血液是红色的| 吃什么补气血| 人参片泡水喝有什么功效和作用| 三点是什么时辰| 400年前是什么朝代| 十二年是什么婚| 大年初一是什么星座| 促排是什么意思| 百合什么时候开花| 口下面一个巴念什么| 血冲脑是什么原因引起| 什么是尿崩症| 纤维蛋白原是什么| sk是什么牌子| cd8高是什么原因| 易建联为什么不打nba| 小孩割包皮挂什么科室| 没有精液是什么原因| 补锌吃什么| 车厘子是什么季节的| 女性体毛多是什么原因| 什么是血友病| 为什么要做肠镜检查| 感冒了吃什么饭菜合适| 甲状腺低是什么意思| 招字五行属什么| 叶酸片什么时候吃| 说话不清楚去医院挂什么科| 怹是什么意思| 子宫长什么样| 湖南什么山最出名| 我做错了什么| 肚脐眼疼是什么原因| 上海话十三点是什么意思| 什么叫肿瘤| 世界上最多的动物是什么| 眼睛近视缺什么维生素| 出家当和尚有什么要求| 逝去是什么意思| 去医院打耳洞挂什么科| 玄关画挂什么图最好| 女人梦见蛇预示着什么| 月经期间吃什么对身体好| 一直拉肚子吃什么药| 蜜蜡是什么材料| 聚宝盆什么意思| 什么中生什么| 公务员和事业编有什么区别| 水是什么生肖| 卡西欧手表什么档次| 肩周炎属于什么科室| 国家穷为什么不多印钱| 湿疹为什么一热就出来| 嗓子老有痰是什么原因| nak是什么牌子| 手心干燥是什么原因| 心血管疾病做什么检查| nbr是什么材质| 木瓜什么时候成熟| 梦见下大雨是什么意思| 玄胡又叫什么| 胃不好吃什么蔬菜| 高血压吃什么| 为什么医院都让喝雀巢奶粉| 养寇自重什么意思| 大便发黑是什么情况| 梦见小麦粒是什么意思| 临床什么意思| 万象更新是什么生肖| 眼窝凹陷是什么原因| 发际线高适合什么发型| 空气净化器什么牌子好| acg文化是什么意思| 左侧腰疼是什么原因| 一箭双雕是指什么生肖| 吃榴莲不能和什么一起吃| 梦见做春梦是什么意思| 裳能组什么词| 破釜沉舟是什么意思| 三羊开泰什么意思| 猫毛过敏吃什么药| 为什么晚上睡觉老是做梦| 腰底部疼痛跟什么病有关| 腹部ct平扫能检查出什么| 衣冠禽兽什么意思| 回绝是什么意思| 老子叫什么名字| trab是甲状腺什么指标| 指甲变形是什么原因| 什么是情感障碍| 例假提前来是什么原因| 男人吃什么药时间长| 2000年为什么叫千禧年| 梦到自己开车是什么意思| 木字旁加差是什么字| 秋天的落叶像什么| 孕妇吃山竹对胎儿有什么好处| 公安局跟派出所有什么区别| 刻舟求剑的求是什么意思| 鸡鸡长什么样| 什么气组词| 应景是什么意思| 什么人不能坐飞机| 什么一笑| 吃什么能快速减肥| 黑色素痣看什么科| 讳莫如深是什么意思| 7点到9点是什么时辰| 梦遗太频繁是什么原因造成的| 什么大什么粗| 气血不足看什么科室| 马头琴是什么族的乐器| 疱疹性咽峡炎吃什么药最管用| 毛主席什么时候去世| 讲信修什么| 漱口杯什么材质好| 泰山山顶叫什么| 霍山石斛有什么作用| 晚上失眠是什么原因| 葬礼穿什么衣服| 不明原因腿疼挂什么科| 十一月九号是什么星座| 金火是什么生肖| 萤火虫为什么会发光简单回答| 皮肤过敏吃什么药好| 23度穿什么衣服合适| 七叶一枝花主治什么病| 1979年什么命| 百度Jump to content

青海省“多重医疗保障”全力阻击“因病致贫”

From Wikipedia, the free encyclopedia
百度 保监会相关负责人认为,在成熟的市场化环境下,充分竞争可以促进市场主体提高效率、改善服务,有利于市场发展,有利于消费者。

Universal Coded Character Set
Alias(es)UCS, Unicode
Language(s)International
StandardISO/IEC 10646
Encoding formatsUTF-8, UTF-16, GB 18030
Less common: UTF-32, BOCU, SCSU, UTF-7
Preceded byISO/IEC 8859, ISO/IEC 2022, various others

The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS) (plus amendments to that standard), which is the basis of many character encodings, improving as characters from previously unrepresented writing systems are added.[1]

The UCS has over 1.1 million possible code points available for use/allocation, but only the first 65,536, which is the Basic Multilingual Plane (BMP), had entered into common use before 2000. This situation began changing when the People's Republic of China (PRC) ruled in 2006 that all software sold in its jurisdiction would have to support GB 18030. This required software intended for sale in the PRC to move beyond the BMP.[clarification needed][2]

The system deliberately leaves many code points not assigned to characters, even in the BMP. It does this to allow for future expansion or to minimise conflicts with other encoding forms.

The original edition of the UCS defined UTF-16, an extension of UCS-2, to represent code points outside the BMP. A range of code points in the S (Special) Zone of the BMP remains unassigned to characters. UCS-2 disallows use of code values for these code points, but UTF-16 allows their use in pairs. Unicode also adopted UTF-16, but in Unicode terminology, the high-half zone elements become "high surrogates" and the low-half zone elements become "low surrogates".[clarification needed]

Another encoding, UTF-32 (previously named UCS-4), uses four bytes (total 32 bits) to encode a single character of the codespace. UTF-32 thereby permits a binary representation of every code point (as of year 2024) in the APIs, and software applications.

History

[edit]

The International Organization for Standardization (ISO) set out to compose the universal character set in 1989, and published the draft of ISO 10646 in 1990. Hugh McGregor Ross was one of its principal architects.

This work happened independently of the development of the Unicode standard, which had been in development since 1987 by Xerox and Apple.

The original ISO 10646 draft differed markedly from the current standard. It defined:

  • 128 groups of
  • 256 planes of
  • 256 rows of
  • 256 cells,

for an apparent total of 2,147,483,648 characters, but actually the standard could code only 679,477,248 characters, as the policy forbade byte values of C0 and C1 control codes (0x00 to 0x1F and 0x80 to 0x9F, in hexadecimal notation) in any one of the four bytes specifying a group, plane, row and cell. The Latin capital letter A, for example, had a location in group 0x20, plane 0x20, row 0x20, cell 0x41.

One could code the characters of this primordial ISO/IEC 10646 standard in one of three ways:

  1. UCS-4, four bytes for every character, enabling the simple encoding of all characters;
  2. UCS-2, two bytes for every character, enabling the encoding of the first plane, 0x20, the Basic Multilingual Plane, containing the first 36,864 codepoints, straightforwardly, and other planes and groups by switching to them with ISO/IEC 2022 escape sequences;
  3. UTF-1, which encodes all the characters in sequences of bytes of varying length (1 to 5 bytes, each of which contain no control codes).

In 1990, therefore, two initiatives for a universal character set existed: Unicode, with 16 bits for every character (65,536 possible characters), and ISO/IEC 10646. The software companies refused to accept the complexity and size requirement of the ISO standard and were able to convince a number of ISO National Bodies to vote against it.[citation needed] ISO officials realised they could not continue to support the standard in its current state and negotiated the unification of their standard with Unicode. Two changes took place: the lifting of the limitation upon characters (prohibition of control code values), thus opening code points for allocation; and the synchronisation of the repertoire of the Basic Multilingual Plane with that of Unicode.

Meanwhile, in the passage of time, the situation changed in the Unicode standard itself: 65,536 characters came to appear insufficient, and the standard from version 2.0 and onwards supports encoding of 1,112,064 code points from 17 planes by means of the UTF-16 surrogate mechanism. For that reason, ISO/IEC 10646 was limited to contain as many characters as could be encoded by UTF-16 and no more, that is, a little over a million characters instead of over 679 million. The UCS-4 encoding of ISO/IEC 10646 was incorporated into the Unicode standard with the limitation to the UTF-16 range and under the name UTF-32, although it has almost no use outside programs' internal data.

Rob Pike and Ken Thompson, the designers of the Plan 9 operating system, devised a new, fast and well-designed mixed-width encoding that was also backward-compatible with 7-bit ASCII, which came to be called UTF-8,[3] and is currently the most popular UCS encoding.

Differences from Unicode

[edit]

ISO/IEC 10646 and Unicode have an identical repertoire and numbers—the same characters with the same numbers exist on both standards, although Unicode releases new versions and adds new characters more often. Unicode has rules and specifications outside the scope of ISO/IEC 10646. ISO/IEC 10646 is a simple character map, an extension of previous standards like ISO/IEC 8859. In contrast, Unicode adds rules for collation, normalisation of forms, and the bidirectional algorithm for right-to-left scripts such as Arabic and Hebrew. For interoperability between platforms, especially if bidirectional scripts are used, it is not enough to support ISO/IEC 10646; Unicode must be implemented.

To support these rules and algorithms, Unicode adds many properties to each character in the set such as properties determining a character's default bidirectional class and properties to determine how the character combines with other characters. If the character represents a numeric value such as the European number '8', or the vulgar fraction '?', that numeric value is also added as a property of the character. Unicode intends these properties to support interoperable text handling with a mixture of languages.

Some applications support ISO/IEC 10646 characters but do not fully support Unicode. One such application, Xterm, can properly display all ISO/IEC 10646 characters that have a one-to-one character-to-glyph mapping[clarification needed] and a single directionality. It can handle some combining marks by simple overstriking methods, but cannot display Hebrew (bidirectional), Devanagari (one character to many glyphs) or Arabic (both features). Most GUI applications use standard OS text drawing routines which handle such scripts, although the applications themselves still do not always handle them correctly.

Citing the Universal Coded Character Set

[edit]

ISO/IEC 10646, a general, informal citation for the ISO/IEC 10646 family of standards, is acceptable in most prose. And even though it is a separate standard, the term Unicode is used just as often, informally, when discussing the UCS. However, any normative references to the UCS as a publication should cite the year of the edition in the form ISO/IEC 10646:{year}, for example: ISO/IEC 10646:2014.

Relationship with Unicode

[edit]

Since 1991, the Unicode Consortium and the ISO/IEC have developed The Unicode Standard ("Unicode") and ISO/IEC 10646 in tandem. The repertoire, character names, and code points of Unicode Version 2.0 exactly match those of ISO/IEC 10646-1:1993 with its first seven published amendments. After Unicode 3.0 was published in February 2000, corresponding new and updated characters entered the UCS via ISO/IEC 10646-1:2000. In 2003, parts 1 and 2 of ISO/IEC 10646 were combined into a single part, which has since had a number of amendments adding characters to the standard in approximate synchrony with the Unicode standard.

  • ISO/IEC 10646-1:1993 = Unicode 1.1
  • ISO/IEC 10646-1:1993 plus Amendments 5 to 7 = Unicode 2.0
  • ISO/IEC 10646-1:1993 plus Amendments 5 to 7 = Unicode 2.1 excluding Euro sign and Object Replacement Character, which are included in Amendment 18
  • ISO/IEC 10646-1:2000 = Unicode 3.0
  • ISO/IEC 10646-1:2000 and ISO/IEC 10646-2:2001 = Unicode 3.1
  • ISO/IEC 10646-1:2000 plus Amendment 1 and ISO/IEC 10646-2:2001 = Unicode 3.2
  • ISO/IEC 10646:2003 = Unicode 4.0
  • ISO/IEC 10646:2003 plus Amendment 1 = Unicode 4.1
  • ISO/IEC 10646:2003 plus Amendments 1 to 2 = Unicode 5.0 excluding Devanagari letters GGA, JJA, DDDA and BBA, which are included in Amendment 3
  • ISO/IEC 10646:2003 plus Amendments 1 to 4 = Unicode 5.1
  • ISO/IEC 10646:2003 plus Amendments 1 to 6 = Unicode 5.2
  • ISO/IEC 10646:2003 plus Amendments 1 to 8 = ISO/IEC 10646:2011 = Unicode 6.0 excluding Indian rupee sign
  • ISO/IEC 10646:2012 = Unicode 6.1
  • ISO/IEC 10646:2012 = Unicode 6.2 excluding Turkish lira sign, which is included in Amendment 1
  • ISO/IEC 10646:2012 = Unicode 6.3 excluding Turkish lira sign, which is included in Amendment 1, and five bidirectional control characters (Arabic Letter Mark, Left-To-Right Isolate, Right-To-Left Isolate, First Strong Isolate, Pop Directional Isolate), which are included in Amendment 2
  • ISO/IEC 10646:2012 plus Amendments 1 and 2 = Unicode 7.0 excluding the Ruble sign
  • ISO/IEC 10646:2014 plus Amendment 1 = Unicode 8.0 excluding the Lari sign, nine CJK unified ideographs, and 41 emoji characters
  • ISO/IEC 10646:2014 plus Amendments 1 and 2 = Unicode 9.0 excluding Adlam, Newa, Japanese TV symbols, and 74 emoji and symbols
  • ISO/IEC 10646:2017 = Unicode 10.0 excluding 285 Hentaigana characters, 3 Zanabazar Square characters, and 56 emoji symbols
  • ISO/IEC 10646:2017 plus Amendment 1 = Unicode 11.0 excluding 46 Mtavruli Georgian capital letters, 5 CJK unified ideographs, and 66 emoji characters
  • ISO/IEC 10646:2017 plus Amendments 1 and 2 = Unicode 12.0 excluding 62 additional characters
  • ISO/IEC 10646:2020 = Unicode 13.0
  • ISO/IEC 10646:2020 plus Amendments 1 = Unicode 15.0
  • ISO/IEC 10646:2020 plus Amendments 1 and 2 = Unicode 16.0

See also

[edit]

References

[edit]
  1. ^ Draft, Final Committee (2010). ISO/IEC International Standard ISO/IEC 10646 (PDF) (2nd ed.). Switzerland. p. 8.{{cite book}}: CS1 maint: location missing publisher (link)
  2. ^ "Universal Character Set - Acemap". ddescholar.acemap.info. Retrieved 2025-08-14.
  3. ^ Pike, Rob (2025-08-14). "UTF-8 history". Archived from the original on 2025-08-14.
[edit]
摩羯座女和什么座最配 丁丁是什么意思 上午9点是什么时辰 为什么叫汉族 树菠萝什么时候成熟
飞行员妻子有什么待遇 掉头发吃什么药最有效 桂林有什么好玩的 农历正月是什么星座 绿杨春属于什么茶
粟是什么农作物 舌根起泡是什么原因 小孩说梦话是什么原因引起的 肝低回声结节是什么意思 蛋疼是什么原因引起的
免疫固定电泳查什么的 紫癜是什么症状 人血白蛋白适合什么人 禁的拼音是什么 呆萌是什么意思
跟泰迪很像的狗叫什么hcv9jop0ns4r.cn 孙俪最新电视剧叫什么hcv9jop8ns1r.cn 粘米粉是什么粉hcv8jop1ns4r.cn 治肝病最好的药是什么药hcv8jop9ns7r.cn 辣椒油用什么能洗掉hcv7jop5ns1r.cn
跳蚤是什么样的图片hcv7jop9ns2r.cn 乙肝表面抗原阴性是什么意思sscsqa.com 女性夜尿多吃什么调理bjcbxg.com 人体最大的排毒器官是什么hcv8jop5ns1r.cn 阴虚内热是什么意思creativexi.com
专技十三级是什么意思hcv8jop7ns8r.cn 美国什么时候建国的hcv9jop5ns3r.cn 为什么邓超对鹿晗很好hcv7jop5ns1r.cn 83年猪是什么命hcv8jop4ns5r.cn 唇炎涂什么药膏hcv9jop1ns8r.cn
9.21是什么星座hcv8jop8ns2r.cn 尿隐血3十是什么病hcv8jop8ns8r.cn 玩票是什么意思hcv9jop6ns9r.cn 肠胃不好能吃什么水果hcv9jop5ns7r.cn 术后可以吃什么水果hcv8jop8ns1r.cn
百度