Unicode 3.0.0 (original) (raw)

3.0.0 Front Matter
Title Page and Colophon
Acknowledgements
Preface
3.0.0 Chapters
1 Introduction (HTML version)
2 General Structure
3 Conformance
4 Character Properties
5 Implementation Guidelines
6 Punctuation
7 European Alphabetic Scripts
8 Middle Eastern Scripts
9 South and Southeast Asian Scripts
10 East Asian Scripts
11 Additional Scripts
12 Symbols
13 Special Areas and Format Characters
14 Code Charts Introductory Text
Code Charts
Code Charts (Latest)
Code Charts (3.0.0, not online)
Han Radical-Stroke Index
15 Han Indices (Introductory Text)
15.1 Han Radical-Stroke Index (hard copy only)
15.2 Shift-JIS Index (hard copy only)
Interactive Han Radical-Stroke Index (Latest)
3.0.0 Appendices and Back Matter
A Han Unification History
B Submitting New Characters
C Relationship to ISO/IEC 10646
D Changes from Unicode Version 2.0
G Glossary
R References
I I.1Unicode Names Index I.2 General Index
3.0.0 Unicode Technical Reports
UTR #9: The Bidirectional Algorithm
UTR #11: East Asian Width
UTR #13: Unicode Newline Guidelines
UTR #14: Line Breaking Properties
UTR #15: Unicode Normalization Forms
3.0.0 UCD
3.0.0 (files) (about)
Related Links
About Versions
Latest Version
Archive of Unicode Versions
The Unicode Standard
Unicode Character Database
Technical Reports
Updates and Errata

Version 3.0.0 has been superseded by thelatest version of the Unicode Standard.

The Unicode Standard, Version 3.0 Version 3.0.0 of the Unicode Standard consists of the core specification, The Unicode Standard, Version 3.0, the code charts for this version (currently only available in hard copy), five Unicode Technical Reports, and the 3.0 Update of the Unicode Character Database (UCD). The core specification gives the general principles, requirements for conformance, and guidelines for implementers. The code charts show representative glyphs for all the Unicode characters. The Unicode Technical Reports supply detailed information about particular aspects of the standard. The Unicode Character Database supplies normative and informative data for implementers to allow them to implement the Unicode Standard.

A complete specification of the contributory files for Unicode 3.0.0 is found on the page Components for 3.0.0. That page also provides the recommended reference format for this version of the Unicode Standard.


Online Edition

The text of The Unicode Standard, Version 3.0 (ISBN 0-201-61633-5) is available online via the navigation links on this page, with the exception of the code charts and the Han radical-stroke indices. A slightly modified HTML version of Chapter 1 has also been provided. Printing from the PDF files has been disabled. Normative references to the Unicode Standard, Version 3.0 should use the printed edition.

Overview

Unicode 3.0.0 is a major version of the Unicode Standard and supersedes all previous versions. This page summarizes the important changes for the Unicode Standard, Version 3.0.0. In the discussion below, shortened references to "Unicode 3.0" or "Version 3.0" specifically refer to Version 3.0.0.

The core specification, The Unicode Standard, Version 3.0 contains descriptions and properties for many new characters. It is synchronized with ISO/IEC 10646-1 second edition. The text of the standard has been extensively rewritten to improve its structure and clarity.

Unicode 3.0 also includes enhanced implementation guidelines, and has been reorganized to describe related scripts within separate chapters. In addition to new characters, there are significant clarifications or modifications to character semantics from Unicode 2.0 to Unicode 3.0.

The vast majority of implementations of earlier versions will be conformant to Unicode 3.0.0 once the character properties for their supported characters are updated to Version 3.0.0 of the Unicode Character Database.

The most significant additions to the standard include the following:

New Characters

The new characters added to Unicode 3.0 are summarized in the following table:

Unicode 3.0 Summary

Category V 2.1 V 3.0
Alphabetics, Symbols 6511 10236
CJK Ideographs 21204 27786
Hangul Syllables 11172 11172
Total assigned characters 38887 49194
Private Use 6400 6400
Surrogates 2048 2048
Controls 65 65
Not Characters 2 2
Total assigned 16-bit code values 47402 57709
Unassigned 16-bit code values 18134 7827

Besides adding characters to existing blocks, Unicode 3.0 adds a number of new blocks, listed below, and including the number of code points allocated to each block. For a list of all the blocks in Unicode 3.0, see Blocks.txt

New Blocks

Number Block Name
80 Syriac
192 Thaana
128 Sinhala
160 Myanmar
384 Ethiopic
96 Cherokee
640 Unified Canadian Aboriginal Syllabics
32 Ogham
96 Runic
128 Khmer
176 Mongolian
256 Braille Patterns
128 CJK Radicals Supplement
224 Kangxi Radicals
16 Ideographic Description Characters
32 Bopomofo Extended
6582 CJK Unified Ideographs Extension A
1168 Yi Syllables
64 Yi Radicals

Conformance Changes

Conformance clauses, definitions, and explanatory text were added for handling Unicode Transformation Formats. The Unicode Bidirectional Behavior algorithm rules were clarified and expanded, and new bidirectional character properties were documented. Other normative character property values were changed; see the Unicode character database file for more information.

Unicode Technical Reports

The following technical reports are approved and considered part of the Unicode Standard, Version 3.0. These reports may contain either normative or informative material, or both. Any reference to version 3.0 of the standard automatically includes these technical reports.


Access to Copyright and terms of use