Working With Unicode Surrogates (Windows CE 5.0) (original) (raw)

Send Feedback

The Unicode Standard defines surrogates: a "surrogate" or "surrogate pair" is a pair of 16-bit Unicode code values that represent a single character. Surrogates provide additional character support for the languages that need more than the 65,536 characters in the 16-bit Unicode code space. For example, the Chinese speaking community alone uses over 55,000 characters.

Planes two and three defined in ISO/IEC 10646 are reserved for ideographic characters. These planes fall in the high surrogate range of U+D840 to U+D8BF. The first (high) surrogate is a 16-bit code value in the same range, U+D800 to U+DBFF. The second (low) surrogate is a 16-bit code value in the range U+DC00 to U+DFFF. Using surrogates, Unicode can support over one million characters.

Windows CE provides Unicode surrogate support at the OS level and in Microsoft® Internet Explorer. The support is limited to surrogate handling and display; editing is not supported.

The following list shows the ways Windows CE supports surrogates:

For additional information about font standards needed for surrogates, see The OpenType Specification. The specification is available in HTML format for viewing online at this Microsoft Web site.

See Also

Working with Surrogate Pairs | Understanding the Unicode Standard

Send Feedback on this topic to the authors

Feedback FAQs

© 2006 Microsoft Corporation. All rights reserved.