Enclosed Alphanumerics (original) (raw)

From Wikipedia, the free encyclopedia

Unicode block of typographical symbols

"Ⓤ" redirects here. For the kosher food label, see Orthodox Union.

"②" and "⑩" redirect here. For the Steinhaus-Moser number ② (mega) or ⑩ (megiston), see Steinhaus–Moser notation.

Unicode character block

Enclosed Alphanumerics
Range U+2460..U+24FF(160 code points)
Plane BMP
Scripts Common
Assigned 160 code points
Unused 0 reserved code points
Unicode version history
1.0.0 (1991) 139 (+139)
3.2 (2002) 159 (+20)
4.0 (2003) 160 (+1)
Unicode documentation
Code chartWeb page
Note: [1][2]

Enclosed Alphanumerics is a Unicode block of typographical symbols of an alphanumeric within a circle, a bracket or other not-closed enclosure, or ending in a full stop.

It is currently fully allocated. Within the Basic Multilingual Plane, a few additional enclosed numerals are in the Dingbats and the Enclosed CJK Letters and Months blocks. There is also a block with more of these characters in the Supplementary Multilingual Plane named Enclosed Alphanumeric Supplement (U+1F100–U+1F1FF), as of Unicode 6.0.

Many of these characters were originally intended for use as bullets for lists.[3] The parenthesized forms are historically based on typewriter approximations of the circled versions.[3] Although these roles have been supplanted by styles and other markup in "rich text" contexts, the characters are included in the Unicode standard "for interoperability with the legacy East Asian character sets and for the occasional text context where such symbols otherwise occur."[3] The Unicode Standard considers these characters to be distinct from characters which are similar in form but specialized in purpose, such as the circled C, P or R characters which are defined as copyright and trademark symbols or the circled a used for an at sign.[3]

A circled s (Ⓢ) was used in documents circa 1900 printed by German missionaries, especially the Basel Mission, in the Malayalam language to denote a ditto mark.[4]

Enclosed Alphanumerics[1]Official Unicode Consortium code chart (PDF)
0 1 2 3 4 5 6 7 8 9 A B C D E F
U+246x
U+247x
U+248x
U+249x
U+24Ax
U+24Bx
U+24Cx
U+24Dx
U+24Ex
U+24Fx
Notes 1.^ As of Unicode version 16.0

The Enclosed Alphanumerics block contains one emoji: U+24C2, the enclosed M used as a symbol for mask works.[5][6]

It defaults to a text presentation and has two standardized variants defined to specify text presentation (U+FE0E VS15) or emoji-style (U+FE0F VS16).[7]

Emoji variation sequences

U+ 24C2
base code point
base+VS15 (text) Ⓜ︎
base+VS16 (emoji) Ⓜ️

The following Unicode-related documents record the purpose and process of defining specific characters in the Enclosed Alphanumerics block:

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 2023-07-26.
  2. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2023-07-26.
  3. ^ a b c d The Unicode Standard, 6.0.1
  4. ^ Joseph Muliyil; M Krishnan (1904). "Contents". The New Malayalam Reader (in Malayalam). Mangalore: Basel Mission Book and Tract Repository. p. vii.
  5. ^ "UTR #51: Unicode Emoji". Unicode Consortium. 2023-09-05.
  6. ^ "UCD: Emoji Data for UTR #51". Unicode Consortium. 2023-02-01.
  7. ^ "UTS #51 Emoji Variation Sequences". The Unicode Consortium.