Code page 932 (IBM)

From Wikipedia, the free encyclopedia

Computer character set for Japanese

IBM code page 932 (abbreviated as IBM-932[1] or ambiguously as CP932) is one of IBM's extensions of Shift JIS. The coded character sets are JIS X 0201:1976, JIS X 0208:1983,[1] IBM extensions and IBM extensions for IBM 1880 UDC. It is the combination of the single-byte Code page 897 and the double-byte Code page 301.[2] Code page 301 is designed to encode the same repertoire as IBM Japanese DBCS-Host.[3]

IBM-932 resembles IBM-943. One difference is that IBM-932 encodes the JIS X 0208:1983 characters but preserves the 1978 ordering, whereas IBM-943 uses the 1983 ordering[1] (i.e. the character variant swaps made in JIS X 0208:1983). Another difference is that IBM-932 does not incorporate the NEC selected extensions, which IBM-943 includes for Microsoft compatibility.[1]

IBM-942 includes the same double-byte codes as IBM-932 (those from Code page 301) but includes additional single-byte extensions. International Components for Unicode treats "ibm-932" and "ibm-942" as aliases for the same decoder.[4]

IBM-932 retains the 7-bit ISO 646 codes in the range 0x00 to 0x7F, and Japanese characters are indicated by the high bit of the first byte being set to 1. Some of these first bytes require a second byte, so each character is encoded in either 8 or 16 bits.
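The one-or-two-byte structure described above can be sketched in Python. This is a minimal illustration, not a full IBM-932 decoder: the function names are hypothetical, the lead-byte ranges 0x81 to 0x9F and 0xE0 to 0xFC are the generic Shift JIS ranges and are assumed here to apply to IBM-932, and the single-byte mapping follows the first-byte table (¥ at 0x5C, ‾ at 0x7E, half-width katakana at 0xA1 to 0xDF). Double-byte units are only split off, not decoded, because that would need the Code page 301 mapping table.

```python
from typing import List


def is_lead_byte(b: int) -> bool:
    # Assumption: generic Shift JIS lead-byte ranges.
    return 0x81 <= b <= 0x9F or 0xE0 <= b <= 0xFC


def split_units(data: bytes) -> List[bytes]:
    """Split a byte string into 1-byte and 2-byte character units."""
    units = []
    i = 0
    while i < len(data):
        if is_lead_byte(data[i]):
            if i + 1 >= len(data):
                raise ValueError(f"truncated double-byte character at offset {i}")
            units.append(data[i:i + 2])
            i += 2
        else:
            units.append(data[i:i + 1])
            i += 1
    return units


def decode_single(b: int) -> str:
    """Decode one single-byte code according to the first-byte table."""
    if b == 0x5C:
        return "\u00a5"  # YEN SIGN replaces the backslash
    if b == 0x7E:
        return "\u203e"  # OVERLINE replaces the tilde
    if b < 0x80:
        return chr(b)    # remaining 7-bit codes are unchanged
    if 0xA1 <= b <= 0xDF:
        # Half-width katakana block, U+FF61 to U+FF9F
        return chr(0xFF61 + (b - 0xA1))
    raise ValueError(f"0x{b:02X} is not a single-byte character")
```

For example, `split_units(b"A\xb1\x88\x9fB")` separates the ASCII letters, one half-width katakana byte, and one double-byte pair. Python's built-in `cp932` codec implements Microsoft's variant rather than IBM-932, so it is deliberately not used here.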

First byte

    0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
0   ␀  ␁  ␂  ␃  ␄  ␅  ␆  ␇  ␈  ␉  ␊  ␋  ␌  ␍  ␎  ␏
1   ␐  ␑  ␒  ␓  ␔  ␕  ␖  ␗  ␘  ␙  ␚  ␛  ␜  ␝  ␞  ␟
2   ␠  !  "  #  $  %  &  '  (  )  *  +  ,  -  .  /
3   0  1  2  3  4  5  6  7  8  9  :  ;  <  =  >  ?
4   @  A  B  C  D  E  F  G  H  I  J  K  L  M  N  O
5   P  Q  R  S  T  U  V  W  X  Y  Z  [  ¥  ]  ^  _
6   `  a  b  c  d  e  f  g  h  i  j  k  l  m  n  o
7   p  q  r  s  t  u  v  w  x  y  z  {  |  }  ‾  ␡
8
9
A      。 「 」 、 ・ ヲ ァ ィ ゥ ェ ォ ャ ュ ョ ッ
B   ー ア イ ウ エ オ カ キ ク ケ コ サ シ ス セ ソ
C   タ チ ツ テ ト ナ ニ ヌ ネ ノ ハ ヒ フ ヘ ホ マ
D   ミ ム メ モ ヤ ユ ヨ ラ リ ル レ ロ ワ ン ゙ ゚
E
F

Second byte (rows 0 to F by columns 0 to F; the cells carry no glyphs, only the categories listed in the legend)

Legend:
  Non-printable ASCII character
  Unaltered ASCII character
  Modified ASCII character
  Single-byte half-width katakana
  First byte of a double-byte character, used by JIS X 0208
  Not used as first byte, unallocated space in JIS X 0208
  First byte of a double-byte IBM extension character
  First byte of a double-byte IBM-designated user-defined character
  Not used as first byte
  Second byte of a double-byte character whose first half of the JIS sequence was odd
  Second byte of a double-byte character whose first half of the JIS sequence was even
  Unused as second byte of a double-byte character
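The legend's distinction between second bytes for an odd and an even first half of the JIS sequence comes from the way Shift JIS packs two JIS X 0208 rows under each first byte. The sketch below shows that arithmetic under stated assumptions: it uses the generic Shift JIS formula, the function name `sjis_to_kuten` is hypothetical, and it yields only the row and cell position. Because IBM-932 keeps the 1978 ordering, the character found at a given position may differ from other Shift JIS variants for the swapped pairs.

```python
from typing import Tuple


def sjis_to_kuten(lead: int, trail: int) -> Tuple[int, int]:
    """Map a double-byte code to a JIS X 0208 (row, cell) pair, both 1 to 94."""
    # Only lead bytes that fall inside the 94 rows of JIS X 0208 are handled;
    # 0xF0 and above (user-defined and IBM extension areas) are rejected.
    if not (0x81 <= lead <= 0x9F or 0xE0 <= lead <= 0xEF):
        raise ValueError(f"0x{lead:02X} is not a JIS X 0208 lead byte")
    if not (0x40 <= trail <= 0xFC) or trail == 0x7F:
        raise ValueError(f"0x{trail:02X} is not a valid second byte")

    # Each lead byte covers one pair of rows: an odd row and the next even row.
    pair = lead - 0x81 if lead <= 0x9F else lead - 0xC1

    if trail >= 0x9F:
        # Second bytes 0x9F to 0xFC select the even row of the pair.
        return 2 * pair + 2, trail - 0x9E

    # Second bytes 0x40 to 0x7E and 0x80 to 0x9E select the odd row;
    # the gap at 0x7F shifts the upper half down by one.
    cell = trail - 0x3F if trail < 0x7F else trail - 0x40
    return 2 * pair + 1, cell
```

With this arithmetic, the pair 0x81 0x40 lands on row 1, cell 1, while 0x88 0x9F lands on row 16, cell 1, the first cell of an even row.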
  1. "IBM-943 and IBM-932". IBM Knowledge Center. IBM.
  2. "Coded character set identifiers - CCSID 932". IBM Globalization. IBM. Archived from the original on 2016-03-27.
  3. Lunde, Ken (2009). "Appendix E: Vendor Character Set Standards" (PDF). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. ISBN 978-0-596-51447-1.
  4. "Converter Explorer: ibm-942_P12A-1999". ICU Demonstration. International Components for Unicode.