Unihan Database Lookup (original) (raw)

Based on Unicode Version 16.0.0

About the Unihan Database Lookup Tool

The lookup interface on this page provides online access to property data in the Unicode Han (Unihan) database for individual ideographs via the “Lookup” button and text field above. Simply enter the four- or five-digit hexadecimal code point for the desired ideograph into the text field, or copy and paste the corresponding ideograph into it, then click the “Lookup” button. The resulting data set will contain various types of information available in the Unihan database, such as mappings to legacy encoding standards, references to dictionaries, meaning and reading information according to various authorities, links to other websites, and so on.

If you do not know the code point of the ideograph, or have no example of the ideograph to copy, the Search the Unihan Database page supports queries against several properties, such as those for ideograph readings. The following two indices are also available:

For access to the latest version of the data files that comprise the Unihan database, download and unzip the Unihan.zip file from the UCD (Unicode Character Database). The Unihan database and its properties are documented in UAX #38.

Unihan Code Charts and Indices

The Unihan Radical-Stroke (RS) indices, which are documented in the “Radical-Stroke Indices” subsection of Section 18.1, “Han,” of Chapter 18 of the Unicode Standard, are available online as the following three PDF files, the first of which is also available as “plain text” data file:

Code charts covering all of Unihan are available as PDF files linked from the Unicode 16.0 Character Code Charts page, along with other code charts.

Disclaimers

The Unihan database is provided as-is as a public service by Unicode, Inc. No claims are made as to its appropriateness for any particular purpose, and no warranties of any kind are expressed nor implied.


Access to Copyright and terms of use