[Python-Dev] [Python-checkins] cpython: Document requierements of Unicode kinds
Terry Reedy tjreedy at udel.edu
Wed Oct 5 21:25:22 CEST 2011
On 10/5/2011 1:43 PM, victor.stinner wrote:
> http://hg.python.org/cpython/rev/055174308822
> changeset:   72699:055174308822
> user:        Victor Stinner<victor.stinner at haypocalc.com>
> date:        Wed Oct 05 01:31:05 2011 +0200
> summary:
>   Document requierements of Unicode kinds
>
> files:
>   Include/unicodeobject.h |  24 ++++++++++++++++++++----
>   1 files changed, 20 insertions(+), 4 deletions(-)
>
> diff --git a/Include/unicodeobject.h b/Include/unicodeobject.h
> --- a/Include/unicodeobject.h
> +++ b/Include/unicodeobject.h
> @@ -288,10 +288,26 @@
>          unsigned int interned:2;
>          /* Character size:
> -           PyUnicode_WCHAR_KIND (0): wchar_t*
> -           PyUnicode_1BYTE_KIND (1): Py_UCS1*
> -           PyUnicode_2BYTE_KIND (2): Py_UCS2*
> -           PyUnicode_4BYTE_KIND (3): Py_UCS4*
> +           - PyUnicode_WCHAR_KIND (0):
> +
> +             * character type = wchar_t (16 or 32 bits, depending on the
> +               platform)
> +
> +           - PyUnicode_1BYTE_KIND (1):
> +
> +             * character type = Py_UCS1 (8 bits, unsigned)
> +             * if ascii is 1, at least one character must be in range
> +               U+80-U+FF, otherwise all characters must be in range U+00-U+7F
Given that 1==True, this looks backwards.
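A minimal sketch of the invariant this remark implies, assuming the intended meaning is that ascii == 1 marks a string whose characters all lie in U+00-U+7F. The helper name `ascii_flag_consistent` is hypothetical and is not part of CPython:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not a CPython API: returns 1 when the `ascii` flag
   agrees with the contents of a 1-byte buffer, under the assumed reading
   that ascii == 1 means "every character is in U+00-U+7F". */
static int ascii_flag_consistent(const uint8_t *data, size_t len, int ascii)
{
    int has_high = 0;   /* becomes 1 if any character is in U+80-U+FF */
    size_t i;

    assert(data != NULL || len == 0);
    for (i = 0; i < len; i++) {
        if (data[i] > 0x7F) {
            has_high = 1;
            break;
        }
    }
    /* ascii == 1 requires no high character; ascii == 0 requires one. */
    return ascii ? !has_high : has_high;
}
```

Under this reading, a buffer holding only "abc" is consistent with ascii == 1, while a buffer containing byte 0xE9 is consistent only with ascii == 0, which is the opposite of what the quoted comment states.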
> +
> +           - PyUnicode_2BYTE_KIND (2):
> +
> +             * character type = Py_UCS2 (16 bits, unsigned)
> +             * at least one character must be in range U+0100-U+1FFFF
/U+1FFFF/U+FFFF/ ?
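The reason for the question can be sketched as follows: a 16-bit unsigned unit holds at most 0xFFFF, so a code point such as U+1FFFF cannot fit in the 2-byte kind. The enum and the function `narrowest_kind` are hypothetical illustrations that only mirror the numeric kind values quoted in the diff; they are not the CPython definitions:

```c
#include <assert.h>
#include <stdint.h>

/* Illustrative constants mirroring the numeric values in the quoted diff;
   the names are hypothetical, not the CPython identifiers. */
enum sketch_kind { SKETCH_1BYTE = 1, SKETCH_2BYTE = 2, SKETCH_4BYTE = 3 };

/* Pick the narrowest kind whose character type can store max_char,
   the largest code point in the string. */
static enum sketch_kind narrowest_kind(uint32_t max_char)
{
    assert(max_char <= 0x10FFFF);   /* upper limit of the Unicode code space */
    if (max_char <= 0xFF)
        return SKETCH_1BYTE;        /* fits in 8 bits */
    if (max_char <= 0xFFFF)
        return SKETCH_2BYTE;        /* fits in 16 bits */
    return SKETCH_4BYTE;            /* needs more than 16 bits */
}
```

With this selection rule, U+FFFF is the largest code point that still maps to the 2-byte kind, and U+10000 through U+1FFFF already require the 4-byte kind.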
Terry