Strange characters in the Collation Charts (original) (raw)
Next message: Neil Harris: "Re: Hebrew script in IDN (was Exemplar Characters)"
- Previous message: Michael Everson: "$100 laptop -- good news for disadvantaged regions (and scripts?)"
- Next in thread: Otto Stolz: "Re: Strange characters in the Collation Charts"
- Reply: Otto Stolz: "Re: Strange characters in the Collation Charts"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
- Mail actions: [ respond to this message ] [ mail a new topic ]
Looking at the handly Collation Charts at
http://www.unicode.org/charts/collation/
I was surprised when I looked at the "Null" part there, which is
addressable as
http://www.unicode.org/charts/collation/chart_Null.html
Viewed on IE 6, it seems to contain characters like the euro sign,
some punctuation marks, and some letters in positions U+0080 through
U+009F. Apparently, these positions are reserved for control characters
in Unicode, not assigned to some printable characters as in windows-1252.
The source code contains character references like €, which are
undefined according to HTML specifications. It seems that the chart has
been programmatically generated without handling some special cases
as they would need to be handled. I'm afraid people might get rather
confused with this.
-- Jukka "Yucca" Korpela, http://www.cs.tut.fi/~jkorpela/
- Next message: Neil Harris: "Re: Hebrew script in IDN (was Exemplar Characters)"
- Previous message: Michael Everson: "$100 laptop -- good news for disadvantaged regions (and scripts?)"
- Next in thread: Otto Stolz: "Re: Strange characters in the Collation Charts"
- Reply: Otto Stolz: "Re: Strange characters in the Collation Charts"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
- Mail actions: [ respond to this message ] [ mail a new topic ]
This archive was generated by hypermail 2.1.5: Thu Nov 17 2005 - 13:54:15 CST