Strange characters in the Collation Charts

Looking at the handy Collation Charts at
http://www.unicode.org/charts/collation/
I was surprised when I looked at the "Null" part there, which is
addressable as
http://www.unicode.org/charts/collation/chart_Null.html

Viewed on IE 6, it seems to contain characters like the euro sign,
some punctuation marks, and some letters in positions U+0080 through
U+009F. In Unicode, however, these positions are reserved for C1 control
characters, not assigned to printable characters as in windows-1252.

The source code contains character references like &#128;, which are
undefined according to HTML specifications. It seems that the chart has
been generated programmatically without the special handling that these
code points require. I'm afraid people might get rather confused by this.
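
As a rough illustration of the kind of check a chart generator could run,
here is a minimal Python sketch that scans HTML source for numeric
character references in the range U+0080 through U+009F and reports what a
windows-1252 interpretation would display instead. The function name
find_c1_references and the sample string are hypothetical; this is not the
code that produced the charts, only one way such references might be found.

```python
import re

# Matches decimal (&#128;) and hexadecimal (&#x80;) numeric character references.
NUMERIC_REF = re.compile(r"&#(?:[xX]([0-9A-Fa-f]+)|([0-9]+));")


def find_c1_references(html_source):
    """Return (reference, code point, windows-1252 character or None) tuples.

    Hypothetical helper: only references whose code point falls in the
    C1 control range U+0080..U+009F are reported.
    """
    findings = []
    for match in NUMERIC_REF.finditer(html_source):
        hex_digits, dec_digits = match.groups()
        code_point = int(hex_digits, 16) if hex_digits else int(dec_digits)
        if 0x80 <= code_point <= 0x9F:
            try:
                # What a windows-1252 interpretation of this value would show.
                shown_as = bytes([code_point]).decode("cp1252")
            except UnicodeDecodeError:
                shown_as = None  # this byte is unassigned in windows-1252
            findings.append((match.group(0), code_point, shown_as))
    return findings


if __name__ == "__main__":
    # Made-up sample input, not taken from the actual chart page.
    sample = "price: &#128;5, mark: &#x99;, letter: &#65;"
    for reference, code_point, shown_as in find_c1_references(sample):
        print(f"{reference} -> U+{code_point:04X}, windows-1252 shows {shown_as!r}")
```

A generator could use a check like this to flag such references before
publishing, and then either omit those code points or emit the intended
character's real Unicode code point (for example &#8364; for the euro sign).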

-- Jukka "Yucca" Korpela, http://www.cs.tut.fi/~jkorpela/



This archive was generated by hypermail 2.1.5: Thu Nov 17 2005 - 13:54:15 CST