[Python-Dev] Ill-defined encoding for CP875? (original) (raw)

Fredrik Lundh fredrik@pythonware.com
Sat, 12 May 2001 14:25:43 +0200


tim wrote:

If, e.g., you're relying on "the first" of a set of ambiguous reverse mappings winning the game, then iterating over decodingmap.items() in reverse sorted order would do the trick reliably.

reverse sorting makes sense to me. but the cp-files appear to be machine generated, so patching that python file won't help.

But I don't know whether the ambiguity in cp875 is a bug or an undocumented feature ...

a truly future-proof solution would be to specify exactly how to resolve every many-to-one mapping, for every font having that problem. but sorting them is clearly better than relying on implementation-dependent behaviour...

(is Jython using exactly the same hashing and dictionary algorithms as CPython? or does it work by accident also under Jython?)

Cheers /F