[Python-Dev] Unicode charmap decoders slow (original) (raw)

"Martin v. Löwis" martin at v.loewis.de
Wed Oct 5 20:40:04 CEST 2005


Walter Dörwald wrote:

OK, here's a patch that implements this enhancement to PyUnicodeDecodeCharmap(): http://www.python.org/sf/1313939

Looks nice!

Creating the decodingmap as a string should probably be done by gencodec.py directly. This way the first import of the codec would be faster too.

Hmm. How would you represent the string in source code? As a Unicode literal? With \u escapes, or in a UTF-8 source file? Or as a UTF-8 string, with an explicit decode call?

I like the current dictionary style for being readable, as it also adds the Unicode character names into comments.

Regards, Martin



More information about the Python-Dev mailing list