[Python-Dev] PEP 383: Non-decodable Bytes in System Character Interfaces (original) (raw)

"Martin v. Löwis" martin at v.loewis.de
Thu Apr 30 21:10:37 CEST 2009


MRAB wrote:

One further question: should the encoder accept a string like u'\xDCC2\xDC80'? That would encode to b'\xC2\x80'

Indeed so.

which, when decoded, would give u'\x80'.

Assuming the encoding is UTF-8, yes.

Does the PEP only guarantee that strings decoded from the filesystem are reversible, but not check what might be de novo strings?

Exactly so.

Regards, Martin



More information about the Python-Dev mailing list