[Python-Dev] (Not) delaying the 3.2 release (original) (raw)
Martin (gzlist) gzlist at googlemail.com
Thu Sep 16 19:46:22 CEST 2010
- Previous message: [Python-Dev] (Not) delaying the 3.2 release
- Next message: [Python-Dev] (Not) delaying the 3.2 release
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
On 16/09/2010, Guido van Rossum <guido at python.org> wrote:
In all cases I can imagine where such polymorphic functions make sense, the necessary and sufficient assumption should be that the encoding is a superset of 7-bit(*) ASCII. This includes UTF-8, all Latin-N variant, and AFAIK also the popular CJK encodings other than UTF-16. This is the same assumption made by Python's byte type when you use "character-based" methods like lower().
Well, depends on what exactly you're doing, it's pretty easy to go wrong:
Python 3.2a2+ (py3k, Sep 16 2010, 18:43:45) [MSC v.1500 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information.
import os, sys os.path.split("C:\十") ('C:\', '十') os.path.split("C:\十".encode(sys.getfilesystemencoding())) (b'C:\\x8f', b'')
Similar things can catch out web developers once they step outside the percent encoding.
Martin
- Previous message: [Python-Dev] (Not) delaying the 3.2 release
- Next message: [Python-Dev] (Not) delaying the 3.2 release
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]