[Python-Dev] str.translate vs unicode.translate (was: Re: str object going in Py3K)

Bengt Richter bokr at oz.net
Fri Feb 17 03:25:25 CET 2006


If str becomes unicode for Py3K, and we then have bytes as our encoding-agnostic byte type, then I think bytes should have the str translate method, with a tweak that I would hope could also be done to str now.

BTW, str.translate will presumably become unicode.translate, so perhaps unicode.translate should grow a compatible deletechars parameter.

But that's not the tweak. The tweak is to eliminate the currently unavoidable pre-conversion to unicode in str(something).translate(u'...', delchars) (and, preemptively, in bytes(something).translate(u'...', delchars)).

E.g. suppose you now want to write:

s_str.translate(table, delch).encode('utf-8')

Note that s_str has no encoding information, and translate is conceptually just a 1:1 substitution, minus the characters in delch. But if we want to do a one-chr:one-unichr substitution by specifying a 256-entry table of unicode characters, we cannot. It would be simple to allow it, and that's the tweak I would like. It would allow easy custom decodes.
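The proposed semantics can be sketched in today's Python 3, using bytes to stand in for the byte-oriented str of Python 2; the helper name and sample data here are invented for illustration:

```python
# Sketch of the proposed tweak: map each byte through a 256-entry
# unicode table (a custom one-byte:one-character decode), skipping
# any bytes listed in delchars.

def translate_to_unicode(data, table, delchars=b''):
    """data: bytes; table: 256-character str; delchars: bytes to drop."""
    assert len(table) == 256
    return ''.join(table[b] for b in data if b not in delchars)

# A toy "decode": treat each byte as the same-ord character
# (a latin-1-style identity table).
identity = ''.join(chr(i) for i in range(256))
print(translate_to_unicode(b'sch\xf6n!', identity, delchars=b'!'))  # schön
```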

At the moment, if you want to write the above, you have to introduce a phony latin-1 decoding and write it as (not typo-proof)

s_str.translate(table, delch).decode('latin-1').encode('utf-8')     # use str.translate

or s_str.decode('latin-1').translate(mapping).encode('utf-8') # use unicode.translate also for delch

to avoid exceptions if you have non-ASCII bytes in your s_str (even if delch would have removed them!).
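For comparison, the "phony latin-1" workaround can be recast in Python 3, where bytes.translate already accepts a 256-byte table plus a delete argument; the swap table and sample data below are invented:

```python
# The latin-1 round-trip workaround, Python 3 spelling.
table = bytes.maketrans(b'ab', b'ba')          # swap 'a' and 'b'
s_str = b'abc\xf6'                             # contains a non-ASCII byte
out = s_str.translate(table, b'c').decode('latin-1').encode('utf-8')
print(out)  # b'ba\xc3\xb6'
```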

It seems s_str.translate(table, delchars) wants to convert s_str to unicode if table is unicode, and then use unicode.translate (which bombs on delchars!) instead of just effectively defining str.translate as

def translate(self, table, deletechars=None):
    return ''.join((table or isinstance(table, unicode) and uidentity or sidentity)[ord(x)]
                   for x in self
                   if not deletechars or x not in deletechars)

# For convenience in just pruning with deletechars, s_str.translate('', deletechars) deletes without translating,
# and s_str.translate(u'', deletechars) does the same and then maps to same-ord unicode characters,
# given
#     sidentity = ''.join(chr(i) for i in xrange(256))
# and
#     uidentity = u''.join(unichr(i) for i in xrange(256)).
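The sketch above is Python 2; a runnable Python 3 rendering, with bytes in place of str and str in place of unicode (identity tables as in the comments above), might look like this:

```python
# Python 3 rendering of the translate() sketch: bytes plays the role
# of Python 2's str, and str plays the role of unicode.
sidentity = bytes(range(256))                    # byte identity table
uidentity = ''.join(chr(i) for i in range(256))  # same-ord unicode table

def translate(data, table, deletechars=None):
    """data: bytes; table: 256-byte bytes, 256-char str, or empty."""
    if not table:
        # Empty str table means "prune, then map to same-ord unicode";
        # empty bytes table means "prune only".
        table = uidentity if isinstance(table, str) else sidentity
    kept = (table[b] for b in data
            if not deletechars or b not in deletechars)
    if isinstance(table, str):
        return ''.join(kept)
    return bytes(kept)

print(translate(b'abc', b'', b'b'))   # prune only: b'ac'
print(translate(b'abc', '', b'b'))    # prune and map to unicode: 'ac'
```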

IMO, if you want unicode.translate, then it doesn't hurt to write unicode(s_str).translate and use that.

Let str.translate just use the str ords, so simple custom decodes can be written without the annoyance of e.g.,

UnicodeDecodeError: 'ascii' codec can't decode byte 0xf6 in position 3: ordinal not in range(128)

Can we change this for bytes? And why couldn't we change this for str.translate now? Or what am I missing? I certainly would like to miss the above message for str.translate :-(
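The failure mode above can be reproduced explicitly; shown here in Python 3, where Python 2's implicit ASCII coercion has to be spelled out (sample bytes chosen so the offending byte sits at position 3, matching the message):

```python
# Reproduce the error: the ASCII codec cannot decode byte 0xf6.
try:
    b'str\xf6m'.decode('ascii')          # byte 0xf6 at position 3
except UnicodeDecodeError as e:
    print(e)
```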

BTW, this would also allow taking advantage of features of both translates if desired, e.g. by s_str.translate(unichartable256, strdelchrs).translate(uniord_to_ustr_or_uniord_mapping). (E.g., the latter permits single-to-multiple-character substitution.)
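The chaining can be demonstrated with Python 3's two translates; the table names echo the ones above, and the data and mappings are invented:

```python
# Chain both kinds of translate: a 256-entry byte pass with deletion
# first, then unicode.translate's mapping form, which also permits
# one-to-many substitution.
unichartable256 = ''.join(chr(i) for i in range(256))      # identity decode
step1 = b'abc\xf6'.translate(bytes(range(256)), b'c')      # drop b'c'
step2 = ''.join(unichartable256[b] for b in step1)         # bytes -> unicode
result = step2.translate({ord('a'): 'aa', 0xf6: 'oe'})     # one-to-many
print(result)  # aaboe
```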

I think at least a tweaked translate method for bytes would be good for Py3K, and I hope we can do it for str.translate now. It is just too handy a high-speed conversion goodie to forgo, IMO.

Regards, Bengt Richter


