[Python-Dev] Unicode charmap decoders slow (original) (raw)
Hye-Shik Chang hyeshik at gmail.com
Thu Oct 6 13:33:11 CEST 2005
- Previous message: [Python-Dev] Unicode charmap decoders slow
- Next message: [Python-Dev] Unicode charmap decoders slow
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
On 10/6/05, M.-A. Lemburg <mal at egenix.com> wrote:
Hye-Shik Chang wrote: > (encoding, fastmap codec) > > % ./python Lib/timeit.py -s "s='a'531024; e='iso885910fc'; > u=unicode(s, e)" "u.encode(e)" > 1000 loops, best of 3: 536 usec per loop > > (encoding, utf-8 codec) > > % ./python Lib/timeit.py -s "s='a'531024; e='utf8'; u=unicode(s, > e)" "u.encode(e)" > 1000 loops, best of 3: 1.5 msec per loop
I wonder why the UTF-8 codec is slower than the fastmap codec in this case.
I guess that resizing made the difference. fastmap encoder doesn't resize the output buffer at all in the test case while UTF-8 encoder allocates 4531024 bytes and resizes it to 53*1024 bytes in the end.
Hye-Shik
- Previous message: [Python-Dev] Unicode charmap decoders slow
- Next message: [Python-Dev] Unicode charmap decoders slow
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]