[Python-Dev] Regression in unicodestr.encode()? (original) (raw)

Martin v. Loewis martin@v.loewis.de
10 Apr 2002 21:03:06 +0200

Previous message: [Python-Dev] Regression in unicodestr.encode()?
Next message: [Python-Dev] Regression in unicodestr.encode()?
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

"M.-A. Lemburg" <mal@lemburg.com> writes:

Some debugging with gdb indicates that the codec is indeed writing the 'nd', but the final PyStringResize() (which allocates a new buffer and copies the data into that buffer) fails to copy the last two characters from the string or overwrites it with NULLs.

Looks like a pymalloc problem to me. Tim ?

It's a UTF-8 codec bug. The codec writes over the end of the buffer, then invokes resize. Resizing only copies the allocated bytes, hence the uninitialized bytes at the end.

Regards, Martin

Previous message: [Python-Dev] Regression in unicodestr.encode()?
Next message: [Python-Dev] Regression in unicodestr.encode()?
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]