If you pass a PyObject representation the unicode string u'\xc4' to PyString_AsString() then Python (2.3.4, 2.4.0) segfault. Famous last words of Python 2.4: Exception exceptions.UnicodeEncodeError: <exceptions.UnicodeEncodeError instance at 0xf6f75e8c> in 'garbage collection' ignored Fatal Python error: unexpected exception during garbage collection Famous last words of Python 2.4 (debug build): XXX undetected error Traceback (most recent call last): File "test.py", line 4, in ? print S.split(u'\xc4') UnicodeEncodeError: 'ascii' codec can't encode character u'\xe4' in position 0: ordinal not in range(128) [6545 refs] This bug has been reported first on the zope-dev list. I confirmed that this error does not only occur in Zope but also in a Python-only environment.
Logged In: YES user_id=73023 I can't reproduce this: >>> ick = u'\xc4' >>> import struct >>> struct.pack(ick) Traceback (most recent call last): File "", line 1, in ? UnicodeEncodeError: 'ascii' codec can't encode character u'\xc4' in position 0: ordinal not in range(128) >>> struct.pack(u'i', 1) '\x01\x00\x00\x00' The code that provked this in Zope is buggy.,
Logged In: YES user_id=31435 For the record, this appeared to be due to an extension module doing PyString_AsString(name)[0] when name was a Unicode string containing a "high-bit" character. PyString_AsString(name) legitimately returned NULL, and bad stuff followed.