[Python-Dev] PyUnicodeObject / PyASCIIObject questions (original) (raw)

"Martin v. Löwis" martin at v.loewis.de
Wed Dec 14 01:01:40 CET 2011


Any chance of adding the rationale to the code?

I'm really short of time right now, so you need to find somebody else to make such a change.

I am willing to believe that requests for a wchart (or utf-8 or System Locale charset) representation are common enough to justify caching the data after the first request.

That's not the issue; the real issue is memory management.

But then why throw it away in the first place? Wouldn't programs that create unicode from wchart data also be the most likely to request wchart data back?

Perhaps. But are they likely to access the string they just created again at all? They know what's in it, so why look at it again?

In all other cases, (wstrlength == length), and wstr can be generated by widening the data without having to inspect it. Is it worth eliminating wstrlength (or even wstr) in those cases, or is that too much complexity?

It's too little saving.

What I'm asking is that (1) The other values be documented as reserved, rather than as illegal.

How is that different?

(2) The macros produce an error rather than silently corrupting data.

In debug mode, or release mode? -1 on release mode.

Regards, Martin



More information about the Python-Dev mailing list