[Python-Dev] bytes (original) (raw)

[Python-Dev] bytes / unicode

Toshio Kuratomi a.badger at gmail.com
Tue Jun 22 19:21:23 CEST 2010


On Tue, Jun 22, 2010 at 08:31:13PM +0900, Stephen J. Turnbull wrote:

Toshio Kuratomi writes: > unicode handling redesign. I'm stating my reading of the RFC not to defend > the use case Philip has, but because I think that the outlook that non-text > uris (before being percentencoded) are violations of the RFC

That's not what I'm saying. What I'm trying to point out is that manipulating a bytes object as an URI sort of presumes a lot about its encoding as text.

I think we're more or less in agreement now but here I'm not sure. What manipulations are you thinking about? Which stage of URI construction are you considering?

I've just taken a quick look at python3.1's urllib module and I see that there is a bit of confusion there. But it's not about unicode vs bytes but about whether a URI should be operated on at the real URI level or the data-that-makes-a-uri level.

Since many of the URIs we deal with are more or less textual, why not take advantage of that? Cool, so to summarize what I think we agree on:

-Toshio -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 198 bytes Desc: not available URL: <http://mail.python.org/pipermail/python-dev/attachments/20100622/a926e262/attachment.pgp>



More information about the Python-Dev mailing list