[Python-Dev] Internal representation of strings and Micropython

Serhiy Storchaka storchaka at gmail.com
Thu Jun 5 09:54:03 CEST 2014


04.06.14 23:50, Glenn Linderman wrote:

3) (Most space efficient) One cached entry, that caches the last codepoint/byte position referenced. UTF-8 is able to be traversed in either direction, so "next/previous" codepoint access would be relatively fast (and such are very common operations, even when indexing notation is used: "for ix in range( len( strx )): func( strx[ ix ])".)

Great idea! It should cover most real-world cases. Note that we can scan a UTF-8 string both left-to-right and right-to-left.
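To make the idea concrete, here is a rough Python sketch of a one-entry cache. It is only an illustration, not MicroPython's or CPython's code: the class name CachedUtf8Index and its attributes are invented for this example, and it assumes the buffer holds valid UTF-8. It remembers the last (codepoint index, byte offset) pair and walks forward or backward from it, skipping continuation bytes (those of the form 10xxxxxx).

```python
class CachedUtf8Index:
    """Hypothetical helper: index a UTF-8 buffer by code point,
    remembering only the last position looked up."""

    def __init__(self, data: bytes):
        self.data = data  # assumed to be valid UTF-8
        self.cp = 0       # code point index of the cached position
        self.pos = 0      # byte offset where that code point starts

    @staticmethod
    def _is_continuation(byte: int) -> bool:
        # Continuation bytes look like 0b10xxxxxx.
        return (byte & 0xC0) == 0x80

    def _seek(self, index: int) -> int:
        """Move the cache to code point `index`; return its byte offset."""
        if index < 0:
            raise IndexError("negative indexes are not handled in this sketch")
        data = self.data
        # Restart from the beginning when that is closer than the cache.
        if index < self.cp - index:
            self.cp, self.pos = 0, 0
        # Scan left-to-right.
        while self.cp < index:
            if self.pos >= len(data):
                raise IndexError("string index out of range")
            self.pos += 1
            while self.pos < len(data) and self._is_continuation(data[self.pos]):
                self.pos += 1
            self.cp += 1
        # Scan right-to-left.
        while self.cp > index:
            self.pos -= 1
            while self._is_continuation(data[self.pos]):
                self.pos -= 1
            self.cp -= 1
        if self.pos >= len(data):
            raise IndexError("string index out of range")
        return self.pos

    def __getitem__(self, index: int) -> str:
        start = self._seek(index)
        end = start + 1
        while end < len(self.data) and self._is_continuation(self.data[end]):
            end += 1
        return self.data[start:end].decode("utf-8")
```

With this, a loop such as "for ix in range(len(strx)): func(strx[ix])" moves the cache by one code point per access, so each lookup touches at most four bytes. A random access far from the cached position still costs a scan proportional to the distance.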


