[Python-Dev] Internal representation of strings and Micropython
Juraj Sukop juraj.sukop at gmail.com
Wed Jun 4 11:53:43 CEST 2014
On Wed, Jun 4, 2014 at 11:36 AM, Stephen J. Turnbull <stephen at xemacs.org> wrote:
> I think you really need to check what the applications are in detail. UTF-8 costs about 35% more storage for Japanese, and even more for Chinese, than does UTF-16.
"UTF-8 can be smaller even for Asian languages, e.g.: front page of Wikipedia Japan: 83 kB in UTF-8, 144 kB in UTF-16"
From http://www.lua.org/wshop12/Ierusalimschy.pdf (p. 12)
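
Both figures can hold at once: a kana or kanji in the BMP takes 3 bytes in UTF-8 and 2 in UTF-16, but a web page is mostly ASCII markup, which takes 1 byte in UTF-8 and 2 in UTF-16. A minimal sketch of that effect follows. The sample strings and the helper name `encoded_sizes` are made up for illustration; they are not measurements of the Wikipedia page cited above.

```python
def encoded_sizes(text):
    """Return (utf8_bytes, utf16_bytes) for text, with no BOM counted."""
    # "utf-16-le" is used instead of "utf-16" so that no 2-byte BOM is added.
    return len(text.encode("utf-8")), len(text.encode("utf-16-le"))


# Hypothetical sample: 8 characters of plain Japanese text, all in the BMP.
prose = "日本語のテキスト"

# The same text wrapped in ASCII-only markup, as on a typical web page.
markup = '<div class="article"><a href="/wiki/Main">' + prose + "</a></div>"

prose_sizes = encoded_sizes(prose)    # (24, 16): UTF-8 is larger for bare text
markup_sizes = encoded_sizes(markup)  # UTF-8 is smaller once markup dominates

if __name__ == "__main__":
    for label, (u8, u16) in (("prose", prose_sizes), ("markup", markup_sizes)):
        print(f"{label}: utf-8={u8} bytes, utf-16={u16} bytes")
```

Which encoding wins therefore depends on the ratio of ASCII to non-ASCII characters in the strings an application actually stores.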