Issue 13417: faster utf-8 decoding (original) (raw)

Issue13417

Created on 2011-11-16 22:49 by pitrou, last changed 2022-04-11 14:57 by admin. This issue is now closed.

Files
File name	Uploaded	Description	Edit
utf8lib2.patch	pitrou,2011-11-16 22:49	review

Messages (4)
msg147778 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2011-11-16 22:49
PEP 393 and the need for a two-pass decoding process has made utf-8 decoding much slower, especially with the current generic implementation. Attached patch makes utf-8 more than twice faster, which means we're around 10-20% slower than 3.2 on non-trivial cases.
msg147926 - (view)	Author: Gregory P. Smith (gregory.p.smith) *	Date: 2011-11-19 02:35
+1 nice! A couple minor comments on the code review.
msg148076 - (view)	Author: Roundup Robot (python-dev)	Date: 2011-11-21 19:44
New changeset 8e6c4acaf530 by Antoine Pitrou in branch 'default': Issue #13417: speed up utf-8 decoding by around 2x for the non-fully-ASCII case. http://hg.python.org/cpython/rev/8e6c4acaf530
msg148078 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2011-11-21 19:48
Thanks for the review, patch committed now (with bogus comments removed).

History
Date	User	Action	Args
2022-04-11 14:57:23	admin	set	github: 57626
2011-11-21 19:48:56	pitrou	set	status: open -> closedresolution: fixedmessages: + stage: patch review -> resolved
2011-11-21 19:44:19	python-dev	set	nosy: + python-devmessages: +
2011-11-19 02:35:52	gregory.p.smith	set	nosy: + gregory.p.smithmessages: +
2011-11-16 22:49:41	pitrou	create