Issue 12746: normalization is affected by unicode width (original) (raw)

Issue12746

Created on 2011-08-13 04:12 by benjamin.peterson, last changed 2022-04-11 14:57 by admin. This issue is now closed.

Messages (6)
msg142008 - (view) Author: Benjamin Peterson (benjamin.peterson) * (Python committer) Date: 2011-08-13 04:12
Narrow build: unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢") '𝔘𝔫𝔦𝔠𝔬𝔡𝔢' Wide build: >>> unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢") 'Unicode' Normalization needs to properly decode characters in the supplementary plane.
msg142105 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2011-08-15 08:28
See also #12737.
msg143039 - (view) Author: Guido van Rossum (gvanrossum) * (Python committer) Date: 2011-08-26 21:18
Yeah, we should fix this. At least in 3.3, but (without knowing what exactly is involved) I think backporting to 2.7 and 3.2 makes sense too.
msg144563 - (view) Author: Benjamin Peterson (benjamin.peterson) * (Python committer) Date: 2011-09-28 12:15
Not anymore, though. :)
msg144564 - (view) Author: Benjamin Peterson (benjamin.peterson) * (Python committer) Date: 2011-09-28 13:05
Almost, that is. The unicodedata module still needs to use the new PEP 393 API.
msg144610 - (view) Author: STINNER Victor (vstinner) * (Python committer) Date: 2011-09-29 18:06
Issue fixed in Python 3.3 by Martin, changesets 40cf7204ca2b and 50f008019213.
History
Date User Action Args
2022-04-11 14:57:20 admin set github: 56955
2011-09-29 18:06:25 vstinner set status: open -> closednosy: + loewismessages: + resolution: fixed
2011-09-28 13:05:39 benjamin.peterson set status: closed -> openresolution: fixed -> (no value)messages: +
2011-09-28 12:41:53 vstinner set versions: - Python 2.7, Python 3.2
2011-09-28 12:15:35 benjamin.peterson set status: open -> closedresolution: fixedmessages: +
2011-08-26 21🔞05 gvanrossum set nosy: + gvanrossummessages: +
2011-08-15 19:28:23 Arfrever set nosy: + Arfrever
2011-08-15 09:00:31 tchrist set nosy: + tchrist
2011-08-15 08:28:57 vstinner set messages: +
2011-08-13 11:55:39 pitrou set nosy: + lemburg, vstinner
2011-08-13 08:02:13 ezio.melotti set nosy: + ezio.melotticomponents: + Unicode
2011-08-13 04:12:00 benjamin.peterson create