Issue 12746: normalization is affected by unicode width (original) (raw)
Issue12746
Created on 2011-08-13 04:12 by benjamin.peterson, last changed 2022-04-11 14:57 by admin. This issue is now closed.
Messages (6) | ||
---|---|---|
msg142008 - (view) | Author: Benjamin Peterson (benjamin.peterson) * ![]() |
Date: 2011-08-13 04:12 |
Narrow build: unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢") '𝔘𝔫𝔦𝔠𝔬𝔡𝔢' Wide build: >>> unicodedata.normalize("NFKC", "𝔘𝔫𝔦𝔠𝔬𝔡𝔢") 'Unicode' Normalization needs to properly decode characters in the supplementary plane. | ||
msg142105 - (view) | Author: STINNER Victor (vstinner) * ![]() |
Date: 2011-08-15 08:28 |
See also #12737. | ||
msg143039 - (view) | Author: Guido van Rossum (gvanrossum) * ![]() |
Date: 2011-08-26 21:18 |
Yeah, we should fix this. At least in 3.3, but (without knowing what exactly is involved) I think backporting to 2.7 and 3.2 makes sense too. | ||
msg144563 - (view) | Author: Benjamin Peterson (benjamin.peterson) * ![]() |
Date: 2011-09-28 12:15 |
Not anymore, though. :) | ||
msg144564 - (view) | Author: Benjamin Peterson (benjamin.peterson) * ![]() |
Date: 2011-09-28 13:05 |
Almost, that is. The unicodedata module still needs to use the new PEP 393 API. | ||
msg144610 - (view) | Author: STINNER Victor (vstinner) * ![]() |
Date: 2011-09-29 18:06 |
Issue fixed in Python 3.3 by Martin, changesets 40cf7204ca2b and 50f008019213. |
History | |||
---|---|---|---|
Date | User | Action | Args |
2022-04-11 14:57:20 | admin | set | github: 56955 |
2011-09-29 18:06:25 | vstinner | set | status: open -> closednosy: + loewismessages: + resolution: fixed |
2011-09-28 13:05:39 | benjamin.peterson | set | status: closed -> openresolution: fixed -> (no value)messages: + |
2011-09-28 12:41:53 | vstinner | set | versions: - Python 2.7, Python 3.2 |
2011-09-28 12:15:35 | benjamin.peterson | set | status: open -> closedresolution: fixedmessages: + |
2011-08-26 21🔞05 | gvanrossum | set | nosy: + gvanrossummessages: + |
2011-08-15 19:28:23 | Arfrever | set | nosy: + Arfrever |
2011-08-15 09:00:31 | tchrist | set | nosy: + tchrist |
2011-08-15 08:28:57 | vstinner | set | messages: + |
2011-08-13 11:55:39 | pitrou | set | nosy: + lemburg, vstinner |
2011-08-13 08:02:13 | ezio.melotti | set | nosy: + ezio.melotticomponents: + Unicode |
2011-08-13 04:12:00 | benjamin.peterson | create |