[Python-Dev] Difference in RE between 3.2 and 3.3 (or Aaron Swartz memorial) (original) (raw)

MRAB python at mrabarnett.plus.com
Mon May 6 00:20:01 CEST 2013


On 05/05/2013 23:01, Matej Cepl wrote:

----- Original Message -----

From: "Armin Rigo" <arigo at tunes.org> To: "Matej Cepl" <mcepl at redhat.com> Cc: python-dev at python.org Sent: Saturday, May 4, 2013 11:59:42 AM Subject: Re: [Python-Dev] Difference in RE between 3.2 and 3.3 (or Aaron Swartz memorial)

Hi Matej, On Thu, Mar 7, 2013 at 11:08 AM, Matej Cepl <mcepl at redhat.com> wrote: > if c is not ' ' and c is not ' ': > if c != ' ' and c != ' ': Sorry for the delay in answering, but I just noticed what is wrong in this "fix": it compares c with the same single-character ' ' twice, whereas the original compared it with ' ' and with the two-character ' Comments on https://github.com/mcepl/html2text/commit/f511f3c78e60d7734d677f8945580f52ef7ef742#L0R765 (perhaps in https://github.com/aaronsw/html2text/pull/77) are more than welcome. When using SPACERE = re.compile(r'\s+') That will match a whitespace character followed by a '+'.

for checking, whole onlywhite function is not needed anymore (and it still made me wonder what Aaron meant when he wrote it). Why line.isspace() doesn't work is weird though. What do you mean by "doesn't work"?



More information about the Python-Dev mailing list