[Python-Dev] cpython (2.7): #14538: HTMLParser can now parse correctly start tags that contain a bare /. (original) (raw)

Georg Brandl g.brandl at gmx.net
Tue Apr 24 21:02:43 CEST 2012


On 24.04.2012 20:34, Benjamin Peterson wrote:

2012/4/24 Georg Brandl <g.brandl at gmx.net>:

On 19.04.2012 03:36, ezio.melotti wrote:

http://hg.python.org/cpython/rev/36c901fcfcda changeset: 76413:36c901fcfcda branch: 2.7 user: Ezio Melotti <ezio.melotti at gmail.com> date: Wed Apr 18 19:08:41 2012 -0600 summary: #14538: HTMLParser can now parse correctly start tags that contain a bare /.

diff --git a/Misc/NEWS b/Misc/NEWS --- a/Misc/NEWS +++ b/Misc/NEWS @@ -50,6 +50,9 @@ Library -------

+- Issue #14538: HTMLParser can now parse correctly start tags that contain + a bare '/'. + I think that's misleading: there's no way to "correctly" parse malformed HTML. There is in the since that you can follow the HTML5 algorithm, which can "parse" any junk you throw at it.

Ah, good. Then I hope we are following the algorithm here (and are slowly coming to use it for htmllib in general).

Georg



More information about the Python-Dev mailing list