[Python-Dev] HTMLParser and HTML5 (original) (raw)

Brett Cannon brett at python.org
Fri Jul 29 22:31:48 CEST 2011


On Fri, Jul 29, 2011 at 11:31, Tres Seaver <tseaver at palladion.com> wrote:

-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1

On 07/29/2011 07:22 AM, Joao S. O. Bueno wrote: > I disaagree. Having proper html parsing out of the box is part of > the "batteries included" thing. And it is not a matter of "having > html 5" - as stated on this thread, fixing it for html5 will fix it > for html that exists in the "real world". > > Python has to work with quick 30-50 lines scripts deliverable > everywhere, not just has proper 3rd party libraries that can work as > part of a huge project using buildout. Assuming it were merged today, that parser would only be available on Python 3.3 and later: how is that "everywhere"?

Well, "everywhere, eventually". This gets down to the usual philosophical debate of what should (not) be in the stdlib so that those who have strict third-party code get access to useful libraries while balancing the desire of those who want to keep the stdlib lean or prevent stagnating the API of a module.

Having scripts that work against html5lib (which doesn't need buildout to install, or even setuptools) makes them portable to any version of Python supported by the library (Python 2.3+, AFAICT).

If the library was brought in they could probably continue to be portable with possibly just the addition of a try/finally on the import line.

-Brett

Tres. - -- =================================================================== Tres Seaver +1 540-429-0999 tseaver at palladion.com Palladion Software "Excellence by Design" http://palladion.com -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iEYEARECAAYFAk4y/JYACgkQ+gerLs4ltQ4KKwCgkyOlmb8xxhxg1qWH9RRbEpEw ne0AoL6NgRElbY61QRqnXJjiKoHq0ToW =fk3k -----END PGP SIGNATURE-----


Python-Dev mailing list Python-Dev at python.org http://mail.python.org/mailman/listinfo/python-dev Unsubscribe: http://mail.python.org/mailman/options/python-dev/brett%40python.org -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.python.org/pipermail/python-dev/attachments/20110729/967fcc6d/attachment.html>



More information about the Python-Dev mailing list