[Python-Dev] Fixing the XML batteries

Paul Moore p.f.moore at gmail.com
Fri Dec 9 19:24:41 CET 2011


On 9 December 2011 18:15, Bill Janssen <janssen at parc.com> wrote:

> I use ElementTree for parsing valid XML, but minidom for producing it.
>
> I think another thing that might go into "refreshing the batteries" is a feature comparison of BeautifulSoup and HTML5lib against the stdlib competition, to see what needs to be added/revised.  Having to switch to an outside package for parsing possibly invalid HTML is a pain.

For what little use I make of XML/HTML parsing, I use lxml, simply because it has a parser that covers the sort of HTML I have to deal with in real life. As I have lxml installed, I use it for any XML parsing tasks, just because I'm used to it.
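
For reference, the split Bill describes (ElementTree for parsing, minidom for producing) might look roughly like the sketch below, using only the stdlib. The sample document and all variable names are made up for illustration; this is not taken from anyone's actual code.

```python
import xml.etree.ElementTree as ET
from xml.dom import minidom

# Parse well-formed XML with ElementTree (sample input is hypothetical).
root = ET.fromstring("<catalog><book id='1'>Dive Into Python</book></catalog>")
titles = [book.text for book in root.findall("book")]

# Produce a new document with minidom, one <book> element per title.
doc = minidom.Document()
catalog = doc.createElement("catalog")
doc.appendChild(catalog)
for n, title in enumerate(titles, start=1):
    book = doc.createElement("book")
    book.setAttribute("id", str(n))
    book.appendChild(doc.createTextNode(title))
    catalog.appendChild(book)

# Serialize the minidom document to a string.
xml_text = doc.toxml()
```

Neither module copes with malformed HTML, which is the gap that lxml, BeautifulSoup and html5lib fill.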

Paul


