[Python-Dev] Bytes path support

Greg Ewing greg.ewing at canterbury.ac.nz
Sun Aug 24 03:11:10 CEST 2014


Isaac Morland wrote:

HTML 5 allows non-ASCII-compatible encodings as long as U+FEFF (byte order mark) is used:

http://www.w3.org/TR/html-markup/syntax.html#encoding-declaration

Not sure about XML.

According to Appendix F here:

http://www.w3.org/TR/xml/#sec-guessing

an XML parser needs to be prepared to try all the encodings it supports until it finds one that works well enough to decode the XML declaration; only then can it find out the exact encoding used.
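
As a rough illustration of that two-step process, here is a minimal Python sketch. It is not taken from any real parser: the name sniff_xml_encoding, the signature table and the 256-byte window are all assumptions made for the example. It covers only the UTF-8, UTF-16, UTF-32 and ASCII-compatible families, leaving out EBCDIC and the unusual byte orders that Appendix F also mentions.

```python
import re

# Leading-byte signatures (BOMs, then "<?xm" patterns) mapped to a codec
# that is good enough to decode the XML declaration. Longer signatures
# come first so the UTF-32 BOMs are not mistaken for UTF-16 ones.
_SIGNATURES = [
    (b"\x00\x00\xfe\xff", "utf-32-be"),
    (b"\xff\xfe\x00\x00", "utf-32-le"),
    (b"\xef\xbb\xbf", "utf-8"),
    (b"\xfe\xff", "utf-16-be"),
    (b"\xff\xfe", "utf-16-le"),
    (b"\x00\x00\x00<", "utf-32-be"),
    (b"<\x00\x00\x00", "utf-32-le"),
    (b"\x00<\x00?", "utf-16-be"),
    (b"<\x00?\x00", "utf-16-le"),
    # Stand-in for "some ASCII-compatible encoding": latin-1 decodes any byte.
    (b"<?xm", "latin-1"),
]

# Matches the encoding pseudo-attribute inside the XML declaration.
_DECL = re.compile(
    r"""<\?xml[^>]*?encoding\s*=\s*["']([A-Za-z][A-Za-z0-9._-]*)["']"""
)


def sniff_xml_encoding(data: bytes, default: str = "utf-8") -> str:
    """Guess the encoding of an XML document from its first bytes.

    Hypothetical helper: step 1 picks an encoding family from the byte
    signature, step 2 decodes the declaration with it and reads the
    declared encoding name, if any.
    """
    head = data[:256]  # assumed to be enough to hold the declaration
    for prefix, family in _SIGNATURES:
        if head.startswith(prefix):
            break
    else:
        return default  # no BOM and no declaration: XML defaults to UTF-8

    text = head.decode(family, errors="replace")
    match = _DECL.search(text)
    if match:
        return match.group(1)
    # No encoding declared: fall back to the family, except for the
    # latin-1 stand-in, where the XML default applies.
    return default if family == "latin-1" else family
```

A caller would pass the result to bytes.decode() or to a real parser, after checking that the declared name is a codec it actually supports.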

-- Greg


