[Python-Dev] Encoding detection in the standard library? (original) (raw)

Terry Reedy tjreedy at udel.edu
Wed Apr 23 07:26:22 CEST 2008


""Martin v. Löwis"" <martin at v.loewis.de> wrote in message news:480EC376.8070406 at v.loewis.de... |> I certainly agree that if the target set of documents is small enough it || Ok. What advantage would you (or somebody working on a similar project) | gain if chardet was part of the standard library? What if it was not | chardet, but some other algorithm?

It seems to me that since there is not a 'correct' algorithm but only competing heuristics, encoding detection modules should be made available via PyPI and only be considered for stdlib after a best of breed emerges with community support.



More information about the Python-Dev mailing list