(original) (raw)
On Tue, Apr 12, 2016 at 9:20 AM, Chris Angelico <rosuav@gmail.com> wrote:
> latin1? I thought latin1 had a code point for 0-255, so how could using it
\> raise an encoding error?
Latin-1 / ISO-8859-1 defines a character for every byte, so any byte
string will \*decode\*. It only defines 256 characters as having
equivalent bytes, though, so \*encoding\* can fail.
unless it was decoded as latin-1 in the first place. doesn't the surrogate escape thing only work properly if you decode/encode with the same encoding?
-CHB
Christopher Barker, Ph.D.
Oceanographer
Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception
Chris.Barker@noaa.gov
Oceanographer
Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception
Chris.Barker@noaa.gov