java encoding charset suggestion (original) (raw)

Louis Wasserman lowasser at google.com
Mon Mar 18 18:47:25 UTC 2013


This would probably be likely to break lots of existing users depending on the default charset, depressingly enough.

On Mon, Mar 18, 2013 at 11:34 AM, Helio Frota <heliofrota at gmail.com> wrote:

Hi Martin Buchholz,

I believe (but I could be wrong) that only java applications exhibit this behavior, programs made with GTK or QT are not affected, they probably look at the level of the X11 case are not in variable LANG or assume a default locale.

> $ (unset LCALL LCCOLLATE LANG LANGUAGE GDMLANG; locale) > LANG= > LANGUAGE= > LCCTYPE="POSIX" > LCNUMERIC="POSIX" > LCTIME="POSIX" > LCCOLLATE="POSIX" > LCMONETARY="POSIX" > LCMESSAGES="POSIX" > LCPAPER="POSIX" > LCNAME="POSIX" > LCADDRESS="POSIX" > LCTELEPHONE="POSIX" > LCMEASUREMENT="POSIX" > LCIDENTIFICATION="POSIX" > LCALL= > Please try to use á é í ... But I think the operating system should set the default, not the > application. On my Ubuntu system I see the traditional ASCII English > default: > I agree, but the JVM could not be pro-active ? Thanks for reply ! 2013/3/18 Martin Buchholz <martinrb at google.com> > It would be nice if the world agreed on using UTF-8 as a universal > encoding for all text. However: > > Standard says > http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1chap08.html > > """If the LANG environment variable is not set or is set to the empty > string, the implementation-defined default locale shall be used.""" > > But I think the operating system should set the default, not the > application. On my Ubuntu system I see the traditional ASCII English > default: > > $ (unset LCALL LCCOLLATE LANG LANGUAGE GDMLANG; locale) > LANG= > LANGUAGE= > LCCTYPE="POSIX" > LCNUMERIC="POSIX" > LCTIME="POSIX" > LCCOLLATE="POSIX" > LCMONETARY="POSIX" > LCMESSAGES="POSIX" > LCPAPER="POSIX" > LCNAME="POSIX" > LCADDRESS="POSIX" > LCTELEPHONE="POSIX" > LCMEASUREMENT="POSIX" > LCIDENTIFICATION="POSIX" > LCALL= > > > On Mon, Mar 18, 2013 at 11:09 AM, Helio Frota <heliofrota at gmail.com_ _>wrote: > >> >> I would suggest taking enUS.UTF-8 as default when the LANG variable is >> not >> set to avoid problems with encoding. >> > -- Helio Frota http://www.heliofrota.com/

-- Louis Wasserman



More information about the core-libs-dev mailing list