Unicode Mail List Archive: Re: Exemplar Characters (original) (raw)
Next message: A LeDonne: "Re: Some Missing Astrological Symbols"
- Previous message: Michael Everson: "Re: Apostrophes (was Re: Exemplar Characters)"
- In reply to: =?iso-8859-1?Q?Marc Brugui=E8res?=: "=?iso-8859-1?Q?Re: Re: Exemplar Characters?="
- Next in thread: Mark Davis: "Re: Exemplar Characters"
- Reply: Mark Davis: "Re: Exemplar Characters"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
- Mail actions: [ respond to this message ] [ mail a new topic ]
On 11/15/2005 11:26 AM, Marc Bruguières wrote:
>
>Michael Everson:
>
>
>>At 09:42 -0500 2005-11-15, Chris Harvey wrote:
>>
>>
>>
>>>Would this mean that the choice between U+2019 and U+02BC is decided
>>>by the phonetic realisation of the apostrophe?
>>>
>>>
>>Maybe. There isn't a rule, any more than there is a rule about the
>>phonetic value of in any particular language.
>>
>>Polynesian languages should all use the modifier letters, for
>>consistency. It's a glottal there.
>>
>>
>
>Did they before Unicode? Do they do now? If their usage differs, isn't this causing a bit of confusion? (I doubt U+02BC is very much used as it is not in standard fonts like Times Roman on XP SP2 and U+2019 is readily available...)
>
>Does word highlighting work less well in Breton than in Polynesian languages because Breton, let's say, would use U+2019 and the other U+02BC? Don't think so. At least it is not the case in Word 2003 on XP, in fact U+2019 for Breton works better inside of words than U+02BC which breaks them, incidentally this is strange for a modifier I would say.
>
>Isn't this a case of overunification?
>
I think you meant 'over-dis-unification' here.
>Looks the same to users, seems it should behave the same way (in fact whether an apostrophe breaks a word or not is language dependent[1]). Why two characters? For extra confusion and spoofing fun?
>
>
I've always been troubled by 02BC / 2019 myself. Since it's not possible
to distinguish the two uses of 2019, it seems to make little sense to
pull out some partial use and assign it to 02BC. And the fact that
there's no visual distinction is really troubling.
Your comments imply that implementers have voted with their software and
decided to unify rather than to support 02BC. That this is not limited
to MS is shown by the fact that Google does not support this character
as a modifier letter. c <02BC> t, brings up c't (the German Computer
magazine), but so does "c,t".
A./
>--
>Marc
>
>[1] That is even too simple : "� pied d'oeuvre" makes 4 words (�, pied, de, oeuvre, four dictinct entries in a dictionary) but "aujourd'hui" or "chef-d'oeuvre" are singles words (one entry in a dictionary). Strictly speaking, a dictionary is necessary although obviously cheaper software may approximate algorithmically.
>
>
>
>
>
>
>
>
>
>
- Next message: A LeDonne: "Re: Some Missing Astrological Symbols"
- Previous message: Michael Everson: "Re: Apostrophes (was Re: Exemplar Characters)"
- In reply to: =?iso-8859-1?Q?Marc Brugui=E8res?=: "=?iso-8859-1?Q?Re: Re: Exemplar Characters?="
- Next in thread: Mark Davis: "Re: Exemplar Characters"
- Reply: Mark Davis: "Re: Exemplar Characters"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ] [ attachment ]
- Mail actions: [ respond to this message ] [ mail a new topic ]
This archive was generated by hypermail 2.1.5: Tue Nov 15 2005 - 15:01:46 CST