bug-bison
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Error UTF-8 strings


From: Hans Åberg
Subject: Re: Error UTF-8 strings
Date: Wed, 24 Jun 2020 17:43:48 +0200

> On 24 Jun 2020, at 16:05, Ken Moffat via Bug reports for Bison, the GNU 
> parser generator <bug-bison@gnu.org> wrote:
> 
> On Wed, Jun 24, 2020 at 10:20:48AM +0200, Hans Åberg wrote:
>> 
>> I pointed out that out: There is a double bug, locale dependent generation 
>> of the parser file, and relying on software that can't handle LC_CTYPE=UTF-8.
> 
> On (at least) linux using glibc, LC_CTYPE requires a valid locale.
> And UTF-8 on its own is not a valid locale.
> 
> A quick search on google suggests that LC_CTYPE will, among other
> things, control what is a valid letter, and lowercase|uppercase
> conversions.

I have found no information about what POSIX says is a valid locale.

> Taking an easy case, with languages written in latin alphabets, what
> is the uppercase of 'i' ?  In Turkey it is İ (with a dot), because
> in turkish dotted-i and dotless-i are different letters.

There is a LANG variable that might be set.





reply via email to

[Prev in Thread] Current Thread [Next in Thread]