[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Error UTF-8 strings
From: |
Hans Åberg |
Subject: |
Re: Error UTF-8 strings |
Date: |
Wed, 24 Jun 2020 17:43:48 +0200 |
> On 24 Jun 2020, at 16:05, Ken Moffat via Bug reports for Bison, the GNU
> parser generator <bug-bison@gnu.org> wrote:
>
> On Wed, Jun 24, 2020 at 10:20:48AM +0200, Hans Åberg wrote:
>>
>> I pointed out that out: There is a double bug, locale dependent generation
>> of the parser file, and relying on software that can't handle LC_CTYPE=UTF-8.
>
> On (at least) linux using glibc, LC_CTYPE requires a valid locale.
> And UTF-8 on its own is not a valid locale.
>
> A quick search on google suggests that LC_CTYPE will, among other
> things, control what is a valid letter, and lowercase|uppercase
> conversions.
I have found no information about what POSIX says is a valid locale.
> Taking an easy case, with languages written in latin alphabets, what
> is the uppercase of 'i' ? In Turkey it is İ (with a dot), because
> in turkish dotted-i and dotless-i are different letters.
There is a LANG variable that might be set.
- Re: Error UTF-8 strings, (continued)
- Re: Error UTF-8 strings, Hans Åberg, 2020/06/23
- Re: Error UTF-8 strings, Akim Demaille, 2020/06/24
- Re: Error UTF-8 strings, Hans Åberg, 2020/06/24
- Re: Error UTF-8 strings, Akim Demaille, 2020/06/24
- Re: Error UTF-8 strings, Hans Åberg, 2020/06/24
- Re: Error UTF-8 strings, Ken Moffat, 2020/06/24
- Re: Error UTF-8 strings,
Hans Åberg <=
- Re: Error UTF-8 strings, Hans Åberg, 2020/06/24