Re: `texindex` output depends on locale settings
From: Eli Zaretskii
Subject: Re: `texindex` output depends on locale settings
Date: Sun, 06 Nov 2022 17:13:19 +0200

> Date: Sun, 06 Nov 2022 17:05:00 +0200
> From: Eli Zaretskii <eliz@gnu.org>
> Cc: wl@gnu.org, bug-texinfo@gnu.org
>
> > It could instead be
> >
> > function islower(c)
> > {
> > return c ~ /[[:lower:]]/
> > }
> >
> > And similar for the others. That would work for any unicode character.
>
> Sure, but is the issue only with lower-case letters? What about
> collation order or even determining what is and isn't a character (as
> opposed to incomplete byte sequence)?

I mean: what if the document is encoded in Latin-1, but the locale's
codeset is UTF-8? Then the 0xE0 byte (à in Latin-1 encoding) will
probably not be identified as lower-case, because on its own it is
not a valid UTF-8 sequence.
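
To make the mismatch concrete, here is a minimal sketch in Python rather
than awk, so that the result does not depend on which locales are
installed. The helper name `is_lower_in` is hypothetical and only for
illustration. It is not part of texindex, and it models "the locale's
codeset" simply as the codec used to decode the bytes.

```python
# Illustration only: a hypothetical helper, not texindex code.
# It asks whether a byte string, interpreted in a given codeset,
# is exactly one lower-case character.

def is_lower_in(raw: bytes, codeset: str) -> bool:
    """Return True if raw decodes in codeset to one lower-case character."""
    try:
        text = raw.decode(codeset)
    except UnicodeDecodeError:
        # The bytes do not form a complete, valid sequence in this codeset,
        # so there is no character to classify.
        return False
    return len(text) == 1 and text.islower()


# 0xE0 is 'à' in Latin-1.
print(is_lower_in(b"\xe0", "latin-1"))    # True

# The same byte read as UTF-8 is only the start of a multi-byte sequence.
print(is_lower_in(b"\xe0", "utf-8"))      # False

# 'à' properly encoded in UTF-8 is the two bytes 0xC3 0xA0.
print(is_lower_in(b"\xc3\xa0", "utf-8"))  # True
```

The same byte is therefore classified differently depending on the
codeset assumed, which is the situation in question when the document
encoding and the locale disagree.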