bug#37036: [PATCH] Inconsistent ASCII and Latin char categories
From: Eli Zaretskii
Subject: bug#37036: [PATCH] Inconsistent ASCII and Latin char categories
Date: Thu, 15 Aug 2019 18:27:28 +0300
> From: Mattias Engdegård <mattiase@acm.org>
> Date: Thu, 15 Aug 2019 14:17:15 +0200
>
> The ASCII (a) and Latin (l) character categories are inconsistent in what
> characters they contain.
>
> It should be clear what the ASCII category means, but it omits 00-1f
> (contrary to a comment in the code).
>
> The Latin category isn't exactly defined anywhere but should reasonably
> comprise letters from Latin-based scripts. Currently, it also includes many
> control characters and symbols from the ASCII and Latin-1 Supplement blocks,
> which seems hard to justify.
>
> Other changes to Latin could be argued: what modifiers/combining chars should
> be included? What about IPA and non-IPA phonetics? Ligatures? What about
> Latin-derived forms such as circled letters? &c. The attached patch does not
> go there but only fixes the glaring errors in the 00-ff range.
Did you try moving by words after these changes? What happens in
words that consist of ASCII and non-ASCII Latin characters, for
example?
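
A minimal sketch of one way to run such a check, not taken from the patch itself: it prints the category mnemonics of a few sample characters and then moves forward by one word in a temporary buffer containing mixed ASCII and non-ASCII Latin text. The sample characters and the test string are arbitrary choices for illustration.

```elisp
;; Show the category mnemonics currently assigned to a few characters:
;; an ASCII letter, a Latin-1 letter, and an ASCII control character.
(dolist (ch '(?a ?é ?\t))
  (message "U+%04X: %s" ch
           (category-set-mnemonics (char-category-set ch))))

;; Move forward by one word over text that mixes ASCII and
;; non-ASCII Latin letters, and return the resulting position.
;; If "naïve" is treated as a single word, point stops right after
;; its fifth character; an earlier stop would indicate that a word
;; boundary was found inside the word.
(with-temp-buffer
  (insert "naïve café")
  (goto-char (point-min))
  (forward-word 1)
  (point))
```

Evaluating both forms before and after applying the patch, and comparing the results, would show whether the category changes affect word motion.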