bug-gawk
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [bug-gawk] Does gawk character classes follow this?


From: Peng Yu
Subject: Re: [bug-gawk] Does gawk character classes follow this?
Date: Thu, 14 Feb 2019 08:55:24 -0600

> See the cited section in The GNU Awk User's Guide for an excellent reason why 
> this would not be a good idea. Quote: "Character classes are a feature 
> introduced in the POSIX standard. A character class is a special notation for 
> describing lists of characters that have a specific attribute, but the actual 
> characters can vary from country to country and/or from character set to 
> character set. For example, the notion of what is an alphabetic character 
> differs between the United States and France."

At least, there is a common demonimator that would work for the ASCII subset.

As in the current manual, one has to write a program to try an
individual character class to be exactly sure what it means. The
manual should be written in a way so that users don't need to write
the code to try for at least the ASCII subset.

> Why don't you provide a draft for this table, and we'll see whether it is not 
> only "easy to read" but also valid for any character set and locale 
> combination where gawk is supposed to be installable and runnable?

It is already in the link that I provide.

-- 
Regards,
Peng



reply via email to

[Prev in Thread] Current Thread [Next in Thread]