Re: Unicode collation for readline completion in bash

bug-bash

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Unicode collation for readline completion in bash

From:	Linda Walsh
Subject:	Re: Unicode collation for readline completion in bash
Date:	Fri, 22 Jun 2012 20:37:19 -0700
User-agent:	Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.8.1.24) Gecko/20100228 Lightning/0.9 Thunderbird/2.0.0.24 Mnenhy/0.7.6.666



Rob Hoelz wrote:

Hello,

I have an idea for a new feature for bash/readline, and I wanted to get
the community's feedback before I go about implementing it.

---
        Have you looked at the Unicode collation algorithm?
        They have one published with each released.


        You say you want to ignore diacritics...ok... but what order would you 
put
them in?

        How about upper/lower case?  Unicode has a specified ordering that that 
most

OS implementations don't follow. Bash doesn't follow the Unicode orderingsaying, wellif POSIX doesn't say I have to, then I won't. POSIX is a mostly provincialstandard basedon ascii standard C -- with their official locale being called the C-localeuntil recently,

when they realize how provincial it made their standard look.   A standard that 
only
supports 127 (or less if the anti-invisi-char freaks get their way) chars?

        I would see no problem with implementing a switch that toggles ignoring
diacritics on or off... MS has a similar option in its OS search function -- for
searches, "should" it ignore diacritics or not.


        But I'd also like to see unicode ordering, as it's yet another worldwide
standard that the US isn't participating in (we got POSIX and the antiquated 
English
measurement system...)...

http://unicode.org/reports/tr10/ - Unicode Collation Algorithm, UnicodeTechnical standard #10



http://unicode.org/charts/uca/  - charts to show you the effect of the 
algorithm...

Note A-Z occur before a-z... unlike in most implementations ofen_US.UTF-8...where they

are intermixed...

[Prev in Thread]

Current Thread

[Next in Thread]

Unicode collation for readline completion in bash, Rob Hoelz, 2012/06/20
- Re: Unicode collation for readline completion in bash, Chet Ramey, 2012/06/22
- Re: Unicode collation for readline completion in bash, Linda Walsh <=

Prev by Date: Re: cd // produces unexpected results
Next by Date: Re: cd // produces unexpected results
Previous by thread: Re: Unicode collation for readline completion in bash
Next by thread: **/ directory wildcards leak file descriptors and memory
Index(es):
- Date
- Thread