|
From: | Linda Walsh |
Subject: | Re: locale specific ordering in EN_US -- why is a<A<b<B<y<Y<z<Z? |
Date: | Mon, 21 May 2012 12:56:34 -0700 |
User-agent: | Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.8.1.24) Gecko/20100228 Lightning/0.9 Thunderbird/2.0.0.24 Mnenhy/0.7.6.666 |
Aharon Robbins wrote:
You'd think unicode would have something to say about collation order that wouldn't allow such randomness, but maybe not.It actually makes sense that it doesn't, since Unicode is more or less a mapping of code points to glyphs, which is language independant. The rules for collating depend upon the language.
---- Actually it makes sense that it would... and it DOES. Bash and POSIX would appear to violate Unicode ordering. Algorithm reference: http://www.unicode.org/reports/tr10/tr10-24.html Collation order chart: http://www.unicode.org/Public/UCA/latest/allkeys.txt ---- It shows that cap A-Z come before a-z. If a sort order is set (as mine used to be) to en_US.UTF-8 then the Unicode sort order should have been used. In bash, it was not. It sounds like this is a bug rooted in the C libraries?
[Prev in Thread] | Current Thread | [Next in Thread] |