bug-coreutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#32472: sort doesn't sort and uniq loses data for many non-Latin scri


From: Assaf Gordon
Subject: bug#32472: sort doesn't sort and uniq loses data for many non-Latin scripts on UTF-8 locales
Date: Mon, 29 Oct 2018 21:54:59 -0600
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.2.1

tags 32472 notabug
close 32472
stop


On 2018-08-18 11:34 a.m., Paul Eggert wrote:
Vaayda Yaasra wrote:
Here’s an example in Syriac:

ܡܠܬܐ
ܒܝܬܐ
ܒܪܢܫܐ
ܡܠܬܐ

Sort produces the following:

ܡܠܬܐ
ܒܝܬܐ
ܡܠܬܐ
ܒܪܢܫܐ

This is a property of your locale, so I suggest sending a bug report to whoever maintains your locale. You should be able to reproduce the problem by bypassing GNU 'sort' entirely and using the C strcoll function.

For what it's worth, I observe the problem on Ubuntu 18.04 but not on Fedora 28. As Fedora tends to be more up-to-date, perhaps the problem is fixed already in glibc.

Given the above, and with no further comments,
I'm closing this bug.

-assaf





reply via email to

[Prev in Thread] Current Thread [Next in Thread]