bug-coreutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#36718: uniq treats distinct Korean characters equal


From: Felix Hamme
Subject: bug#36718: uniq treats distinct Korean characters equal
Date: Thu, 18 Jul 2019 16:08:57 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0

Dear all,

I found that, when performing uniq on some Korean characters, it treats
them as equal (counts as duplicate) although the characters aren't
equal. To be precise, it happened to me on the Characters 프 (U+D504) and
틀 (U+D2C0).

An example (input, expected output, actual output) can be found in the
attachment.
I've tried that using uniq (GNU coreutils) 8.30.

Greetings
Felix Hamme

Attachment: uniq-korean-characters-bug.tar.gz
Description: application/gzip


reply via email to

[Prev in Thread] Current Thread [Next in Thread]