bug#36718: uniq treats distinct Korean characters equal
From: Felix Hamme
Subject: bug#36718: uniq treats distinct Korean characters equal
Date: Thu, 18 Jul 2019 16:08:57 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0

Dear all,
I found that uniq treats some distinct Korean characters as equal and
counts them as duplicates, although the characters differ. To be
precise, it happened to me with the characters 프 (U+D504) and
틀 (U+D2C0).
An example (input, expected output, actual output) can be found in the
attachment.
I tested this with uniq (GNU coreutils) 8.30.
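
A minimal sketch of how the behaviour might be checked, assuming Python 3 and a `uniq` binary on the PATH. The two-line input is illustrative and is not the content of the attachment; the helper name `run_uniq` is hypothetical. Comparing the result under `LC_ALL=C` with the result under the current locale is one way to see whether locale-dependent comparison plays a role; that it does is an assumption, not something established in this report.

```python
import os
import shutil
import subprocess

# The two syllables named in the report, written as code point escapes.
A = "\ud504"  # 프 (U+D504)
B = "\ud2c0"  # 틀 (U+D2C0)


def run_uniq(lines, lc_all=None):
    """Pipe `lines` through uniq and return its output lines.

    If lc_all is given, LC_ALL is set to it for the child process;
    otherwise the current environment is used unchanged.
    """
    env = dict(os.environ)
    if lc_all is not None:
        env["LC_ALL"] = lc_all
    data = "".join(line + "\n" for line in lines).encode("utf-8")
    result = subprocess.run(
        ["uniq"], input=data, stdout=subprocess.PIPE, env=env, check=True
    )
    return result.stdout.decode("utf-8").splitlines()


if __name__ == "__main__":
    # The characters are distinct as code points and as UTF-8 byte sequences.
    print(A != B, A.encode("utf-8"), B.encode("utf-8"))
    if shutil.which("uniq") is not None:
        # Byte-wise comparison in the C locale.
        print("LC_ALL=C:      ", run_uniq([A, B], lc_all="C"))
        # Whatever locale the calling shell has configured.
        print("current locale:", run_uniq([A, B]))
```

If the second call prints only one line while the first prints two, the difference lies in how the active locale compares the two strings rather than in the bytes themselves.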
Greetings
Felix Hamme
Attachment: uniq-korean-characters-bug.tar.gz
Description: application/gzip