bug-gnu-libiconv
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug-gnu-libiconv] Converting ks_c_5601-1987 as CP949


From: Anders Thøgersen
Subject: [bug-gnu-libiconv] Converting ks_c_5601-1987 as CP949
Date: Tue, 26 Feb 2013 11:02:39 GMT
User-agent: One.com webmail 2.1.17

Hi,

I am converting emails to UTF-8 and ran into a Korean email which failed conversion to UTF-8.

I ran the command:

    cat korean-text.txt | iconv -f 'ks_c_5601-1987' -t 'UTF-8'

but the output was not correct and contained a number of ▣ and ♥ symbols which were
not there in the original text.  After some googling I discovered that I can use

    cat korean-text.txt | iconv -f 'CP949' -t 'UTF-8'

Instead, and I can see that this works well as my text is now correct.  

However, I do not get a warning "Conversion from ks_c_5601-1987 not supported" which tells
me that this character set is supported, at least partially, although it is not mentioned in the 
output of  iconv -l.  I am a bit confused.  Maybe there are characters of ks_c_5601-1987 that 
are not covered by CP949?

Also, I am wondering if I should be aware of other aliases when decoding.  Should I be referring
in general be referring to character sets by their code page name when converting to UTF-8?

Thanks for any help.

Best Regards
Anders

reply via email to

[Prev in Thread] Current Thread [Next in Thread]