From: Adam Twardoch
Subject: Re: [ft] Font family/style name issue (for msgothic shipped with vista).
Date: Thu, 30 Nov 2006 22:13:07 +0100
User-agent: Thunderbird 1.5.0.8 (Macintosh/20061025)
is being treated as UCS4, whereas the actual string in this font is UTF-16. Does "Unicode full repertoire (as per OTF spec.)" mean UCS4?
No. UTF-16 can represent the full Unicode repertoire (code points up to U+10FFFF, i.e. 1,114,111). Non-BMP code points are represented using surrogate pairs.
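To make the surrogate-pair mechanism concrete, here is a minimal sketch in Python. The function name to_surrogate_pair is made up for illustration and is not part of FreeType or any font library; it only shows the standard arithmetic that maps a non-BMP code point to two 16-bit units.

```python
def to_surrogate_pair(code_point):
    """Split a non-BMP code point into a (high, low) UTF-16 surrogate pair.

    Hypothetical helper for illustration only.
    """
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("only code points U+10000..U+10FFFF need surrogates")
    offset = code_point - 0x10000        # 20-bit value
    high = 0xD800 + (offset >> 10)       # top 10 bits -> high surrogate
    low = 0xDC00 + (offset & 0x3FF)      # bottom 10 bits -> low surrogate
    return high, low


# Cross-check against Python's own UTF-16BE encoder.
high, low = to_surrogate_pair(0x1F600)
assert chr(0x1F600).encode("utf-16-be") == high.to_bytes(2, "big") + low.to_bytes(2, "big")
```

The highest code point, U+10FFFF, maps to the last possible pair (0xDBFF, 0xDFFF), which is why UTF-16 stops exactly at 1,114,111.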
UCS-4, UTF-7 and UTF-8 can all represent the full Unicode range as well. The only encoding form that cannot is UCS-2, which is essentially UTF-16 minus the surrogates, and is almost never used. Code points beyond 1,114,111 (U+10FFFF) are not and never will be encoded in Unicode.
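A quick way to check this for yourself, sketched in Python under the assumption that its "utf-32-be" codec stands in for UCS-4 (UTF-7 is left out to keep the example short). The helper name round_trips is hypothetical.

```python
MAX_CODE_POINT = 0x10FFFF  # 1,114,111, the last Unicode code point


def round_trips(code_point, encoding):
    """Return True if the code point survives encode + decode unchanged."""
    ch = chr(code_point)
    return ch.encode(encoding).decode(encoding) == ch


# Every full-range encoding form can carry the highest code point.
results = {enc: round_trips(MAX_CODE_POINT, enc)
           for enc in ("utf-8", "utf-16-be", "utf-32-be")}

# Anything past U+10FFFF is not a Unicode code point at all.
try:
    chr(MAX_CODE_POINT + 1)
    beyond_range_rejected = False
except ValueError:
    beyond_range_rejected = True
```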
In OpenType fonts, all Unicode strings are encoded using UTF-16BE (i.e. the Motorola big-endian flavor).
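As a rough illustration of what this means for the reader of a name string, the sketch below decodes raw bytes as UTF-16BE in Python. The function decode_name_string and the sample bytes are assumptions made for the example; this is not FreeType's API, and the bytes are not taken from msgothic.

```python
def decode_name_string(raw):
    """Decode raw Unicode name bytes from a font as UTF-16BE.

    Hypothetical helper: 'raw' is assumed to be the string bytes exactly
    as stored in the font, big-endian, with no byte-order mark.
    """
    return raw.decode("utf-16-be")


# Made-up sample: two bytes per BMP character, high byte first.
sample = b"\x00M\x00S\x00 \x00G\x00o\x00t\x00h\x00i\x00c"
family = decode_name_string(sample)

# Treating the same bytes as UCS-4 (four bytes per character) does not work.
try:
    sample.decode("utf-32-be")
    ucs4_ok = True
except UnicodeDecodeError:
    ucs4_ok = False
```

Reading the bytes two at a time, and combining surrogate pairs where they occur, is what recovers the intended text; reading them four at a time does not.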
-- Adam Twardoch | Language Typography Unicode Fonts OpenType | twardoch.com | silesian.com | fontlab.net