emacs-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Strange character set reported in TUTORIAL


From: Eli Zaretskii
Subject: Strange character set reported in TUTORIAL
Date: Mon, 11 Feb 2008 23:34:12 +0200

Today's CVS trunk on MS-Windows, font backend disabled:

  emacs -Q
  C-u C-h t Russian RET

Now go to any Cyrillic letter and type "C-u C-x =".  Surprisingly, I
see this:

          character: Д (1044, #o2024, #x414)
  preferred charset: gb18030 (GB18030)
         code point: 0xA7A5
             syntax: w  which means: word
           category: Y:Cyrillic characters of 2-byte character sets c:Chinese
                     h:Korean j:Japanese y:Cyrillic
        buffer code: #xD0 #x94
          file code: #xE4 (encoded by coding system cyrillic-koi8-unix)
            display: by this font (glyph code)
       -outline-Courier New-normal-r-normal-normal-13-97-96-96-c-*-koi8-r 
(#x414)

For comparison, Emacs 22.1 says this:

  character: Д (3636, #o7064, #xe34, U+0414)
    charset: cyrillic-iso8859-5
             (Right-Hand Part of Latin/Cyrillic Alphabet (ISO/IEC 8859-5): 
ISO-IR-144.)
 code point: #x34
     syntax: w  which means: word
   category: y:Cyrillic
buffer code: #x8C #xB4
  file code: #xE4 (encoded by coding system cyrillic-koi8-unix)
    display: by this font (glyph code)
     -outline-Courier New-normal-r-normal-normal-13-97-96-96-c-*-iso8859-5 
(#x414)

This isn't limited to Cyrillic scripts in any way: I see similar
display with other non-ASCII characters, such as Latin-1, including in
etc/HELLO: they all claim GB18030 as preferred charset and mention CJK
character sets in the category: line.

If this is a feature, can someone explain it?




reply via email to

[Prev in Thread] Current Thread [Next in Thread]