Re: Trying to input Unicode via GNU Emacs 21.3.1

help-gnu-emacs

From:	Peter Dyballa
Subject:	Re: Trying to input Unicode via GNU Emacs 21.3.1
Date:	Sat, 12 Feb 2005 14:29:48 +0100


On 11.02.2005 at 22:00, List account wrote:

For instance, I need to be able to display the typical accented Spanish, Italian and French characters. As an example, I can input "Alarcón" in Emacs and it looks fine, but it displays in my browser (Camino 0.82 on Mac OS X) as "AlarcÃ³n". The odd thing is that I basically copied and modified this text from a page that actually works just fine.

Camino is not clever at guessing an HTML file's encoding: I can tell it the right encoding ten times or more, and when I return to that page it is back to the default encoding from the preferences. So don't leave it to the browser to guess; start your HTML file this way:


<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
  <!-- ... other things ... -->
</head>

All charset names are defined here: http://www.iana.org/assignments/character-sets.

The two characters Ã³ show that what you typed in GNU Emacs was correctly encoded as UTF-8. Character Palette (in Mac OS X) tells me that ó is "C3 B3" in UTF-8, i.e. Ã followed by ³ when those two bytes are read as a single-byte Latin encoding. Camino should display these two bytes as one ó if you VIEW the page as UTF-8. Declaring the charset in the HTML source's head should make Camino, and other browsers, switch automatically to the correct character set. You may also need to select a font that covers Unicode!
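The effect described above can be reproduced outside the browser. The following is a minimal Python sketch, assuming the browser's wrongly chosen default is Latin-1 (ISO-8859-1); the variable names are only illustrative.

```python
# Sketch: how UTF-8 bytes turn into "Ã³" when read with the wrong charset.
# Assumption: the browser falls back to Latin-1 (ISO-8859-1).

text = "Alarcón"

# What Emacs wrote to disk when the buffer was saved as UTF-8.
utf8_bytes = text.encode("utf-8")

# The two bytes that encode ó, shown the way Character Palette shows them.
o_acute_hex = " ".join(f"{b:02X}" for b in "ó".encode("utf-8"))

# What a browser shows if it interprets those same bytes as Latin-1.
misread = utf8_bytes.decode("latin-1")

# Reading the bytes as UTF-8 again gives back the original text.
recovered = misread.encode("latin-1").decode("utf-8")

print(o_acute_hex)
print(misread)
print(recovered)
```

Nothing is wrong with the file itself here; only the interpretation of its bytes differs, which is why declaring the charset in the HTML head is enough.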


I have the following lines in my .emacs:
(setq locale-coding-system 'utf-8)
(set-terminal-coding-system 'utf-8)
(set-keyboard-coding-system 'utf-8)
(set-selection-coding-system 'utf-8)
(prefer-coding-system 'utf-8)

It has been said a few times that this is too much; at least set-keyboard-coding-system is incorrect. Usually your keyboard will work in some Latin mode, i.e. produce only *one* byte on hitting or releasing a key (a UTF-8 character takes one to four bytes, depending on its code point). It might be more helpful to set LANG to some (Spanish? French?) UTF-8 setting (man locale).
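To make the difference between a single-byte keyboard encoding and UTF-8 concrete, here is a small Python sketch that counts the UTF-8 bytes for a few sample characters. The samples are my own choice and are not taken from the thread.

```python
# Sketch: number of bytes UTF-8 needs for characters from different ranges.
# The sample characters are illustrative assumptions.

samples = ["a", "ó", "א", "€", "😀"]

# Map each character to the length of its UTF-8 encoding.
lengths = {ch: len(ch.encode("utf-8")) for ch in samples}

for ch, n in lengths.items():
    print(f"U+{ord(ch):04X} {ch!r}: {n} byte(s)")
```

The length depends only on the code point: the Hebrew letter א, from a right-to-left script, takes two bytes just like ó.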

I have also tried the technique of hitting [C-q] and entering the Unicode string, but it chokes on the codes for accented characters and instead of inserting the accented "a" character (0x00E1) by typing C-q 0 0 E 1 it produces "^@e1".

As far as I know, the C-q syntax supports only *octal* values. So the input ends when you type something outside the octal range 0...7; e is one such terminator, RET is another. That is why you see ASCII NUL, which Emacs displays as ^@, followed by e and 1, which are inserted unchanged.
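The behaviour described above can be modelled in a few lines. This Python sketch is a simplified model of what C-q appears to do, not the actual Emacs implementation; `quoted_insert` and its `radix` parameter are hypothetical names introduced for illustration.

```python
# Sketch: simplified model of C-q reading an octal character code.
# Assumption: digits are read until the first key that is not a digit in
# the given radix; RET ends the code and is dropped, and any other
# terminating key is inserted as typed.

def quoted_insert(keys: str, radix: int = 8) -> str:
    digits = "0123456789abcdef"[:radix]
    code = 0
    consumed = 0
    for key in keys:
        if key.lower() not in digits:
            break  # first non-digit key ends the code
        code = code * radix + digits.index(key.lower())
        consumed += 1
    if consumed == 0:
        return keys  # no digits at all: the keys are inserted literally
    rest = keys[consumed:]
    if rest.startswith("\r"):
        rest = rest[1:]  # RET only terminates the code
    return chr(code) + rest

# The hex code point 0x00E1 written in octal, as C-q expects it.
octal_code = format(0x00E1, "o")

print(repr(quoted_insert("00e1")))              # NUL, then "e1" unchanged
print(octal_code)
print(repr(quoted_insert(octal_code + "\r")))
```

Under this model, typing C-q 3 4 1 RET would be the octal way to ask for the character 0x00E1. Emacs also has a `read-quoted-char-radix` variable for reading the code in another base such as 16; whether it helps in a given Emacs 21 setup is worth checking before relying on it.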


--
Greetings

  Pete

  Basic, n.:
        A programming language.  Related to certain social diseases in
that those who have it will not admit it in polite company.
