[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: making "$BEZ20(B $B2mL-(B" say "TSUCHIYA Masatoshi" in Japanese
From: |
Kevin Rodgers |
Subject: |
Re: making "$BEZ20(B $B2mL-(B" say "TSUCHIYA Masatoshi" in Japanese |
Date: |
Wed, 06 Apr 2005 09:45:10 -0600 |
User-agent: |
Mozilla Thunderbird 0.9 (X11/20041105) |
Joe Corneli wrote:
> The question is: how to make sense of the output from
>
> lynx -dump http://emacs-w3m.namazu.org/ml/msg07861.html
>
> This looks harder than chinese utf translations, though I think that
> once I get the pattern down, it won't really be any harder (see
> http://lists.gnu.org/archive/html/help-gnu-emacs/2005-03/msg00689.html).
>
> But what is the pattern?
You have to pay attention to the HTTP headers sent by the server, in
particular:
Content-Type: text/html; charset=iso-2022-jp
According to the lynx man page, the -mime_header option "prints the MIME
header of a fetched document along with its source". Alternatively, you
can call lynx twice, first with the -head option to get just the HTTP
headers and then with -dump to get the content.
You should also try to handle Content-Transfer-Encoding headers.
--
Kevin Rodgers