lynx-dev

Re: lynx-dev chartrans broken with SOURCE_CHACHE


From: Leonid Pauzner
Subject: Re: lynx-dev chartrans broken with SOURCE_CHACHE
Date: Fri, 2 Jul 1999 23:33:18 +0400 (MSD)

27-Jun-99 13:54 Klaus Weide wrote:

> 1. Display character set (as saved in .lynxrc) is 'Western (ISO-8859-1)'.
> 2. Start lynx as

>      lynx http://www.cogsci.ed.ac.uk/~richard/unicode-sample.html

> 3. Check that things look right, especially in the 'Latin-1 Supplement'
>    block.

> 4. Go to 'O'ptions screen, change display character set to
>    '7 bit approximations (US-ASCII)'.   (I used the old style Options
>    Menu, in case it matters.)

Hmm, yes.
(Changing the display charset was not tested enough;
it was probably used a lot with EXP_CHARTRANS_AUTOSWITCH?)


> 5. Look at 'Latin-1 Supplement' block.
>    a. If SOURCE_CACHE was none, everything is ok.
>    b. If SOURCE_CACHE was FILE or MEMORY, 8-bit characters are still shown
>       directly.

^R will fix the output, but the problem is confirmed (for both Options menus).
I will check it later.

Just a brief look at the trace log. I have a page with 8-bit letters
(actually a local page, but my lynx was built with DEBUG_SOURCE_CACHE,
so local files are also kept in the source cache).

Now I change display charset from cp866 to cp852, see TransParams messages:
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

LYOptions.c/postoptions(): exiting...
                            need_reload = TRUE
                            need_end_reload = FALSE
LYpop[1]: address:file://localhost/c:/add/tmp/17514.htm
     title:Options Menu
HTParse: aName:file://localhost/c:/add/tmp/17514.htm   relatedName:
HTParse:      result:
Entered HTAnchor_findAddress
Anchor 212de8 with address `file://localhost/c:/add/tmp/17514.htm' already exists.
HTAccess: loading document file://localhost/c:/add/tmp/17514.htm
HTAccess: Document already in memory.
LYpop[0]: address:file://localhost/c:/uue/koi8/ache/contents.htm
     title:ToC
HTParse: aName:file://localhost/c:/uue/koi8/ache/contents.htm   relatedName:
HTParse:      result:
Entered HTAnchor_findAddress
Anchor 20af40 with address `file://localhost/c:/uue/koi8/ache/contents.htm' already exists.
HTAccess: loading document file://localhost/c:/uue/koi8/ache/contents.htm
HTAccess: Document already in memory.

Reparsing from source memory cache 20e8e8
HTFormat: Constructing stream stack for text/html to www/present
HTFormat: Looking up presentation for text/html to www/present
FindPresentation: found exact match: text/html
StreamStack: found exact match: text/html
UCSetTransParams: from koi8-r(27) to cp866(25)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Reusing source memory cache 20e8e8
SGML Doctype:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
SGML: Start <HTML>
UCSetTransParams: from cp866(25) to cp866(25)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
GridText: Auto-uncaching
UCSetTransParams: from cp866(25) to cp852(18)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
GridText: start HText_new
GridText: Change to style Normal
GridText: split_line(0 [now:0]) called
me->tag_charset: 0 -> 40 (me->UCLYhndl: 25, tag_charset: 40)
HTML:begin_element[0]: adding style to stack - Normal
SGML: Start <HEAD>
HTML:begin_element[1]: adding style to stack - Normal
SGML: Start <BASE>
SGML: Start <META>
LYHandleMETA: HTTP-EQUIV="Content-Type" NAME="NULL" CONTENT="text/html; charset=koi8-r"
UCSetTransParams: from koi8-r(27) to cp866(25)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
LYHandleMETA: New charset: koi8-r
SGML: Start <LINK>
...
Reparse succeeded
LYOptions.c/postoptions(): now really exit.

GridText: HText_pageDisplay at line 1 started
GridText: HText_pageDisplay finished

---->> The output ignores the change of display charset. Now reload:
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^


HTuncache.. freeing document for 'file://localhost/c:/uue/koi8/ache/contents.htm'
Removing memory source cache 20e8e8
getfile: getting file://localhost/c:/uue/koi8/ache/contents.htm

HTParse: aName:file://localhost/c:/uue/koi8/ache/contents.htm   relatedName:
...
Entered HTAnchor_findAddress
New anchor 20c528 has hash 2 and address `file://localhost/c:/uue/koi8/ache/contents.htm'
HTAccess: loading document file://localhost/c:/uue/koi8/ache/contents.htm
...
HTLoadFile: Opening `c:\uue\koi8\ache\contents.htm' gives 201e28
HTFormat: Constructing stream stack for text/html to www/present
HTFormat: Looking up presentation for text/html to www/present
FindPresentation: found exact match: text/html
StreamStack: found exact match: text/html
UCSetTransParams: from windows-1251(26) to cp852(18)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Caching source for URL file://localhost/c:/uue/koi8/ache/contents.htm in memory cache 21d2f0
SGML Doctype:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
SGML: Start <HTML>
UCSetTransParams: from cp852(18) to cp852(18)
UCSetTransParams: from cp852(18) to cp852(18)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^
GridText: start HText_new
GridText: Change to style Normal
GridText: split_line(0 [now:0]) called
me->tag_charset: 0 -> 40 (me->UCLYhndl: 18, tag_charset: 40)
HTML:begin_element[0]: adding style to stack - Normal
SGML: Start <HEAD>
HTML:begin_element[1]: adding style to stack - Normal
SGML: Start <BASE>
SGML: Start <META>
LYHandleMETA: HTTP-EQUIV="Content-Type" NAME="NULL" CONTENT="text/html; charset=koi8-r"
UCSetTransParams: from koi8-r(27) to cp852(18)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
LYHandleMETA: New charset: koi8-r
UCSetTransParams: from koi8-r(27) to cp852(18)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
SGML: Start <LINK>

---->> Now the output matches the cp852 display charset.
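
The comparison between the two traces can be made mechanical. What follows
is only a minimal sketch, not part of Lynx: the helper names trans_targets
and stale_targets are hypothetical, and it assumes the trace lines keep the
"UCSetTransParams: from X(n) to Y(m)" shape shown above. It lists every
translation setup whose target is not the display charset that was just
selected.

```python
import re

# Matches trace lines such as "UCSetTransParams: from koi8-r(27) to cp866(25)".
# The line shape is taken from the trace above; it is an assumption that
# other Lynx builds print it the same way.
TRANS_RE = re.compile(
    r"UCSetTransParams: from (?P<src>[^\s(]+)\(-?\d+\) "
    r"to (?P<dst>[^\s(]+)\(-?\d+\)"
)


def trans_targets(trace_text):
    """Return (source, target) charset names for each UCSetTransParams line."""
    return [(m.group("src"), m.group("dst")) for m in TRANS_RE.finditer(trace_text)]


def stale_targets(trace_text, display_charset):
    """Return the pairs whose target differs from the expected display charset."""
    return [pair for pair in trans_targets(trace_text) if pair[1] != display_charset]


# UCSetTransParams lines copied from the reparse (source cache) trace above.
reparse_trace = """\
UCSetTransParams: from koi8-r(27) to cp866(25)
UCSetTransParams: from cp866(25) to cp866(25)
UCSetTransParams: from cp866(25) to cp852(18)
UCSetTransParams: from koi8-r(27) to cp866(25)
"""

# UCSetTransParams lines copied from the trace after the reload.
reload_trace = """\
UCSetTransParams: from windows-1251(26) to cp852(18)
UCSetTransParams: from cp852(18) to cp852(18)
UCSetTransParams: from cp852(18) to cp852(18)
UCSetTransParams: from koi8-r(27) to cp852(18)
UCSetTransParams: from koi8-r(27) to cp852(18)
"""

print(stale_targets(reparse_trace, "cp852"))
print(stale_targets(reload_trace, "cp852"))
```

For the reparse trace this reports the setups that still target cp866,
including the one made after the META charset was seen; for the reload
trace it reports nothing.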




> Best tested with a slang-compiled lynx, it shows C1 characters (illegal
> in the now-current display character set) in slang's escape
> representation.  (With a curses-compiled lynx they should be filtered out
> when displaying).  One can also use 'P'rint, with a printer definition
> that invokes less (or similar) *without* a LESSCHARSET or -r, to see
> what's in the HText structure.

> Modifications:

> 1. isn't strictly necessary.  The same applies if display character set
> is for example KOI8-R, just look at the appropriate block then.

> In 4. try one of the RFC 1345 choices instead of '7 bit approximations
> (US-ASCII)'.

> Leonid, can you look into this?

>   Klaus




