lynx-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Lynx-dev] Lynx HTML Parsing Bug


From: Isaac Morland
Subject: [Lynx-dev] Lynx HTML Parsing Bug
Date: Wed, 11 Oct 2006 09:37:33 -0400 (EDT)

There is a bug in Lynx' handling of at least some HTML tags. If whitespace precedes the closing '>' of "</script>" then problems occur (see below for Unix and Lynx version, as well as the HTML source obtained using -mime_header and the result of using -dump). If the space is removed, then -dump shows the title and line of text as expected.

This was detected because I have an HTML-generation library that puts a newline ('\n') before every '>' in order to put the output on multiple lines without ever inserting unwanted whitespace between tags. The W3C validator has no problem with this and I have never observed any other browser to have a problem.

Since I'm reporting things I'd like changed in Lynx, I will also mention that I'd really like -mime_header to *always* just show what comes back from the server. If the server response is a 401, then the server headers are not shown; instead the Lynx error message about access denied is shown.

Thanks for putting out Lynx.  It's still useful....

@cpu102[110]% uname -a
SunOS cpu102.cs 5.8 Generic_117350-35 sun4u sparc
@cpu102[111]% lynx -version
Lynx Version 2.8.5dev.7 (06 Jan 2002)
libwww-FM 2.14, SSL-MM 1.4.1, OpenSSL 0.9.7b
Built on solaris2.5 Jul 22 2003 15:32:14

Copyrights held by the University of Kansas, CERN, and other contributors.
Distributed under the GNU General Public License.
See http://lynx.browser.org/ and the online help for more information.

See http://www.moxienet.com/lynx/ for information about SSL for Lynx.
See http://www.openssl.org/ for information about OpenSSL.

@cpu102[112]% lynx -mime_header http://www.cs.uwaterloo.ca/~ijmorlan/lynxbug.html
HTTP/1.1 200 OK
Date: Wed, 11 Oct 2006 12:44:52 GMT
Server: Apache/1.3.34 Ben-SSL/1.55 (Unix) mod_perl/1.21
Last-Modified: Tue, 10 Oct 2006 12:59:09 GMT
ETag: "c7308d-120-452b991d"
Accept-Ranges: bytes
Content-Length: 288
Connection: close
Content-Type: text/html; charset=utf-8

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" 
"http://www.w3.org/TR/html4/strict.dtd";>
<html>
<head>
<script src="/templates/styleswitcher.js" type="text/javascript">
</script >
<title>Test Page</title>
</head>

<body>
<h1>Test Page</h1>

<p>This is a test page.</p>

</body>
</html>
@cpu102[113]% lynx -dump http://www.cs.uwaterloo.ca/~ijmorlan/lynxbug.html



@cpu102[114]%


Isaac Morland                   CSCF Web Guru
DC 2554C, x36650                WWW Software Specialist




reply via email to

[Prev in Thread] Current Thread [Next in Thread]