bug-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#31665: libxml-parse-html-region' doesn't extract text in tables


From: Lars Ingebrigtsen
Subject: bug#31665: libxml-parse-html-region' doesn't extract text in tables
Date: Mon, 30 Sep 2019 07:05:23 +0200
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux)

The HTML in that email is invalid.  It's basically on the form

<table>
  <tbody>
    foo
  </tbody>
</table>

"foo" won't be rendered by shr.

shr does try to deal with invalid tables, though.  If the <tbody>
elements hadn't been there, then the "foo" would have been, so I guess
some more work is required in that area.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no






reply via email to

[Prev in Thread] Current Thread [Next in Thread]