[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#31665: libxml-parse-html-region' doesn't extract text in tables
From: |
Lars Ingebrigtsen |
Subject: |
bug#31665: libxml-parse-html-region' doesn't extract text in tables |
Date: |
Mon, 30 Sep 2019 07:05:23 +0200 |
User-agent: |
Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux) |
The HTML in that email is invalid. It's basically on the form
<table>
<tbody>
foo
</tbody>
</table>
"foo" won't be rendered by shr.
shr does try to deal with invalid tables, though. If the <tbody>
elements hadn't been there, then the "foo" would have been, so I guess
some more work is required in that area.
--
(domestic pets only, the antidote for overdose, milk.)
bloggy blog: http://lars.ingebrigtsen.no