[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] Exposing wget functionality for extracting links from a web p
From: |
Ángel González |
Subject: |
[Bug-wget] Exposing wget functionality for extracting links from a web page |
Date: |
Mon, 16 Sep 2013 01:56:03 +0200 |
User-agent: |
Thunderbird |
There have been several requests to reuse the wget html parser for
finding links or requisites.
See "wget questions" thread from August, or "wget source code about -p
option" from June.
These two patches provide a new binary doing just that.
The main drawback of the current code is that it misses the connection
into the autoconf-generated Makefile. Maybe there is here someone
literate on autoconf that can help on that bit?
Meanwhile you can compile it with:
gcc -o get-urls get-urls.c html-url.c html-parse.c hash.c log.c
url.c ../lib/c-ctype.c ../lib/quotearg.c ../lib/c-strcasecmp.c
../lib/xalloc-die.c ../lib/xmalloc.c css_.c css-url.c iri.c utils.c
../lib/regex.c ../lib/exitfail.c exits.c host.c ../lib/localcharset.c
../lib/ioctl.c -I../lib -I. -lidn -lpcre
(you will need to compile wget first for having some of those files created)
Regards
0001-Moved-free_urlpos.patch
Description: Text Data
0002-Expose-wget-functionality-for-extracting-links-from-.patch
Description: Text Data
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- [Bug-wget] Exposing wget functionality for extracting links from a web page,
Ángel González <=