[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] Combining --output-document with --recursive
From: |
Giuseppe Scrivano |
Subject: |
Re: [Bug-wget] Combining --output-document with --recursive |
Date: |
Sat, 26 May 2012 02:26:20 +0200 |
User-agent: |
Gnus/5.13 (Gnus v5.13) Emacs/23.4 (gnu/linux) |
Gijs van Tulder <address@hidden> writes:
> Hi,
>
> There's a problem if you combine --output-document with --recursive or
> --page-requisites. --output-document breaks the recursion.
>
> First you get a warning:
>
> WARNING: combining -O with -r or -p will mean that all downloaded
> content will be placed in the single file you specified.
>
> That is what you'd expect, no problem there.
>
> However, there is a problem with the recursion. Because Wget *appends*
> all downloaded content in the same file, the HTML and CSS parsers get
> confused. The same content is parsed over and over again, each time
> with a different URL context.
a solution would be probably to keep the file opened and continue
reading from the last position. It seems a bit tricky though, maybe the
wise solution is to don't allow at all -r and -O together.
Giuseppe