[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] downloading all files on page (with identical filenames)
From: |
Vinh Nguyen |
Subject: |
Re: [Bug-wget] downloading all files on page (with identical filenames) |
Date: |
Sun, 25 Jul 2010 11:59:08 -0700 |
On Sun, Jul 25, 2010 at 2:10 AM, Micah Cowan <address@hidden> wrote:
> On 07/24/2010 11:15 AM, Vinh Nguyen wrote:
>> Dear list,
>>
>> I'm using wget 1.12 on ubuntu 10.04. I don't know if this is a bug or
>> not. I'm using
>>
>> wget -U firefox -r -l1 -nd -e robots=off -A.pdf http://example.com
>>
>> to download pdf's off a page. The dilemma is that a lot of the pdf
>> links on the page has the same name (example.pdf). Wget is supposed
>> to append .1, .2, etc, to those files. However, with the above
>> command, only .1 is appended, and hence, only one file with .1 is
>> seen. If I set "-A.pdf,.pdf.1", then .1 and .2 gets appended, but .2
>> gets repeated and only one .2 file is available at the end.
>>
>> Are some of my arguments conflicting?
>
> Looks like that blasted delete-after logic again: it's because after the
> rename, the files no longer match -A.pdf, so they get deleted (not sure
> how you still have a .pdf.1 at all at the end, unless you're
> interrupting wget before it gets a chance to delete it). As a
> workaround, you should be able to use something like -A '*.pdf,*.pdf.*'
>
Thanks Micah, this works.
> --
> Micah J. Cowan
> http://micah.cowan.name/
>