[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Bug-wget] Infinite loop, and bad 'adjust extension' on pdf
From: |
Lluís Batlle i Rossell |
Subject: |
[Bug-wget] Infinite loop, and bad 'adjust extension' on pdf |
Date: |
Sat, 24 Nov 2012 11:33:05 +0100 |
User-agent: |
Mutt/1.5.21 (2010-09-15) |
Hello,
I was downloading recursively. Specifically:
wget
--domains="data.inh.cat,data.jordibilbeny.com,www.inh.cat,www.jordibilbeny.com"
\
-H --adjust-extension -k -r -c -l 3 http://www.jordibilbeny.com/
And:
1) It went in an infinite loop while downloading http://www.inh.cat/robots.txt ,
it returning HTTP 416, and retrying again and again. I had to remove '-c' to
make wget go.
2) All links to '.pdf' files had its target changed to '.pdf.html' (that is, -k
and --adjust-extension I guess). But the pdf files downloaded didn't have the
".html" name addition. So the local links failed.
I used a "sed -i" in the files of my interest, to rewrite the anchor targets.
I'm running 1.13.4.
Thank you,
Lluís.
- [Bug-wget] Infinite loop, and bad 'adjust extension' on pdf,
Lluís Batlle i Rossell <=