wget-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: wget | Slashes Repeated Eternally while Generating Adresses during R


From: @rockdaboot
Subject: Re: wget | Slashes Repeated Eternally while Generating Adresses during Recursive Site Download (#8)
Date: Mon, 27 Dec 2021 17:29:38 +0000



Tim Rühsen commented:


Thanks for the report.

I didn't wait for so long, but encountered this in the wget log (when adding 
`-d -olog` to your command line):
```
book.itep.ru/1/intro1.htm: merge(‘http://book.itep.ru/1/intro1.htm’, 
‘..//4/4/rfc2858.htm’) -> http://book.itep.ru/1/..//4/4/rfc2858.htm
appending ‘http://book.itep.ru//4/4/rfc2858.htm’ to urlpos.
```

It looks like wget doesn't do the path normalization correctly (merging 
multiple `/` characters into one).

Using `wget2` instead of `wget` seems to work correctly:
```
27.181439.122 *url = ..//4/4/rfc2858.htm
27.181439.122 path 1/..//4/4/rfc2858.htm ->
27.181439.122      4/4/rfc2858.htm
27.181439.122 *4 http://book.itep.ru/4/4/rfc2858.htm 35
Adding URL: http://book.itep.ru/4/4/rfc2858.htm
```

-- 
Reply to this email directly or view it on GitLab: 
https://gitlab.com/gnuwget/wget/-/issues/8#note_796053443
You're receiving this email because of your account on gitlab.com.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]