wget-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: wget | GNU Wget Couldn't Convert Links Properly (#7)


From: Sam G. (@shamil.gumirov)
Subject: Re: wget | GNU Wget Couldn't Convert Links Properly (#7)
Date: Sun, 12 Dec 2021 04:59:21 +0000



Sam G_ commented on a discussion: 
https://gitlab.com/gnuwget/wget/-/issues/7#note_767563265

Hi,

Issue partially reproduced (macos), namely the absolute link to http scheme is 
stored as absolute one:
> **File 
> [www.eskimo.com\\\~scs\\dmr.html](http://www.eskimo.com%5C\~scs%5Cdmr.html)**
> Expected: index.html  
> Actual result: http://www.eskimo.com/\~scs/
I'm still investigating this issue, will inform progress.


Also, the new issue found for the same command above with the ftp link 
following does not consider `-H` option.

Actual: Ftp scheme link from https scheme page is followed even if no 
`-H/--span-hosts` option is set.
Expected: Ftp link following considers `-H/--span-hosts` option.
Example (ftp://rtfm.mit.edu/* should not be fetched):
```
--2021-12-12 12:11:25--  
https://www.eskimo.com/~scs/readings/portability.20031117.html
Reusing existing connection to www.eskimo.com:443.
HTTP request sent, awaiting response... 200 OK
Length: 19321 (19K) [text/html]
Saving to: ‘www.eskimo.com/~scs/readings/portability.20031117.html’

www.eskimo.com/~scs/readings/portability. 
100%[====================================================================================>]
  18.87K  --.-KB/s    in 0.001s

2021-12-12 12:11:25 (16.7 MB/s) - 
‘www.eskimo.com/~scs/readings/portability.20031117.html’ saved [19321/19321]

--2021-12-12 12:11:25--  
ftp://rtfm.mit.edu/pub/usenet/news.answers/sci-data-formats
           => ‘rtfm.mit.edu/pub/usenet/news.answers/sci-data-formats’
Resolving rtfm.mit.edu (rtfm.mit.edu)... 18.25.130.75
Connecting to rtfm.mit.edu (rtfm.mit.edu)|18.25.130.75|:21... failed: Operation 
timed out.
Retrying.
```

-- 
Reply to this email directly or view it on GitLab: 
https://gitlab.com/gnuwget/wget/-/issues/7#note_767563265
You're receiving this email because of your account on gitlab.com.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]