From: @omos
Subject: wget2 | wget2 exits with 8 when the robots.txt job redirects (302) and the subsequent job receives HTTP error 404 (#677)
Date: Fri, 09 Aug 2024 20:14:53 +0000
Ondrej Mosnáček created an issue: https://gitlab.com/gnuwget/wget2/-/issues/677
It seems the code doesn't handle the case where the robots.txt request is
redirected (302) and the redirect target returns 404. The second (redirected)
job appears to be treated differently from the original robots.txt job, so it
ends up calling `set_exit_status(EXIT_STATUS_REMOTE)`, even though without the
redirect a 404 error for robots.txt would be ignored (if I'm reading the code
right).
To reproduce this, you can try to recursively download files from e.g.
`https://download.fedoraproject.org/pub/fedora/linux/releases/40/Cloud/x86_64/images`
(though the behavior may differ depending on the mirror that
`download.fedoraproject.org` picks for you):
```
wget2 -nd -nH -H -r -l 1 -A '*-CHECKSUM' \
  https://download.fedoraproject.org/pub/fedora/linux/releases/40/Cloud/x86_64/images
```
The above command exits with return code 8, even though the only failure
seems to be the robots.txt fetch.
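To check which failure class wget2 is reporting, the exit status can be captured and decoded right after the run. The following is only a rough sketch: the helper names `wget2_exit_meaning` and `run_repro` are hypothetical, and the code-to-meaning table is my reading of the "Exit Status" section of the wget2 manual (only codes 0 to 8 are covered).

```sh
#!/bin/sh
# Hypothetical helper: map a wget2 exit status to its documented meaning.
# Assumption: the table follows the wget2 manual; codes above 8 are not covered.
wget2_exit_meaning() {
    case "$1" in
        0) echo "no problems occurred" ;;
        1) echo "generic error" ;;
        2) echo "parse error" ;;
        3) echo "file I/O error" ;;
        4) echo "network failure" ;;
        5) echo "SSL verification failure" ;;
        6) echo "username/password authentication failure" ;;
        7) echo "protocol error" ;;
        8) echo "server issued an error response" ;;
        *) echo "unknown exit status: $1" ;;
    esac
}

# Hypothetical wrapper: run the reproduction command from this report and
# print the exit status together with its meaning.
run_repro() {
    wget2 -nd -nH -H -r -l 1 -A '*-CHECKSUM' \
        https://download.fedoraproject.org/pub/fedora/linux/releases/40/Cloud/x86_64/images
    status=$?
    echo "wget2 exited with $status: $(wget2_exit_meaning "$status")"
    return "$status"
}
```

Calling `run_repro` on a mirror that triggers the problem should report status 8 even when all the `*-CHECKSUM` files were downloaded; on other mirrors the result may differ.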