wget-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

wget2 | Force all pages to individual directories with index.html files


From: Jason Ullstam (@jason.ullstam)
Subject: wget2 | Force all pages to individual directories with index.html files (#591)
Date: Wed, 30 Mar 2022 20:08:00 +0000


Jason Ullstam created an issue: https://gitlab.com/gnuwget/wget2/-/issues/591



When using the following wget2 command 

wget2 --mirror -p --adjust-extension -e robots=off --base=./ -k -P ./ --debug 

I would like to have an option to be able to place every page as a directory 
instead of having the pagename.html format.  I have tried adding the 
--force-directories with no luck.  Is this an option that i'm missing or would 
this be something that could be added.  I have tried this with wget2 and also 
httrack.  httrack does place each page in individual directories but I would 
prefer to use wget2.  I have attached screenshots of the scrapes for reference. 
 

![httrack](/uploads/fc19dac6a36e387de6d7e67b5c564d1c/httrack.png)

![wget2](/uploads/7ab4bde902dcdad6c5253e7d715d0e1b/wget2.png)

-- 
Reply to this email directly or view it on GitLab: 
https://gitlab.com/gnuwget/wget2/-/issues/591
You're receiving this email because of your account on gitlab.com.




reply via email to

[Prev in Thread] Current Thread [Next in Thread]