bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Rejecting 'index.html*' files causes recursion to include parent-directo


From: Carl Ponder
Subject: Rejecting 'index.html*' files causes recursion to include parent-directories
Date: Mon, 7 Aug 2023 12:35:18 -0500
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.14.0


I'm running wget version 1.20.3 (and earlier) using this command-line

   wget -P dir -r -nH -np --cut-dirs 3 https://svn.site.org/X/Y/Z

to retrieve the contents of the remote-directory "Z" into local directory "dir". This works fine except that i also get files "index.html" in all the sub-directories, that I don't want. Yeah, I know I can delete them afterward, but is there a way to just filter them out in the first place?
If I try this form

   wget -P dir -r -R 'index.html*' -nH -np --cut-dirs 3
   https://site.org/X/Y/Z

I find that it's downloading subdirectories from the parent levels as well, even though I set the -np parameter.


reply via email to

[Prev in Thread] Current Thread [Next in Thread]