[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] sudo wget --accept sandwich
From: |
Tim Ruehsen |
Subject: |
Re: [Bug-wget] sudo wget --accept sandwich |
Date: |
Wed, 10 Feb 2016 15:56:46 +0100 |
User-agent: |
KMail/4.14.10 (Linux/4.3.0-1-amd64; KDE/4.14.14; x86_64; ; ) |
Hi Marcel,
wget downloads
"http://www.mfd.mw.tu-dresden.de/mfd/index.php/lehre/wintersemester/videos-winter",
parses it for
URLs and in the end this document becomes removed because it doesn't fit your
settings.
Your '-A asx' means: Just download/keep *.asx files
Your '--accept-regex videos' means: Just download/keep *video* files
That means, you only want to download *video*.asx files.
Is that what you want ?
Did you try -A '*.pdf' -R '*.asx' ? (Quotes to avoid shell wildcard
expansion.)
Sorry, but I can only reach the login page... else I could have a look deeper
inside. Right now I just have to guess.
Maybe you could give us an example HTML page.
Tim
On Wednesday 10 February 2016 09:01:20 Marcel Partap wrote:
> Dear wget devs,
> failed to resolve the following issue on my own, thus asking for
> assistance. I want to download all .asx-files :-( linked from an uni
> internal page. The page doesn't have a suffix.
>
> > wget --load-cookies cookies.txt -A asx -mk -np -l 1 --accept-regex videos
> > -A asx
> > http://www.mfd.mw.tu-dresden.de/mfd/index.php/lehre/wintersemester/videos
> > -winter
> This will throw
>
> > Removing
> > www.mfd.mw.tu-dresden.de/mfd/index.php/lehre/wintersemester/videos-winter
> > since it should be rejected.
> and exit there.
> --accept-regex videos-winter on the other side can not be combined with
> --accept asx it seems.
> So how do I get around this. This should be a common scenario, i.e. get
> all linked PDFs etc. from a web page, skipping all the links to
> dynamically generated CMS pages in the menu often placed before the page
> content.
> & IMHO, wget should simply never filter out URLs explicitly given to it
> on the command line.
> #Best Regards/MPartap
signature.asc
Description: This is a digitally signed message part.