[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-wget] question related to download file in Plant Journal
From: |
Ray Satiro |
Subject: |
Re: [Bug-wget] question related to download file in Plant Journal |
Date: |
Tue, 18 Jan 2011 10:24:17 -0800 (PST) |
--- On Mon, 1/17/11, jinxiang wang <address@hidden> wrote:
[...]
>
> My problem is that I just download the
> html file not the PDF file. But if
> I use firefox, I can save the linked file to PDF file. So
> please help me.
> Thank you in advance!
>
> Jinxiang
>
Hello,
You have to login to download PDF files from that site. Please review the FAQ:
http://wget.addictivecode.org/FrequentlyAskedQuestions?action=show&redirect=Faq#How_do_I_use_wget_to_download_pages_or_files_that_require_login.2BAC8-password.3F
As it says the easiest way is to use your browser's plaintext cookies file
after logging in. If you are using Firefox 3 or later you'll have to export
your cookies to plaintext. There is an extension available that will export
your cookies:
https://addons.mozilla.org/en-US/firefox/addon/cookie-exporter/
That extension will put an 'Export Cookies' entry in one of your menus; the
menu labeled Tools or its chinese equivalent. See the screenshot on the page.
Then invoke like this (all one line):
wget --load-cookies=cookies.txt
http://onlinelibrary.wiley.com/doi/10.1111/j.1365-313X.2010.04411.x/pdf
If it fails you might have to pass in your user agent and/or referer as well
and invoke like this (all one line):
wget --load-cookies=cookies.txt --user-agent="Mozilla/5.0 (Windows; U; Windows
NT 5.1; en-US; rv:1.9.2.13) Gecko/20101203 Firefox/3.6.13"
--referer="http://onlinelibrary.wiley.com/doi/10.1111/j.1365-313X.2010.04411.x/full"
"http://onlinelibrary.wiley.com/doi/10.1111/j.1365-313X.2010.04411.x/pdf"