[Bug-wget] Wget Starting Questions

bug-wget

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] Wget Starting Questions

From:	Jason Todd Slack-Moehrle
Subject:	[Bug-wget] Wget Starting Questions
Date:	Sun, 19 Apr 2009 09:26:29 -0700

Hi All,

I have some starting Wget questions that I am hoping to gain insightabout.

I want to start at Dmoz.org and follow links for entertainment (likeconcerts, art gallery events, etc) and examine the link to see if Ishould get data back about it and from it.


My questions:

1. Can Wget start at a given URL and examine every link (based upon mycriteria)? (obviously I can write Case or If/Else or While to do this)

2. If I find a link that has certain keywords that I find of interest,can I hit that link of interest and get information from that page?

3. How do I get the information about the link of interest and itscontent of interest into a MySQL database? (I know ColdFusion andMySQL and PHP). I think what I am asking is how do I get back to mydatabase from a crawler?

4. I bought Webbots, spiders and screen scrapers in PHP and so far itis interesting, but I am wondering what best practices are..


Am I making any sense?

-Jason

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-wget] Wget Starting Questions, Jason Todd Slack-Moehrle <=
- Re: [Bug-wget] Wget Starting Questions, Micah Cowan, 2009/04/19

Prev by Date: Re: [Bug-wget] Fails to build on HP-UX B.10.20 A
Next by Date: Re: [Bug-wget] Wget Starting Questions
Previous by thread: [Bug-wget] Fails to build on HP-UX B.10.20 A
Next by thread: Re: [Bug-wget] Wget Starting Questions
Index(es):
- Date
- Thread