So far I’ve been using wget and curl to do screen scraping. Now I would like to switch to perl. What’s a good tutorial that will cover basic web programming in perl (preferably without restating the basics)? I’m talking about basic things like getting and parsing pages, submitting forms, proxies, etc.
Share
Tools you will need besides Perl:
WWW::Mechanizemodule.HTML::TreeBuilder& especiallyHTML::TreeBuilder::XPathandHTML::Query. the last two will become very handy when you will want to get actual data from HTML documents.HTML::TableExtractis also a nice module to extract data from HTML tables when needed.basically, using all of the above will give you the ability to crawl most sites.
Have fun crawling (-: