One website stores information in ids, like that http://website.com/index.php?id=1. It has about 500,000 records.
I need to retrieve them and put it in database (MySQL). Now I use curl and it gets that information, but it’s very slow. I got only ~4,000 records in one hour.
My question, is there any way to do this faster?
If you’re not against using perl, the CPAN module Web::Scraper is the conventional way for scraping pages efficiently. See http://metacpan.org/pod/Web::Scraper .