I’m using Perl.
I have the tag, for example: “XYZ_PKM_HTML”
I would like to be able to provide a base url, for example: http://www.example.com
and the to get the HTML page (not necessarily the main page, thats easy) where this tag appears.
is it possible? any idea? (or already made modules, looked on cpan, there were some interesting stuff, but not installable)
Thanks,
MJD has an extended example on writing a web spider in Higher-Order Perl. It is section 4.7. See page 187 in Chapter 4.
Of course, you can also try the WWW::SimpleRobot module he mentions.