I’m writing a program (in Java) that needs to extract links from webpages. I’m using htmlParser (http://htmlparser.sourceforge.net/) but I’m only able to extract html links (defined with <a href="...">) and I don’t know how to handle javascript code to extract links from… can you help me??
I’m writing a program (in Java) that needs to extract links from webpages. I’m
Share
You can use Rhino with DOM environment, written in JavaScript.
By the way it is written by John Resig.