I need to create a tool that can log in to a website, read the HTML, perhaps navigate to another page, and ultimately pull data down from the page (and export it to a file, or keep it “in memory” for more processing, etc). I will be doing this on Mac OS. Are there any suggestions on how best to do this these days? In the past I have done this in .NET or Java using the web libraries. I need to be able to login to a site that uses HTTPS.
Share
Take a look at HTTPUnit. It’s an easy solution for emulating a web browser using Java code and you’ll be up and running fairly quickly if you’re already familiar with Java and it does support HTTPS