I’m writing the code to access and download data from some web sites using HTMLUnit.
I was wondering whether a tool that could help me exists. I think maybe to a visual tool, something that lets you express the path you want to follow on the web sites visually and generates the Java code.
I’d like something for HTMLUnit, but I’m open to other solutions.
Writing screen-scraping code gets very repetitive 🙁
I used to work at screen-scraper. Not sure if it’s quite what you’re looking for, but it’s definitely a powerful tool for building web-scraping scripts.