Does anyone know of a quick way that I can get information from a webpage in Java? For instance, if I’m looking at a page like this: http://www.ncbi.nlm.nih.gov/pubmed/?term=10952317 and i want to extract the list of words beneath the heading “MeSH Terms”, how would I go about doing so?
I have something that can read the source but it is full of HTML tags and such…
Any help is much appreciated!
As has been mentioned on here countless times before have a look at JSoup, which is a HTML parsing library for Java. Or write your own (not recommended).