I am wondering if there is a way to read the html output of

Question

0

Editorial Team

Asked: June 3, 20262026-06-03T03:38:03+00:00 2026-06-03T03:38:03+00:00

I am wondering if there is a way to read the html output of

0

I am wondering if there is a way to read the html output of a given webpage using Java?

I know in php you can do something like:

$handle = @fopen("'http://www.google.com", "r");
$source_code = fread($handle,9000);

I am looking for the Java equivalent.

Additionally, once I have the rendered html are there any Java utilities that would allow me to strip out a single div by its id?

Thanks for any help with this.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-03T03:38:05+00:00

Use jsoup.

You have the choice between a tree model and a powerful query syntax similar to CSS or jQuery selectors, plus utility methods to quickly get the source of a webpage.

To quote from their website:

Fetch the Wikipedia homepage, parse it to a DOM, and select the
headlines from the In the news section into a list of Elements:
Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
Elements newsHeadlines = doc.select("#mp-itn b a");

Once you found the Element representing the div you want to remove, just call remove() on it.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am wondering if there is a way to read the html output of

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply