I am currently using Jsoup to parse a html. The code is quite simple:

Question

0

Asked: June 1, 20262026-06-01T11:45:15+00:00 2026-06-01T11:45:15+00:00

I am currently using Jsoup to parse a html. The code is quite simple:

0

I am currently using Jsoup to parse a html. The code is quite simple:

Document doc = null;
    try{
        doc = Jsoup.connect(link).get();    
    }
    catch (Exception e) {
        //System.out.println("Some error occured.");
        textView.setText(e.getMessage());
    }

It do gives me the webpage I want, later I can extract the data I need from that webpage with it’s getElementsByTag method and so on. However, I only want to use part of the webpage, for example, I wish to abandon everything after < ! — / foo –> in my webpage. (Actually It’s does not have blank between < and !, but I can’t type that here.) Is there any way of abandon the webpage after that string and get the new Document with only the part I want? I checked the cookbook, but it seems only process the webpage in it’s structure, so I am not quite sure is it OK to do something like string remove. Thanks for your reading.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-01T11:45:17+00:00

Editorial Team

2026-06-01T11:45:17+00:00Added an answer on June 1, 2026 at 11:45 am

You can use Document doc = Jsoup.parse(html) where HTML is a page HTML. I.e. take HTML first by

   Connection connect = Jsoup.connect(url);
   Connection.Response response = connect.execute();
   String html = response.body();

then do whatever operations you need (e.g. cut HTML after marker, but add necessary closing HTML tags), then

   Document doc = Jsoup.parse(html)

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am currently using Jsoup to parse a html. The code is quite simple:

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply