I am working on Android application that parses a website but I can’t seem

Question

0

Editorial Team

Asked: May 31, 20262026-05-31T08:51:13+00:00 2026-05-31T08:51:13+00:00

I am working on Android application that parses a website but I can’t seem

0

I am working on Android application that parses a website but I can’t seem to get Jsoup to work.

I am trying to parse this html:

Here’s a pic

My code just now is:

Document doc = null;
      try{
     doc = Jsoup.connect("URL").get();
      Elements tds = doc.select("table.tr>td");

     for (Element td : tds) {
       String tdText = td.text();
       System.out.println(tdText);
     }
    }

At the moment it does not return anything but if I print ‘doc’ it return the whole website.

I am trying to extract the following information:
Drower, E. S. (Ethel Stefana), Lady, b. 1879, With or without the &nbsp.

But I can’t seam to get it to work.

Thanks for your help!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-31T08:51:14+00:00

You got the selector wrong: it picks td children of a table element with class tr, while you probably want td cells in tr rows in a table. I believe you could get at them just by using "td" as selector.

However, that’s a bit too generic, since it’s going to pick every cell in the table. If the cell you need is always the third cell in the rows of that table, you can refine the selector to pick only those: "td:eq(2)". You should really get a knack of JSoup selectors, and experiment a little bit to see how much you are able to restrict the data extracted from the document to just the elements you really need.

To obtain the text after the <script> element in the fourth cell you could use something along the following snippet:

Element td = doc.select("td:eq(3)").first();
System.out.println(td.text());

because, from a little experiment of mine, it seems that JavaScript code inside <script> tags is skipped when asking the text of an element that contains one of those.

You would use a for loop rather than first, though, since there are as many fourth cells as there are rows in your document, and you got a lot of them.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am working on Android application that parses a website but I can’t seem

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply