Document doc = Jsoup.connect(http://www.utah.edu/).get(); Elements lists = doc.select(ul); for (Element list: lists) { Elements

Question

0

Asked: June 15, 20262026-06-15T06:02:16+00:00 2026-06-15T06:02:16+00:00

Document doc = Jsoup.connect(http://www.utah.edu/).get(); Elements lists = doc.select(ul); for (Element list: lists) { Elements

0

    Document doc = Jsoup.connect("http://www.utah.edu/").get();
    Elements lists = doc.select("ul");
    for (Element list: lists) {
        Elements li = list.select("li a");
        if (li.size() > 0) {
            ArrayList<String> anchors = new ArrayList<String>();
            for (Element e : li) {
                anchors.add(e.text());
            }
            System.out.println(anchors);
        }
    }

I’m trying to grab all html lists rendered by the ul tag from this page. But it failed. I suspect there’s script in the page preventing my program from doing so.

Edit: To make my question even simpler, consider the following code:

Document doc = Jsoup.connect("http://www.utah.edu/").get();
Elements lists = doc.select("ul");
System.out.println(lists.size());

Output:

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-15T06:02:17+00:00

A possible answer is that, the User-Agent header sent by jsoup made utah.edu think it’s a bot instead of a browser. So it returns other page content.

In org/jsoup/helper/HttpConnection.java implemented get(), which doesn’t send User-Agent header by default, unless told otherwise.

So you need manually set it by using userAgent().

Example, faking Chrome:

String ua = "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.11 (KHTML, like Gecko) Chrome/23.0.1271.95 Safari/537.11";
Document doc = Jsoup.connect("http://www.utah.edu/").userAgent(ua).get();

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Document doc = Jsoup.connect(http://www.utah.edu/).get(); Elements lists = doc.select(ul); for (Element list: lists) { Elements

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply