How can I get the title of a web page for a given URL

Question

0

Asked: May 15, 20262026-05-15T17:15:04+00:00 2026-05-15T17:15:04+00:00

How can I get the title of a web page for a given URL

0

How can I get the title of a web page for a given URL using an HTML parser? Is it possible to get the title using regular expressions? I would prefer to use an HTML parser.

I am working in the Java Eclipse IDE.

I have tried using the following code, but was unsuccessful.

Any ideas?

Thank in advance!

import org.htmlparser.Node;

import org.htmlparser.Parser;

import org.htmlparser.util.NodeList;

import org.htmlparser.util.ParserException;

import org.htmlparser.tags.TitleTag;    

public class TestHtml {

public static void main(String... args) {
    Parser parser = new Parser();     
    try {
        parser.setResource("http://www.yahoo.com/");
        NodeList list = parser.parse(null);
        Node node = list.elementAt(0);

        if (node instanceof TitleTag) {
           TitleTag title = (TitleTag) node;


            System.out.println(title.getText());

        }

    } catch (ParserException e) {
        e.printStackTrace();
    }
}

}

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-15T17:15:05+00:00

Editorial Team

2026-05-15T17:15:05+00:00Added an answer on May 15, 2026 at 5:15 pm

According to your (redefined) question, the problem is that you only check the first node Node node = list.elementAt(0); while you should iterate over the list to find the title (which is not the first). You could also use a NodeFilter for your parse() to only return the TitleTag and then the title would be in the first and you wouldn’t have to iterate.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

How can I get the title of a web page for a given URL

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply