I’m using Html Agility Pack to perform a basic web scraping of Google search


Asked: June 10, 2026 (2026-06-10T11:29:05+00:00)


I’m using Html Agility Pack to perform basic web scraping of Google search results. As a newbie to XPath, I made sure my path expression is correct (with the help of FirePath). However, the returned HtmlNodeCollection is always null.

HtmlWeb web = new HtmlWeb();
HtmlAgilityPack.HtmlDocument htmlDoc = web.Load("http://www.google.com/search?num=10&q=Hello+World");

// get search result URLs
var items = htmlDoc.DocumentNode.SelectNodes("//div[@id='ires']/ol[@id='rso']/li/div[@class='vsc']/h3/a/@href");

foreach (HtmlNode node in items)
{
    Console.WriteLine(node.Attributes);
}

Am I missing something? Can anyone please enlighten me?

Thanks in advance,


1 Answer

Editorial Team · Answer 1 · 2026-06-10T11:29:06+00:00

HAP can only process the raw HTML that is returned from the URL; it does not run any JavaScript that is on the page. You need to adjust your query accordingly.

In the raw HTML, the ires div exists, but the rso element is not inserted until the JavaScript runs, so your expression matches nothing and SelectNodes returns null. The scripts make other changes to the markup as well, and your query has to account for those too.
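One way to confirm this for yourself is to ask the parsed document which ids it actually contains. The following is only a minimal sketch: the class name RawHtmlCheck and the method HasId are made up for illustration, and it assumes the id value contains no single quote.

```csharp
using HtmlAgilityPack;

// Hypothetical helper (not part of HAP): reports whether the raw,
// script-free HTML that HAP parsed contains an element with a given id.
public static class RawHtmlCheck
{
    public static bool HasId(HtmlDocument doc, string id)
    {
        // SelectSingleNode returns null when nothing matches.
        // Assumes id contains no single quote.
        return doc.DocumentNode.SelectSingleNode("//*[@id='" + id + "']") != null;
    }
}
```

Calling RawHtmlCheck.HasId(htmlDoc, "ires") and RawHtmlCheck.HasId(htmlDoc, "rso") on the document you loaded with HtmlWeb should show which parts of your path exist before any script has run.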

Here’s a fragment of the HTML:

<div id="ires">
    <ol>
        <li class="g">
            <h3 class="r">
                <a href="...">...</a>

A more appropriate XPath expression for this markup would be:

var xpath = "//li[contains(concat(' ',@class,' '),' g ')]" +
            "/h3[contains(concat(' ',@class,' '),' r ')]" +
            "/a/@href";

It is easier to find all li elements with the g class, since those correspond to the search results. You then want to keep only the h3 elements with the r class; otherwise you would include other kinds of results (such as image results).
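Putting it together, here is a rough sketch of how the expression could be used. Two things are assumptions on my part rather than something taken from your code: the names ResultLinks and Extract are invented for the example, and the path stops at the a element instead of @href because, as far as I know, SelectNodes hands back HtmlNode elements rather than attribute values. The null check matters because SelectNodes returns null, not an empty collection, when nothing matches.

```csharp
using System.Collections.Generic;
using HtmlAgilityPack;

// Hypothetical helper (not part of HAP) that collects the result URLs.
public static class ResultLinks
{
    // Same expression as above, but ending at the <a> element; the href
    // attribute is read from each node afterwards.
    private const string LinkXPath =
        "//li[contains(concat(' ',@class,' '),' g ')]" +
        "/h3[contains(concat(' ',@class,' '),' r ')]" +
        "/a";

    public static List<string> Extract(HtmlDocument doc)
    {
        var links = new List<string>();

        // Null (not an empty collection) means no node matched.
        HtmlNodeCollection anchors = doc.DocumentNode.SelectNodes(LinkXPath);
        if (anchors == null)
        {
            return links;
        }

        foreach (HtmlNode anchor in anchors)
        {
            string href = anchor.GetAttributeValue("href", string.Empty);
            if (href.Length > 0)
            {
                // Decode entities such as &amp; in the attribute value.
                links.Add(HtmlEntity.DeEntitize(href));
            }
        }
        return links;
    }
}
```

With the htmlDoc from your question, you would loop over ResultLinks.Extract(htmlDoc) and write each string to the console. Since Google's markup changes over time, treat the class names g and r as a snapshot of the fragment above rather than something stable.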
