I’m a Scrapy & Xpath beginner and I’m looking to parse a website with

Asked: June 4, 2026

I’m a Scrapy & XPath beginner and I’m looking to parse a website with the following structure:

<dl class="ismSummary ismHomeSummary">
        <dt>cat1</dt>
            <dd>value1</dd>
            <dd>value2</dd>
        <dt>cat2</dt>
            <dd>value1</dd>
            <dd>value2</dd>
</dl>

With XPath I only want to get value1 & value2 (the dd's) of cat1.

This is what I have right now

//dt[text()="cat1"]/following-sibling::dd

The problem is that it doesn’t stop at cat2 and continues selecting value1 & value2 from cat2. 🙁

1 Answer

Editorial Team · Answer 1 · 2026-06-04T14:12:42+00:00

All nodes here are children of dl, so every dd is a sibling of the first dt. That is why following-sibling returns them all, including the dds that come after cat2.

XPath was designed with XML in mind, and in XML you would probably have the dd elements as children of dt, but unfortunately that’s not the case here.

The easiest way would be to select all following siblings of the dt (not just the dds) and iterate through the result set until the next dt comes up. Doing the same with XPath functions alone could be possible, but is certainly more complicated.
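The iteration approach can be sketched roughly as follows. This is a minimal illustration rather than Scrapy-specific code: it assumes the markup from the question is well-formed enough for Python's standard-library `xml.etree.ElementTree` parser, and the helper name `dd_values_for` is made up for this example. In a spider, the same loop would run over the selectors returned by `response.xpath(...)` instead.

```python
import xml.etree.ElementTree as ET

# Sample markup copied from the question. It happens to be well-formed XML,
# so the standard-library parser can read it; real pages may need an HTML parser.
HTML = """
<dl class="ismSummary ismHomeSummary">
    <dt>cat1</dt>
        <dd>value1</dd>
        <dd>value2</dd>
    <dt>cat2</dt>
        <dd>value1</dd>
        <dd>value2</dd>
</dl>
"""


def dd_values_for(dl, category):
    """Return the text of each dd that follows the matching dt, up to the next dt.

    Hypothetical helper for illustration only.
    """
    values = []
    collecting = False
    for child in dl:  # direct children of dl, in document order
        if child.tag == "dt":
            # Every dt either starts the wanted group or ends it.
            collecting = (child.text or "").strip() == category
        elif child.tag == "dd" and collecting:
            values.append((child.text or "").strip())
    return values


dl = ET.fromstring(HTML)
cat1_values = dd_values_for(dl, "cat1")
print(cat1_values)
```

For the pure XPath route, one expression that should work in XPath 1.0 is `//dt[text()="cat1"]/following-sibling::dd[preceding-sibling::dt[1][text()="cat1"]]`. It keeps only the dds whose nearest preceding dt is cat1, which is indeed harder to read than the loop.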
