This is the problem: The script I use stops looking at the first tag.

Question

0

Editorial Team

Asked: May 28, 20262026-05-28T14:13:28+00:00 2026-05-28T14:13:28+00:00

This is the problem: The script I use stops looking at the first tag.

0

This is the problem: The script I use stops looking at the first tag.

I’m sceaping a website, and this is the part of the site I want to ‘extract’.

<div class="i-want-this-div">
    <div class="annoying-sub-div">
        Bla bla bla  
    </div>
    <div class="annoying-sub-div">
        etc...
    </div>
    <div class="annoying-sub-div">
    </div>
    <div class="annoying-sub-div">
    </div>
    <div class="annoying-sub-div">
    </div>
</div>

I want to display all those ‘annoying'(because they mess up the function of the script by being there) divs on my site, but how do I do this?

This is my current approach: get the position of the first tag, get the position of the closing tag and subtract that part form the entire string that holds the whole website source.

$startPos     = strpos($siteIAmScreaping, '<div class="i-want-this-div">');
$endPos       = strpos($siteIAmScreaping, '</div>', $startPos) + 8;
$annoyingDivs = substr($siteIAmScreaping, $startPos, $endPos-$startPos);

The problem is: I want it to stop on the main divs closing tag and not on the first closing tag it finds.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-28T14:13:28+00:00

Editorial Team

2026-05-28T14:13:28+00:00Added an answer on May 28, 2026 at 2:13 pm

Use querypath (or phpquery) for simplicity. You can then extract the <div> content by class or id most easily:

 print htmlqp($page)->find("div.i-want-this-div")->html();

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

This is the problem: The script I use stops looking at the first tag.

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply