Possible Duplicate: Grabbing the href attribute of an A element Im trying to match

Question

0

Editorial Team

Asked: May 25, 20262026-05-25T05:51:21+00:00 2026-05-25T05:51:21+00:00

Possible Duplicate: Grabbing the href attribute of an A element Im trying to match

0

Possible Duplicate:
Grabbing the href attribute of an A element

Im trying to match up in page source :

 <a href="/download/blahbal.html">

I have looked at one other link on this site and used the regex :

   '/<a href=["\']?(\/download\/[^"\'\s>]+)["\'\s>]?/i'

which returns all href links on the page but it misses off the .html on some links.

Any help would be greatly appreciated.

Thank you

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-25T05:51:21+00:00

First use the method described here to retrieve all hrefs, then you can use a regex or strpos to “filter out” those who don’t start with /download/.
The reason why you should use a parser instead of a regex is discussed in many other posts on stack overflow (see this). Once you parsed the document and got the hrefs you need, then you can filter them out with simple functions.

A little code:

$dom = new DOMDocument;
//html string contains your html
$dom->loadHTML($html);
//at the end of the procedure this will be populated with filtered hrefs
$hrefs = array();
foreach( $dom->getElementsByTagName('a') as $node ) {
    //look for href attribute
    if( $node->hasAttribute( 'href' ) ) {
        $href = $node->getAttribute( 'href' );
        // filter out hrefs which don't start with /download/
        if( strpos( $href, "/download/" ) === 0 )
            $hrefs[] = $href; // store href
    }
}

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Possible Duplicate: Grabbing the href attribute of an A element Im trying to match

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply