Possible Duplicate: How to parse and process HTML with PHP? I’m trying to scrape

Question

0

Asked: June 15, 20262026-06-15T00:22:15+00:00 2026-06-15T00:22:15+00:00

Possible Duplicate: How to parse and process HTML with PHP? I’m trying to scrape

0

Possible Duplicate:
How to parse and process HTML with PHP?

I’m trying to scrape a page with PHP using file_get_contents().
This page has some JSON wrapped in a bit of HTML. I’d like to strip out this HTML to be able to use json_decode() on the scraped string so I can deal with the JSON separately.
Is there any clean way to do that? A quick search didn’t really lead to anything.
Thanks

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-15T00:22:16+00:00

parsing/stripping HTML content is always a tricky one because (common?) solutions via regex might crash if the HTML markup is malformed and are painful slow btw. I would suggest using this little HTML DOM parser class:

http://simplehtmldom.sourceforge.net/

edited & added from subcomment:

Okay this is a bad one because the inline javascript is not properly wrapped with CDATA-Tags. Otherwise something like this might work:

$html = new simple_html_dom();
$html->load_file('your-external-file');

foreach($html->find("script") as $obj) {
    if(isset($obj->innertext) && strpos($obj->innertext, 'window._jscalls'))
        echo $obj->innertext;
}

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Possible Duplicate: How to parse and process HTML with PHP? I’m trying to scrape

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply