I’m currently trying to gather some data from PolitiFact using Simple HTML DOM, but


Asked: June 10, 2026


I’m currently trying to gather some data from PolitiFact using Simple HTML DOM, but much of the time I get strange output instead of the expected HTML.
The goal is not to brute-force the site but to request it once or twice a day and cache the result.
Here is most of what I get back:

‹������í]{wÛ6²ÿ»=g¿ªn#»1EËJœÄ–µ×vœ&ÙÄñÚn²{r{|(  ’S$Ã‡euÛï~3à¤¨‡c'ÛísNÄ`f0˜Úß=}sxþ¯“#1ŠÆŽ8ùùàÕ‹CQ3Ló]ëÐ4Ÿž?ÿ|~þú•h66Åy`¹¡Ùžk9¦yt\µQù;¦9™L“...

And here’s the very simple code:

$html = file_get_html('http://www.politifact.com/personalities/barack-obama');
print_r($html->plaintext);

Do you have any idea why?
Is there some sort of protection or redirection on the website side?

Thank you very much!


1 Answer

Editorial Team · Answer 1 · 2026-06-10T04:04:28+00:00

You received the expected page, but gzip-compressed. It looks like the server does not care whether the request includes an Accept-Encoding header: instead of falling back to an uncompressed response, it sends gzipped data anyway.

I don’t think simple-html-dom can unzip the data, but you can use cURL for that purpose:

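// Fetch the page with cURL, which can decompress the response itself.
$ch = curl_init('http://www.politifact.com/personalities/barack-obama/');
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true); // return the body instead of printing it
curl_setopt($ch, CURLOPT_ENCODING, 'gzip');     // send Accept-Encoding: gzip and decode the reply

$data = curl_exec($ch);
if ($data === false) {
    die('cURL error: ' . curl_error($ch));
}

// Parse the decoded HTML with simple-html-dom.
$html = str_get_html($data);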
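
If cURL is not available, a rough alternative is to check whether the body is gzip and decode it yourself. Every gzip stream starts with the bytes 0x1F 0x8B, and the dump in the question starts with ‹, which is how byte 0x8B renders in Windows-1252, so it is consistent with that signature. The sketch below assumes PHP's zlib extension is loaded; `decode_if_gzipped` is a hypothetical helper name, not part of simple-html-dom.

```php
<?php
// Hypothetical helper (not part of simple-html-dom): returns the body
// decompressed if it looks like gzip, otherwise returns it unchanged.
function decode_if_gzipped(string $body): string
{
    // Every gzip stream starts with the two magic bytes 0x1f 0x8b.
    if (strncmp($body, "\x1f\x8b", 2) === 0) {
        $decoded = gzdecode($body); // requires the zlib extension
        if ($decoded !== false) {
            return $decoded;
        }
    }
    return $body;
}

// Offline check with a locally compressed string, so no network is needed.
$sample = gzencode('<html><body><p>Hello</p></body></html>');
echo decode_if_gzipped($sample), "\n";
```

With a real page you could pass the raw response through the helper before parsing, for example `str_get_html(decode_if_gzipped(file_get_contents($url)))`, assuming `allow_url_fopen` is enabled. The cURL approach above is still the simpler option, since cURL handles the decoding for you.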
