I’ve again got a strange problem: I’m writing a crawler to index a specific

Question

0

Asked: June 5, 20262026-06-05T13:29:31+00:00 2026-06-05T13:29:31+00:00

I’ve again got a strange problem: I’m writing a crawler to index a specific

0

I’ve again got a strange problem:

I’m writing a crawler to index a specific site. For some weeks it worked fine and I only ran into problems when sending too many requests per hour.

But now I can’t even access a single page.

But what’s even stranger: I have to submit some form values via POST, but the server returns a 404 error – although the URL is definitely correct.

I implemented many techniques to prevent beeing recognized as a bot: changing user-agent, delays, and I’m sending a Referer-header to pretend the form was submitted from their own website.

May this again be a Spam- or DDOS-protection on their server? Or are there other possible sources of error?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-05T13:29:34+00:00

Editorial Team

2026-06-05T13:29:34+00:00Added an answer on June 5, 2026 at 1:29 pm

Okay, just solved it.

A very strange behaviour of the remote server caused the problem: when sending more parameters than expected, it returned 404 instead of ignoring not needed parameters.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’ve again got a strange problem: I’m writing a crawler to index a specific

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply