For a Google sitemap XML, I need all document id’s collected by Sphinx. But


Asked: May 27, 2026


For a Google sitemap XML, I need all document IDs collected by Sphinx. But with 1000+ documents, if I try to get them all in a simple loop, it eventually fails with: Error: searchd error: offset out of bounds (offset=1000, max_matches=1000).

I could increase the max_matches setting, but that would kill performance.

And I don’t want to simply run a MySQL query, because there’s a UNION and a bunch of checks/rules in the Sphinx indexer query, and I want to keep that query in one place for maintainability.

So what I’ve done for now is run one Sphinx query per category, filtered on that category (I need the categories for the sitemap too). That way each query stays below the 1000-document limit.

There must be a better solution for this. Right?

1 Answer

Editorial Team · Answer 1 · 2026-05-27T19:30:41+00:00


I’ve posted PHP code for this here:
http://sphinxsearch.com/forum/view.html?id=7215

Basically, you just retrieve the results 1000 documents at a time in a while loop. Sitemaps don’t care about the order of entries in the file, so it doesn’t matter that you have to fetch the results in document_id order.
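The linked forum code is not reproduced here, so the following is only a rough sketch of the loop described above, written in Python rather than PHP. It assumes each page is fetched sorted by document ID ascending and restricted to IDs above the last one already seen, so the offset always stays at 0 and never runs into max_matches. The names iter_all_ids, fetch_page and make_fake_fetcher are hypothetical; fetch_page stands in for whatever call you actually make to searchd (for example a query sorted by ID with a minimum-ID filter), and the fake fetcher exists only to make the sketch runnable.

```python
from typing import Callable, Iterable, Iterator, List

PAGE_SIZE = 1000  # matches the max_matches=1000 limit from the error message


def iter_all_ids(fetch_page: Callable[[int, int], List[int]],
                 page_size: int = PAGE_SIZE) -> Iterator[int]:
    """Yield every document ID, one page at a time, in ascending ID order.

    fetch_page(min_id, limit) is a hypothetical callback: it must return at
    most `limit` document IDs that are >= min_id, sorted ascending.
    """
    min_id = 0
    while True:
        page = fetch_page(min_id, page_size)
        if not page:
            break  # nothing left to fetch
        yield from page
        if len(page) < page_size:
            break  # a short page means this was the last one
        # Next page starts just above the highest ID seen so far,
        # so no offset is ever needed.
        min_id = page[-1] + 1


def make_fake_fetcher(all_ids: Iterable[int]) -> Callable[[int, int], List[int]]:
    """In-memory stand-in for searchd, for demonstration only."""
    ordered = sorted(all_ids)

    def fetch_page(min_id: int, limit: int) -> List[int]:
        return [doc_id for doc_id in ordered if doc_id >= min_id][:limit]

    return fetch_page


if __name__ == "__main__":
    fetch = make_fake_fetcher(range(1, 2502))  # 2501 pretend documents
    ids = list(iter_all_ids(fetch))
    print(len(ids), ids[0], ids[-1])
```

In a real setup, fetch_page would wrap the Sphinx client call, and the sitemap writer would consume iter_all_ids directly, so the indexer query stays the single source of truth.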
