EDIT the answer below is outdated. AsyncController has been part…

Question

0

Asked: May 12, 20262026-05-12T12:28:25+00:00 2026-05-12T12:28:25+00:00

We have this PHP application which selects a row from the database, works on

0

We have this PHP application which selects a row from the database, works on it (calls an external API which uses a webservice), and then inserts a new register based on the work done. There’s an AJAX display which informs the user of how many registers have been processed.

The data is mostly text, so it’s rather heavy data.

The process is made by thousands of registers a time. The user can choose how many registers to start working on. The data is obtained from one table, where they are marked as “done”. No “WHERE” condition, except the optional “WHERE date BETWEEN date1 AND date2”.

We had an argument over which approach is better:

Select one register, work on it, and insert the new data
Select all of the registers, work with them in memory and insert them in the database after all the work was done.

Which approach do you consider the most efficient one for a web environment with PHP and PostgreSQL? Why?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-12T12:28:25+00:00

It really depends how much you care about your data (seriously):

Does reliability matter in this case? If the process dies, can you just re-process everything? Or can’t you?

Typically when calling a remote web service, you don’t want to be calling it twice for the same data item. Perhaps there are side effects (like credit card charges), or maybe it is not a free API…

Anyway, if you don’t care about potential duplicate processing, then take the batch approach. It’s easy, it’s simple, and fast.

But if you do care about duplicate processing, then do this:

SELECT 1 record from the table FOR UPDATE (ie. lock it in a transaction)
UPDATE that record with a status of “Processing”
Commit that transaction

And then

Process the record
Update the record contents, AND
SET the status to “Complete”, or “Error” in case of errors.

You can run this code concurrently without fear of it running over itself. You will be able to have confidence that the same record will not be processed twice.

You will also be able to see any records that “didn’t make it”, because their status will be “Processing”, and any errors.

How to approach applying for a job at a company ...

What is a programmer’s life like?

How to handle personal stress caused by utterly incompetent and ...

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions