I have a web crawler that saves information to a database as it crawls


Asked: June 7, 2026 (2026-06-07T23:36:23+00:00)


I have a web crawler that saves information to a database as it crawls the web. While it does this, it also saves a log of its actions, and any errors it encounters, to a log field in a MySQL database (the field grows to anywhere from 64 KB to 100 KB). It does this by concatenating each new entry onto the existing field with the MySQL CONCAT function.

This seems to work fine, but I am concerned about the CPU usage and the impact it has on the MySQL database. I've noticed that the crawler runs slower than it did before I started saving the log to the database.

I view this log file from a management webpage, and the current implementation seems to work fine other than the slow loading. Any recommendations for speeding this up, or implementation recommendations?


1 Answer

Editorial Team · Answer 1 · 2026-06-07T23:36:26+00:00

You are reading 100 KB strings into memory numerous times and then writing them to disk through a database. Of course you're going to experience slowdown! Every part of what you are doing taxes memory, disk, and CPU (especially if memory usage hits the system maximum and you start swapping to disk). Let me count some of the ways you could be decreasing overall site performance:

1. SQL connections max out and back up, because the time needed to store a 100 KB record increases the time a single process holds a connection.
2. Web server processes use up the free process pool, max out, and take longer to free up because they have to wait for database connections to become available.
3. Web server processes begin to bloat and take more memory each, possibly more than the system can handle without swapping. This is compounded by running the maximum number of processes because of #2.
… A book could be written on your situation.
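
To put a rough number on the cost described above, here is a minimal sketch. It rests on assumptions that are not in the question: that each CONCAT update writes the whole grown value back, and that storing one row per log entry would be an acceptable alternative. The table name `crawl_log`, the column names, and the helper functions are hypothetical, and `sqlite3` stands in for MySQL only so the example runs on its own.

```python
import sqlite3


def bytes_written_concat(entry_sizes):
    """Bytes written if every append rewrites the whole, grown field (assumption)."""
    total = 0
    field_size = 0
    for size in entry_sizes:
        field_size += size      # the field grows by one entry
        total += field_size     # the full value is written back
    return total


def bytes_written_rows(entry_sizes):
    """Bytes written if every entry is stored once, as its own row."""
    return sum(entry_sizes)


def open_log(conn):
    # Hypothetical schema: one row per log entry instead of one big field.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS crawl_log ("
        " id INTEGER PRIMARY KEY AUTOINCREMENT,"
        " crawl_id INTEGER NOT NULL,"
        " message TEXT NOT NULL)"
    )


def append_entry(conn, crawl_id, message):
    # Appending is a small INSERT; existing entries are never rewritten.
    conn.execute(
        "INSERT INTO crawl_log (crawl_id, message) VALUES (?, ?)",
        (crawl_id, message),
    )


def read_log(conn, crawl_id):
    # The management page can rebuild the full log text on demand.
    rows = conn.execute(
        "SELECT message FROM crawl_log WHERE crawl_id = ? ORDER BY id",
        (crawl_id,),
    )
    return "\n".join(row[0] for row in rows)


if __name__ == "__main__":
    sizes = [100] * 1000  # 1,000 entries of 100 bytes each, 100 KB in total
    print(bytes_written_concat(sizes), bytes_written_rows(sizes))

    conn = sqlite3.connect(":memory:")
    open_log(conn)
    append_entry(conn, 1, "fetched page")
    append_entry(conn, 1, "error: timeout")
    print(read_log(conn, 1))
```

Under these assumptions, a log built from 1,000 entries of 100 bytes costs about 50 MB of writes when the whole field is rewritten each time, against 100 KB when each entry is inserted once. Whether a per-entry table suits this crawler depends on details the question does not give.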
