Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6165087
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 23, 20262026-05-23T22:06:18+00:00 2026-05-23T22:06:18+00:00

I created a crawler that will operate as a cron job. The object of

  • 0

I created a crawler that will operate as a cron job. The object of the crawler is to go through posts on my site and pull keywords from them.

Currently, I am optimizing the script for both speed and server load – but I am curious one what types of benchmarks for each are considered “good”?

For example, here are some configurations I have tested, running through 5,000 posts each time (you’ll notice the trade off between speed and memory):

Test 1 – script optimized for memory conservation:

Run time: 52 seconds
Avg. memory load: ~6mb
Peak memory load: ~7mb

Test 2 – script optimized for speed

Run time: 30 seconds
Avg. memory load: ~40mb
Peak memory load: ~48mb

Clearly the decision here is speed vs. server load. I am curious what your reactions are to these numbers. Is 40mb an expensive number, if it increases speed so drastically (and also minimizes MySQL connections?)

Or is it better to run the script slower with more MySQL connections, and keep the overhead memory low?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-23T22:06:18+00:00Added an answer on May 23, 2026 at 10:06 pm

    This is a really subjective question given that what is “tolerable” depends on many factors such as how many concurrent processes will be running, the specs of the hardware it’ll be running on, and how long you expect it to take.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I would like to run a crawler that can handle javascript created html in
Created .NET WCF service, tested it - works. Generated schemas from Data and service
I just downloaded Scrapy (web crawler) on Windows 32 and have just created a
I want a list of urls from where my crawler can start crawling efficiently
I want to add a trigger that will keep on updating autoincrement column by
I'm building a small application that will crawl sites where the content is growing
I'm writing a web crawler for a specific site. The application is a VB.Net
I need to make a web crawler to extract information from web pages. I
I have a site which has a homepage that contains a great deal of
I am working on an ASP.Net web application that must print dynamically created labels

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.