The Archive Base


The Archive Base Latest Questions

Editorial Team
Asked: June 18, 2026 (2026-06-18T04:34:53+00:00)


I am building a tool for data extraction and transformation. The typical use case is transactionally processing large amounts of data.

The numbers: a transaction lasts from about 10 seconds to 5 minutes and updates 200 to 10,000 rows. The long duration is caused not by the database itself but by outside services that are called during the transaction.

Two types of agents access the database: multiple read agents and only one write agent, so there are never multiple concurrent writes.

During the transaction:

  • Read agents should be able to read the database and see it in its current state.
  • The write agent should be able to read the database (it both reads and writes during the transaction) and see it in the new, not yet committed state.

Is PostgreSQL a good choice for this type of load? I know it uses MVCC, so it should be OK in general, but is it OK to use long, large transactions extensively?
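To make the two visibility requirements concrete, here is a minimal sketch. It does not use PostgreSQL: it swaps in Python's built-in sqlite3 module (in WAL mode) purely as a stand-in so that it runs without a server. The table name `items`, the column `state`, and the function names are hypothetical and chosen only for illustration.

```python
import os
import sqlite3
import tempfile


def read_state(conn):
    """Return the state of row 1; fetchall() fully consumes the cursor."""
    rows = conn.execute("SELECT state FROM items WHERE id = 1").fetchall()
    return rows[0][0]


def demo_visibility():
    """Show what a reader and the single writer see while a write is uncommitted."""
    path = os.path.join(tempfile.mkdtemp(), "index.db")

    # isolation_level=None means autocommit: transactions are started and
    # ended explicitly with BEGIN / COMMIT below.
    writer = sqlite3.connect(path, isolation_level=None)
    writer.execute("PRAGMA journal_mode=WAL").fetchall()
    writer.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, state TEXT)")
    writer.execute("INSERT INTO items VALUES (1, 'old')")

    reader = sqlite3.connect(path, isolation_level=None)

    # The write agent opens its (potentially long) transaction.
    writer.execute("BEGIN IMMEDIATE")
    writer.execute("UPDATE items SET state = 'new' WHERE id = 1")

    seen_by_writer = read_state(writer)  # the writer sees its own uncommitted change
    seen_by_reader = read_state(reader)  # the reader still sees the committed state

    writer.execute("COMMIT")
    seen_after_commit = read_state(reader)  # now visible to readers

    writer.close()
    reader.close()
    return seen_by_writer, seen_by_reader, seen_after_commit


print(demo_visibility())  # ('new', 'old', 'new')
```

The same experiment can be repeated against PostgreSQL with two separate connections; the only point here is the expected pattern of who sees which state.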

What other open-source transactional databases might be a good choice (I am not limited to SQL)?

P.S.

I do not know whether sharding will affect performance. The database will be sharded. Every shard will have multiple readers and only one writer, but multiple different shards can be written to at the same time.
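The access pattern described here (one writer per shard, with different shards written in parallel) can be sketched as follows. This is only an illustration under assumptions: it uses one SQLite file per shard as a stand-in store, a CRC32 hash for routing, and an in-process lock per shard to enforce the single-writer rule. The class `ShardedIndex` and all of its method names are hypothetical.

```python
import os
import sqlite3
import tempfile
import threading
import zlib


class ShardedIndex:
    """Hypothetical sketch: one SQLite file per shard, one writer at a time per shard."""

    def __init__(self, directory, shard_count):
        self.paths = [os.path.join(directory, f"shard_{i}.db") for i in range(shard_count)]
        # One lock per shard: writes to the same shard are serialized,
        # writes to different shards can proceed at the same time.
        self.write_locks = [threading.Lock() for _ in range(shard_count)]
        for path in self.paths:
            conn = sqlite3.connect(path)
            conn.execute("CREATE TABLE IF NOT EXISTS items (key TEXT PRIMARY KEY, value TEXT)")
            conn.commit()
            conn.close()

    def shard_for(self, key):
        # CRC32 is stable across runs, so a key always maps to the same shard.
        return zlib.crc32(key.encode("utf-8")) % len(self.paths)

    def write_batch(self, shard, rows):
        """Apply all rows to one shard in a single transaction."""
        with self.write_locks[shard]:
            conn = sqlite3.connect(self.paths[shard])
            try:
                with conn:  # commits on success, rolls back on an exception
                    conn.executemany("INSERT OR REPLACE INTO items VALUES (?, ?)", rows)
            finally:
                conn.close()

    def read(self, key):
        """Read one value; returns None if the key is absent."""
        conn = sqlite3.connect(self.paths[self.shard_for(key)])
        try:
            row = conn.execute("SELECT value FROM items WHERE key = ?", (key,)).fetchone()
            return row[0] if row else None
        finally:
            conn.close()


# Usage: group rows by shard, then write each shard from its own thread.
index = ShardedIndex(tempfile.mkdtemp(), shard_count=4)
batches = {}
for i in range(20):
    key = f"k{i}"
    batches.setdefault(index.shard_for(key), []).append((key, f"v{i}"))

threads = [threading.Thread(target=index.write_batch, args=(shard, rows))
           for shard, rows in batches.items()]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Whether sharding helps or hurts in a real deployment still depends on the chosen database and hardware; the sketch only shows the locking discipline.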

I know that it is better not to call outside services during a transaction, but in this case that is the goal. The database is used as a reliable and consistent index for a heavy, huge, slow, and eventually consistent data processing tool.


1 Answer

  1. Editorial Team
    Added an answer on June 18, 2026 at 4:34 am (2026-06-18T04:34:54+00:00)

    Huge disclaimer: as always, only a real-life test can tell you the truth.

    But I think PostgreSQL will not let you down if you use a recent version (at least 9.1, preferably 9.2) and tune it properly.

    I have a somewhat similar load on my server, but with a slightly worse R/W ratio: about 10:1. Transactions range from a few milliseconds up to 1 hour (and sometimes even more), and one transaction can insert or update up to 100k rows. The total number of concurrent writers with long transactions can reach 10 or more.
    So far so good: I have not had any serious issues, and performance is great (certainly not worse than I expected).

    What really helps is that my hot working data set almost fits into available memory.

    So give it a try; it should work well for your load.


© 2021 The Archive Base. All Rights Reserved