The Archive Base

Editorial Team
Asked: May 10, 2026

I need to store large amounts of metering data in a database. A record consists of an id that identifies the data’s source, a timestamp and a value. The records are later retrieved via the id and their timestamp.

In my previous experience (I am developing the successor of an application that has been in production use for the last five years), disk I/O is the relevant performance bottleneck for data retrieval. (See also this other question of mine.)

As I am never looking for single rows but always for (possibly large) groups of rows that match a range of ids and timestamps, a pretty obvious optimization seems to be to store larger, compressed chunks of data that are accessed through a much smaller index (e.g. by a day number) and are decompressed and filtered on the fly by the application.
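
A minimal sketch of that idea, to make the moving parts concrete. It assumes one chunk per day number, a fixed binary record layout, and zlib compression; the names `build_chunks` and `query` and the record layout are hypothetical, and the dict stands in for whatever small index the database would provide.

```python
import struct
import zlib
from collections import defaultdict

SECONDS_PER_DAY = 86_400
# Assumed record layout: source id (uint32), Unix timestamp (uint32), value (float64).
RECORD = struct.Struct("<IId")


def build_chunks(records):
    """Group (source_id, timestamp, value) records by day number and compress each group."""
    by_day = defaultdict(list)
    for source_id, ts, value in records:
        by_day[ts // SECONDS_PER_DAY].append((source_id, ts, value))
    chunks = {}
    for day, rows in by_day.items():
        rows.sort()  # sorted rows tend to compress better and filter predictably
        raw = b"".join(RECORD.pack(*row) for row in rows)
        chunks[day] = zlib.compress(raw)
    return chunks


def query(chunks, id_lo, id_hi, ts_lo, ts_hi):
    """Return rows with id_lo <= id <= id_hi and ts_lo <= ts <= ts_hi (inclusive)."""
    result = []
    # Only the chunks whose day number overlaps the requested time range are loaded.
    for day in range(ts_lo // SECONDS_PER_DAY, ts_hi // SECONDS_PER_DAY + 1):
        blob = chunks.get(day)
        if blob is None:
            continue
        raw = zlib.decompress(blob)
        # Decompress, then filter on the fly in the application.
        for source_id, ts, value in RECORD.iter_unpack(raw):
            if id_lo <= source_id <= id_hi and ts_lo <= ts <= ts_hi:
                result.append((source_id, ts, value))
    return result
```

With day-sized chunks, a request for a few ids still decompresses every id recorded on that day, which is exactly the excess-data problem the chunking strategy has to address.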

What I’m looking for is the best strategy for deciding what portion of the data to put in one chunk. In a perfect world, each user request would be fulfilled by retrieving one chunk of data and using most or all of it. So I want to minimize both the number of chunks I have to load for each request and the excess data per chunk.
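
One way to compare candidate layouts is to replay a sample of real requests against each layout and count both quantities. The sketch below assumes chunks are rectangular blocks of `ids_per_chunk` consecutive ids by `seconds_per_chunk` of time; the function name and both parameters are hypothetical, and a real layout need not be a regular grid.

```python
def chunk_cost(rows, requests, ids_per_chunk, seconds_per_chunk):
    """Score one candidate chunk layout against a sample of requests.

    rows: list of (source_id, timestamp) pairs present in the store.
    requests: list of (id_lo, id_hi, ts_lo, ts_hi) with inclusive bounds.
    Returns (chunks_loaded, excess_rows), each summed over all requests.
    """
    # Count how many rows land in each chunk under this layout.
    chunk_sizes = {}
    for source_id, ts in rows:
        key = (source_id // ids_per_chunk, ts // seconds_per_chunk)
        chunk_sizes[key] = chunk_sizes.get(key, 0) + 1

    chunks_loaded = 0
    excess_rows = 0
    for id_lo, id_hi, ts_lo, ts_hi in requests:
        # Rows the request actually wants.
        wanted = sum(1 for s, t in rows if id_lo <= s <= id_hi and ts_lo <= t <= ts_hi)
        # Rows that come along in every non-empty chunk overlapping the request.
        loaded = 0
        for id_key in range(id_lo // ids_per_chunk, id_hi // ids_per_chunk + 1):
            for ts_key in range(ts_lo // seconds_per_chunk, ts_hi // seconds_per_chunk + 1):
                size = chunk_sizes.get((id_key, ts_key))
                if size:
                    chunks_loaded += 1
                    loaded += size
        excess_rows += loaded - wanted
    return chunks_loaded, excess_rows
```

The two numbers pull in opposite directions: larger chunks lower `chunks_loaded` but raise `excess_rows`, so some weighting between them (for example by measured I/O and decompression cost) would still have to be chosen.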

I’ll post an answer below containing my ideas so far, and make it community property so you can expand on it. Of course, if you have a different approach, post your own.

ETA: S. Lott has posted this answer below, which is helpful to the discussion even if I can’t use it directly (see my comments). The point here is that the ‘dimensions’ to my ‘facts’ are (and should be) influenced by the end user and change over time. This is a core feature of the app and actually the reason I wound up with this question in the first place.

1 Answer

  1. Added an answer on May 10, 2026 at 6:53 pm

    ‘groups of rows that match a range of ids and timestamps’

    You have two dimensions: the source and time. I’m sure the data source has lots of attributes. Time, I know, has a lot of attributes (year, month, day, hour, day of week, week of year, quarter, fiscal period, etc., etc.)

    While your facts have ‘just’ an ID and a timestamp, they could have FKs to the data source dimension and the time dimension.

    Viewed as a star schema, a query that locates ‘groups of rows that match a range of ids’ may, more properly, be a group of rows with a common data source attribute. It isn’t so much a random cluster of IDs as a cluster of IDs defined by some common attribute of your dimensions.

    Once you define these attributes of the data source dimension, your ‘chunking’ strategy should be considerably more obvious.

    Further, you may find that the bit-mapped index capability of some database products makes it possible to simply store your facts in a plain-old table without sweating the chunk design at all.

    If bit-mapped indexes still aren’t fast enough, then perhaps you have to denormalize the data source attributes into both dimension and fact, and then partition the fact table on this dimensional attribute.
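
A minimal sketch of the star-schema view described in this answer, using Python's built-in `sqlite3` module. The table names (`source_dim`, `meter_fact`), the attributes `site` and `kind`, and the helper names are all hypothetical; SQLite has neither bitmap indexes nor table partitioning, so this only illustrates selecting facts by a dimension attribute instead of by a range of ids.

```python
import sqlite3


def build_star_schema():
    """Create an in-memory database with one dimension table and one fact table."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE source_dim (
            source_id INTEGER PRIMARY KEY,
            site      TEXT NOT NULL,   -- assumed attribute of a data source
            kind      TEXT NOT NULL    -- assumed attribute of a data source
        );
        CREATE TABLE meter_fact (
            source_id INTEGER NOT NULL REFERENCES source_dim(source_id),
            ts        INTEGER NOT NULL,
            value     REAL NOT NULL
        );
        CREATE INDEX meter_fact_source_ts ON meter_fact (source_id, ts);
        """
    )
    return conn


def facts_for_site(conn, site, ts_lo, ts_hi):
    """Fetch facts for every source sharing one attribute value, within a time range."""
    return conn.execute(
        """
        SELECT f.source_id, f.ts, f.value
        FROM meter_fact AS f
        JOIN source_dim AS d ON d.source_id = f.source_id
        WHERE d.site = ? AND f.ts BETWEEN ? AND ?
        ORDER BY f.source_id, f.ts
        """,
        (site, ts_lo, ts_hi),
    ).fetchall()
```

In the denormalized variant, the `site` column would also be copied into `meter_fact` so that the fact table itself could be partitioned (or, in this sketch, indexed) on it.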
