The Archive Base


Editorial Team
Asked: June 1, 2026 (2026-06-01T08:18:06+00:00)

Situation: We are working on a project that reads datafeeds into the database at our company. These datafeeds can contain a high number of fields. We match those fields with certain columns.

At the moment we have about 120 types of fields, and each of them needs a column. We need to be able to filter and sort on all columns.

The problem is that I’m unsure which database design would be best for this. I’m using MySQL for the job, but I’m open to suggestions. At the moment I’m planning to make one table with all 120 columns, since that is the most natural way to do things.

Options: My other options are a meta table that stores keys and values, or a document-based database, which would give me a variable schema that I can scale when needed.

Question:
What is the best way to store all this data? The row count could go up to 100k rows, and I need storage that can select, sort and filter really fast.

Update:
Some more information about usage: XML feeds will be generated live from this table. We are talking about 100 to 500 requests per hour, but this will grow. The fields will not change regularly, but they could change about once every 6 months. We will also be updating the datafeeds daily, which means checking whether items have been updated, deleting old ones and adding new ones.

1 Answer

  1. Editorial Team
    Added an answer on June 1, 2026 at 8:18 am (2026-06-01T08:18:07+00:00)

    I’m working on a similar project right now, downloading dumps from the net and loading them into the database, merging changes into the main table and properly adjusting the dictionary tables.

    First, you know the data you’ll be working with, so analyze it in advance and pick the best table/column layout. If all of your 120 columns contain textual data, a single row will take several kilobytes of disk space. In that situation you will want every query to be highly selective, so that indexes are used to minimize I/O; full scans might take significant time with such a design. You haven’t said how big your 500 requests per hour will be: will each request extract a single row, a small batch of rows, or a large portion of the table (up to the whole table)?
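To illustrate the point about selective queries and indexes, here is a minimal sketch. It uses SQLite through Python's `sqlite3` module purely as a stand-in for MySQL, and the `items` table, its columns, and the index name are hypothetical. It compares the query plan of a filter on an indexed column with a filter that cannot use any index.

```python
import sqlite3

# In-memory SQLite database as a stand-in for MySQL (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE items (id INTEGER PRIMARY KEY, brand TEXT, price REAL, title TEXT)"
)
conn.executemany(
    "INSERT INTO items (brand, price, title) VALUES (?, ?, ?)",
    [(f"brand{i % 500}", i * 0.5, f"item {i}") for i in range(10_000)],
)
# Composite index: filter on brand, then read rows already ordered by price.
conn.execute("CREATE INDEX idx_items_brand_price ON items (brand, price)")


def plan(sql, params=()):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    return " | ".join(row[3] for row in rows)


# Selective filter on an indexed column: the planner searches the index.
selective_plan = plan(
    "SELECT id, title FROM items WHERE brand = ? ORDER BY price", ("brand7",)
)
# Leading-wildcard LIKE on an unindexed column: the planner scans every row.
full_scan_plan = plan("SELECT id, title FROM items WHERE title LIKE ?", ("%9%",))

print(selective_plan)
print(full_scan_plan)
```

In MySQL the equivalent check would be `EXPLAIN SELECT ...`; the exact plan text differs between engines and versions, but the distinction between an index lookup and a full table scan is the same.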

    Second, looking at the data, you might identify a number of columns that have a limited set of values. I prefer to do the following transformation for such columns:

    • set up a dictionary table with an integer PK;
    • replace the actual value in the master table’s column with the PK from the dictionary.

    In my case the transformation is done by triggers written in C, so although I pay a penalty on upload, I do get some benefits:

    • decreased total size of the database and master table;
    • better options for the database and OS to cache frequently accessed data blocks;
    • better query performance.
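The dictionary-table transformation can be sketched as follows. This is not the trigger-based setup described above: the lookup is done in application code at load time instead, SQLite stands in for MySQL, and the `brand_dict` and `items` tables and the helper functions are hypothetical names chosen for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE brand_dict (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE items (
    id       INTEGER PRIMARY KEY,
    title    TEXT NOT NULL,
    brand_id INTEGER NOT NULL REFERENCES brand_dict (id)
);
CREATE INDEX idx_items_brand ON items (brand_id);
""")


def brand_id(name):
    """Return the dictionary PK for a brand, creating the entry if needed."""
    conn.execute("INSERT OR IGNORE INTO brand_dict (name) VALUES (?)", (name,))
    row = conn.execute("SELECT id FROM brand_dict WHERE name = ?", (name,)).fetchone()
    return row[0]


def load_item(title, brand):
    """Store the small integer key in the master table instead of the text."""
    conn.execute(
        "INSERT INTO items (title, brand_id) VALUES (?, ?)", (title, brand_id(brand))
    )


for title, brand in [("Red shoe", "Acme"), ("Blue shoe", "Acme"), ("Hat", "Globex")]:
    load_item(title, brand)

# Filtering by the original text value goes through the dictionary table.
acme_titles = [
    row[0]
    for row in conn.execute(
        "SELECT i.title FROM items AS i "
        "JOIN brand_dict AS b ON b.id = i.brand_id "
        "WHERE b.name = ? ORDER BY i.title",
        ("Acme",),
    )
]
dict_size = conn.execute("SELECT COUNT(*) FROM brand_dict").fetchone()[0]
print(acme_titles, dict_size)
```

Each repeated text value is stored once in the dictionary, and the master table carries only an integer, which is where the size and caching benefits come from.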

    Third, try to split the data according to the extracts you’ll be doing. Quite often it turns out that only 30-40% of the fields in the table are used by all queries, while the remaining 60-70% are spread evenly among the queries and each is used only occasionally. In this case I would recommend splitting the main table accordingly: extract the fields that are always used into a single “master” table, and create another table for the rest of the fields. In fact, you can have several such additional tables, logically grouping the data into separate tables.
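A minimal sketch of such a split, again using SQLite as a stand-in for MySQL. The table and column names (`item_master`, `item_extra`, and their fields) are hypothetical; which fields belong in the master table depends on your actual queries.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Fields that every query touches stay in a narrow master table.
CREATE TABLE item_master (
    id    INTEGER PRIMARY KEY,
    title TEXT NOT NULL,
    price REAL NOT NULL
);
-- Rarely used fields move to a side table that shares the same key.
CREATE TABLE item_extra (
    item_id     INTEGER PRIMARY KEY REFERENCES item_master (id),
    description TEXT,
    care_notes  TEXT
);
""")
conn.execute("INSERT INTO item_master VALUES (1, 'Red shoe', 49.5)")
conn.execute("INSERT INTO item_master VALUES (2, 'Hat', 15.0)")
conn.execute("INSERT INTO item_extra VALUES (1, 'Leather upper', 'Wipe clean')")

# The common sort/filter query reads only the narrow master table.
cheap_first = conn.execute("SELECT title FROM item_master ORDER BY price").fetchall()

# Only the queries that need the extra fields pay for the join.
full_rows = conn.execute(
    "SELECT m.title, e.description FROM item_master AS m "
    "LEFT JOIN item_extra AS e ON e.item_id = m.id ORDER BY m.id"
).fetchall()
print(cheap_first, full_rows)
```

The `LEFT JOIN` keeps items that have no row in the side table, so the extra fields simply come back as NULL for them.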

    In my practice we had a table that contained detailed customer information: name details, address details, status details, banking details, billing details, financial details and a set of custom comments. All queries on that table were expensive, as it was used in the majority of our reports (and reports typically perform full scans). By splitting this table into a set of smaller ones and building a view with rules on top of them (to keep the external application happy), we managed to gain a pleasant performance boost (sorry, I don’t have the numbers any longer).
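The idea of hiding split tables behind a view can be sketched like this. The sketch uses SQLite, where an `INSTEAD OF` trigger on the view plays the role of the rules mentioned above; the mechanism in the original system may have been different, and all table, view, and column names here are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customer_name (
    id        INTEGER PRIMARY KEY,
    full_name TEXT NOT NULL
);
CREATE TABLE customer_billing (
    customer_id INTEGER PRIMARY KEY REFERENCES customer_name (id),
    iban        TEXT
);

-- The view presents the old wide shape to the external application.
CREATE VIEW customer AS
    SELECT n.id, n.full_name, b.iban
    FROM customer_name AS n
    LEFT JOIN customer_billing AS b ON b.customer_id = n.id;

-- Writes against the view are routed to the underlying tables.
CREATE TRIGGER customer_insert INSTEAD OF INSERT ON customer
BEGIN
    INSERT INTO customer_name (id, full_name) VALUES (NEW.id, NEW.full_name);
    INSERT INTO customer_billing (customer_id, iban) VALUES (NEW.id, NEW.iban);
END;
""")

# The application keeps inserting into and selecting from "customer" as before.
conn.execute(
    "INSERT INTO customer (id, full_name, iban) VALUES (?, ?, ?)",
    (1, "Ada Example", "XX00 0000"),
)
wide_row = conn.execute("SELECT id, full_name, iban FROM customer").fetchone()
name_rows = conn.execute("SELECT COUNT(*) FROM customer_name").fetchone()[0]
print(wide_row, name_rows)
```

Reports that need only one group of fields can then query the small table directly, while legacy code continues to use the view.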

    To summarize: you know the data you’ll be working with and you know the queries that will be used to access your database, so analyze both and design accordingly.
