We have a data warehouse with denormalized tables ranging from 500K to 6+ million

Question

0

Asked: May 21, 20262026-05-21T11:23:43+00:00 2026-05-21T11:23:43+00:00

We have a data warehouse with denormalized tables ranging from 500K to 6+ million

0

We have a data warehouse with denormalized tables ranging from 500K to 6+ million rows. I am developing a reporting solution, so we are utilizing database paging for performance reasons. Our reports have search criteria and we have created the necessary indexes, however, performance is poor when dealing with the million(s) row tables. The client is set on always knowing the total records, so I have to fetch the data as well as the record count.

Are there any other things I can do to help with performance? I’m not the MySQL dba and he has not really offered anything up, so I’m not sure what he can do configuration wise.

Thanks!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-21T11:23:44+00:00

If you partition the large tables and store the parts on different servers, than your query will run faster.

see: http://dev.mysql.com/doc/refman/5.1/en/partitioning.html

Also note that using NDB tables you can use HASH keys that get looked up in O(1) time.

For the number of lines you can keep a running total in a separate table and update that. For example in a after insert and after delete trigger.
Although the trigger will slow down deletes/inserts this will be spread over time. Note that you don’t have to keep all totals in one row, you can store totals per condition. Something like:

table    field    condition    row_count
----------------------------------------
table1   field1   cond_x       10
table1   field1   cond_y       20

select sum(row_count) as count_cond_xy 
from totals where field = field1 and `table` = table1 
and condition like 'cond_%';
//just a silly example you can come up with more efficient code, but I hope
//you get the gist of it.

If you find yourself always counting along the same conditions, this can speed your redesigned select count(x) from bigtable where ... up from minutes to instantly.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

We have a data warehouse with denormalized tables ranging from 500K to 6+ million

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply