I’ve got a table in a database that stores log data by time. For one day there can be a million rows in the db. The times are not at any regular interval. It has several indexes, including the time. What I want to do is build a query that will return a set of rows with one row per time interval. For example, I could do a query to return 1 row every 15 minutes for a day. This would return 24*60=96 rows. Each row returned would actually be the nearest row in the db prior to the interval requested (since the data in the database will not equal the requested interval).
I am at a loss for how to do it. I can’t just query all rows for a particular set of indexes and time interval, as it would load more than a gigabyte of data into memory, which is too slow. Is there any efficient way to do this using SQL. I’m using a MySQL database. I would be open to changing the table indexes/etc…
TIME
11:58
12:03
12:07
12:09
12:22
12:27
12:33
12:38
12:43
12:49
12:55
If I wanted to query this for a 15 minute interval from 12:00 to 1:00, I’d get back:
11:58 (nearest 12:00)
12:09 (nearest 12:15)
12:27 (nearest 12:30)
12:43 (nearest 12:45)
12:55 (nearest 1:00)
If it makes it any easier, I can also store the time as a number (i.e. ms since 1970). In the above query, this would then be an interval of 900000 ms.
So, I had thought something like:
..would do it for you, but this only returns the MIN(timeValue) over the whole table. It works if the seconds rounded to the nearest 5 min is in its own col.
See SQL Fiddle
Edit per Andiry, this works: ( http://sqlfiddle.com/#!2/bb870/6 )
But this just gives one row: ( http://sqlfiddle.com/#!2/bb870/7 )
Anyone know why?