I have run into a situation where there is ‘bad’ data in a number

Question

0

Asked: June 1, 20262026-06-01T21:59:13+00:00 2026-06-01T21:59:13+00:00

I have run into a situation where there is ‘bad’ data in a number

0

I have run into a situation where there is ‘bad’ data in a number of tables. Data has been cross contaminated from various sources and I need to clean it out.

Specifically there are several hundred tables with identical definitions. They hold timed sensor data with an auto-increment column, Time/Date stamp and other data. The ‘bad’ data can be identified by time/date jumping backwards rather than growing as expected.

Example:

10 2010/01/05 
11 2010/01/06
12 2010/01/07
13 2008/05/09
14 2008/05/10
15 2008/05/11
16 2010/01/08
17 2010/01/09

Im looking for the best way to find these areas.

Some things to note:
– the tables in question have 100s of millions of records
– in my example the dates are sequential – in reality there may be 10 or 1000 entries for a given date (with timestamps on each) and then nothing for a week.

I can imagine a perl script walking through each and looking for these jumps. Im wondering if there is a faster, more sql-esque method.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-01T21:59:14+00:00

Editorial Team

2026-06-01T21:59:14+00:00Added an answer on June 1, 2026 at 9:59 pm

select t.* from t, (select @maxDate := '') init
where not if(date > @maxDate, @maxDate := date, 0)
order by id

This is the fastest way I can think of.

NOTE: I’m assuming you’re expecting to get records with IDs 13, 14, 15 in your example.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have run into a situation where there is ‘bad’ data in a number

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply