I am looking for a tool or system that can scan a database and flag values that are out of the ordinary. I don’t need real-time checks, just something that runs overnight or at scheduled times. I need it to work at two levels:
- Database wide: e.g. compare the salaries of all employees and identify ones that are too low or too high relative to the average.
- Per employee: e.g. check an employee’s salary history and identify payments that are out of the ordinary for that employee.
These two are only examples; the same kind of check would apply to ATM withdrawals, shopping order history, invoice history, etc.
You could use SQL Server Analysis Services with a data mining model.
Obviously you’d have to adapt the code, but here’s a sample from Microsoft:
http://www.sqlserverdatamining.com/ssdm/Default.aspx?tabid=101&Id=83
“This sample shows how the clustering algorithm can be used to perform automatic data validation through the use of the PredictCaseLikelihood() function. To exercise the sample, enter values into the form and click the submit button. If the combination of values has a reasonable likelihood, the form will accept the values. If not, additional elements of the prediction query indicate the value likely to be unacceptable. Checking the “Show Details” box on the form will show the query that was sent in addition to the probability ratios used to determine the outlying values.”
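If you want a simpler baseline to compare against before setting up a mining model, the two levels from the question can be sketched with plain statistics. This is not the clustering approach from the Microsoft sample; it swaps in a modified z-score based on the median absolute deviation, and the function names, the 3.5 threshold and the (employee, amount) row shape are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import median


def modified_z_outliers(values, threshold=3.5):
    """Return indices of values whose modified z-score exceeds threshold.

    Uses the median and the median absolute deviation (MAD), which are
    less distorted by the outliers themselves than mean and std dev.
    The 3.5 cutoff is a common rule of thumb, not a tuned value.
    """
    values = list(values)
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # More than half the values are identical: as a design choice,
        # flag anything that differs from that common value.
        return [i for i, v in enumerate(values) if v != med]
    return [
        i for i, v in enumerate(values)
        if 0.6745 * abs(v - med) / mad > threshold
    ]


def per_group_outliers(rows, threshold=3.5):
    """rows: iterable of (key, amount), e.g. (employee_id, payment).

    Returns {key: [outlying amounts]} for keys that have any outliers.
    """
    groups = defaultdict(list)
    for key, amount in rows:
        groups[key].append(amount)
    flagged = {}
    for key, amounts in groups.items():
        idx = modified_z_outliers(amounts, threshold)
        if idx:
            flagged[key] = [amounts[i] for i in idx]
    return flagged


# Database wide: all salaries compared with each other (made-up numbers).
salaries = [50, 52, 49, 51, 50, 500]
global_outliers = modified_z_outliers(salaries)

# Per employee: each payment compared with that employee's own history.
payments = [("a", 100), ("a", 100), ("a", 100), ("a", 900),
            ("b", 10), ("b", 11), ("b", 12)]
employee_outliers = per_group_outliers(payments)
```

A script like this could be run from whatever scheduler you already use for overnight jobs, reading the rows from the database instead of the hard-coded lists.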