I am currently learning data mining and I have the following questions.
- what is the relationship between machine learning and data mining?
- I found many data mining techniques are associated with statistics, while I “hear” data mining has many thing to do with machine learning. So my question is: is machine learning closely related with statistics?
- If they are not closely related, is there such divisions that separate data mining focusing on statistical techniques and data mining focusing on machine learning skills? Because I found department of statistics of some graduate schools open data mining courses.
Data mining is the process of extracting useful information from data, such as patterns, trends, customer/user behavior, liking/disliking etc. This involves the use of algorithms that are related to Artificial Intelligence and statistics.
Wikipedia‘s definition of Data Mining is:
Machine Learning involves making the computers “learn” that behavior, trend etc, and to act according. For example, in credit card fraud, the computer “learns” the behavior of a customer, and if something strange occurs (a transaction involving very high amounts etc), it flags that transaction for potential fraud.
Wikipedia’s definition of machine learning is:
Machine learning uses Data Mining to learn the pattern, behavior, trend etc, because Data Mining is the way of extracting this information from a set of data. Data Mining and Machine Learning both use Statistics make decisions. So yes statistics is involved and is very important in Data Mining and Machine learning.