In linear or logistic regression if we find a hypothesis function which fits the training set perfectly then it should be a good thing because in that case we have used 100 % of the information given to predict new information.
While it is called to be overfitting and said to be bad thing.
By making the hypothesis function simpler we may be actually increasing the noise instead of decreasing it.
Why is it so?
In linear or logistic regression if we find a hypothesis function which fits the
Share
Overfitting occurs when you try “too hard” to make the examples in the training set fit the classification rule.
It is considered bad thing for 2 reasons main reasons:
Example:
According to Occam’s razor, you should tolerate the misclassified sample, and assume it is noise or insignificant, and adopt the simple solution (green line) in this data set:
