I have read stemming harms precision but improves recall in text classification. How does

Question

0

Asked: June 6, 20262026-06-06T06:47:37+00:00 2026-06-06T06:47:37+00:00

I have read stemming harms precision but improves recall in text classification. How does

0

I have read stemming harms precision but improves recall in text classification. How does that happen? When you stem you increase the number of matches between the query and the sample documents right?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-06T06:47:38+00:00

It’s always the same, if you raise recall, your doing a generalisation. Because of that, you’re losing precision. Stemming merge words together.

On the one hand, words which ought to be merged together (such as “adhere” and “adhesion”) may remain distinct after stemming; on the other, words which are really distinct may be wrongly conflated (e.g., “experiment” and “experience”). These are known as understemming errors and overstemming errors respectively.

Overstemming lowers precision and understemming lowers recall. So, since no stemming at all means no over- but max understemming errors, you have a low recall there and a high precision.

Btw, precision means how many of your found ‘documents’ are those you were looking for. Recall means how many of all ‘documents’, which were correct, you received.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have read stemming harms precision but improves recall in text classification. How does

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply