I’m somewhat familiar with stemming, but the stemming library I’ve been given to use

Question

Asked: June 2, 2026

I’m somewhat familiar with stemming, but the stemming library I’ve been given for a project doesn’t work well when I want to find related words. For example, if I query for any of these:

“dental”, “dentist”, “dentistry”

I should get matches for the others. While looking into this I’ve been learning about lexical relations I didn’t even know existed, such as pertainyms and troponyms. Is there a library with a mapping between these related word forms that could give back the sort of match I’m looking for?

I’ve been searching for this and haven’t found much that I can make sense of. I probably don’t know the right terminology, and I would greatly appreciate it if anyone could point me in the right direction.

1 Answer

Editorial Team · Answer 1 · 2026-06-02T04:54:07+00:00

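One approach common in IR is to stem every word at index time and to stem the query terms the same way at search time. For example, a document containing the word ‘dentistry’ would be stemmed and stored in the index under ‘dentist’. If the query keyword ‘dental’ is also stemmed to ‘dentist’, it matches that index entry. This only works when the stemmer actually reduces all of the related forms to one common stem; many rule-based stemmers leave ‘dental’ and ‘dentistry’ on different stems, which may be what you are seeing with your library.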
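As a rough illustration of stemming both the index and the query, here is a minimal Python sketch. The `stem` function and its `SUFFIXES` list are hypothetical stand-ins invented for this example, not the rules of any real stemming library, and the toy rules reduce the three words to ‘dent’ rather than ‘dentist’. The point is only that documents and queries pass through the same function.

```python
from collections import defaultdict

# Hypothetical suffix rules for illustration only; a real stemmer has
# many more rules and conditions. Longer suffixes are tried first.
SUFFIXES = ("istry", "ist", "al")


def stem(word):
    """Toy stemmer: strip the first matching suffix, keeping >= 3 letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text):
    """Split on whitespace and trim surrounding punctuation."""
    tokens = (raw.strip(".,;:!?\"'") for raw in text.split())
    return [t for t in tokens if t]


def build_index(docs):
    """Map each stem to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[stem(token)].add(doc_id)
    return index


def search(index, query):
    """Stem the query terms exactly as the documents were stemmed."""
    results = set()
    for token in tokenize(query):
        results |= index.get(stem(token), set())
    return results


docs = {
    1: "She studied dentistry for five years.",
    2: "The dentist is out today.",
    3: "Dental floss is cheap.",
    4: "Nothing relevant here.",
}
index = build_index(docs)
print(search(index, "dental"))
```

With these toy rules, a query for any of ‘dental’, ‘dentist’, or ‘dentistry’ returns documents 1, 2, and 3. Swapping in a real stemmer only changes `stem`; if that stemmer does not conflate the three forms, the lookup will miss them in the same way.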
