I’m looking for feedback on which analyzer to use with an index that has

Question

Asked: May 11, 2026

I’m looking for feedback on which analyzer to use with an index that has documents in multiple languages. Currently I am using SimpleAnalyzer, as it seems to handle the broadest range of languages. Most of the documents to be indexed will be English, but there will be the occasional double-byte language indexed as well.

Are there any other suggestions, or should I just stick with SimpleAnalyzer?

Thanks

1 Answer

Editorial Team · Answer 1 · 2026-05-11T20:32:20+00:00

SimpleAnalyzer really is simple: it splits text at non-letter characters and lower-cases the terms. I’d expect StandardAnalyzer to give better results than SimpleAnalyzer, even with non-English data. You could perhaps improve it slightly by supplying a custom list of stop words in addition to the default English-language ones.
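
To make the difference concrete, here is a rough sketch of what a SimpleAnalyzer-style pipeline does to mixed-language text. It uses only the Java standard library and is not Lucene code. The class `AnalyzerSketch` and its methods are hypothetical names made up for illustration, and the stop list is an invented sample. It assumes that "letter" means whatever `Character.isLetter` accepts, so check the real analyzers against the Lucene version you actually run.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Stdlib-only approximation for illustration; NOT the Lucene implementation.
final class AnalyzerSketch {

    // Approximates a SimpleAnalyzer-style pipeline:
    // break the text at every non-letter, then lower-case each token.
    static List<String> simpleTokens(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (Character.isLetter(cp)) {
                current.appendCodePoint(Character.toLowerCase(cp));
            } else if (current.length() > 0) {
                // A non-letter (space, digit, punctuation) ends the token.
                tokens.add(current.toString());
                current.setLength(0);
            }
            i += Character.charCount(cp);
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Hypothetical helper: drop tokens found in a caller-supplied stop list,
    // mimicking the "custom stop words" idea from the answer.
    static List<String> removeStopWords(List<String> tokens, Set<String> stopWords) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!stopWords.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Digits and punctuation are discarded: prints [the, quick, brown, fox, v]
        System.out.println(simpleTokens("The Quick-Brown fox, v2.0"));

        // A run of CJK characters has no spaces, so it stays one token:
        // prints [lucene, 全文検索]
        System.out.println(simpleTokens("Lucene 全文検索"));

        // Sample stop list mixing English, French and German articles.
        Set<String> stopWords = Set.of("the", "le", "der");
        // prints [quick, brown, fox]
        System.out.println(removeStopWords(simpleTokens("The Quick-Brown fox"), stopWords));
    }
}
```

Two things stand out for a multilingual index. Letter-only tokenization throws away digits, so "v2.0" is reduced to "v". An unbroken run of double-byte text also ends up as a single long token, which will rarely match a shorter query term. A stop list that covers each language you expect is a cheap addition in either case.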
