I am reading about SOLR and indexing a MySQL database into SOLR. What do

Question

Editorial Team

Asked: May 13, 2026 (2026-05-13T12:13:17+00:00)

I am reading about SOLR and indexing a MySQL database into SOLR.

What do they mean by “tokenize” and “un-tokenize”?

And what does it mean when fields are “normalized”?

I know how and what it means to normalize a database, but a field?
How can a simple field be normalized?

Thanks

1 Answer

Editorial Team · Answer 1 · 2026-05-13T12:13:17+00:00

What do they mean by “tokenize” and “un-tokenize”?

Tokenizing a field enables full-text search, i.e. finding any word that occurs anywhere in the field. An untokenized field is indexed as a single term, so it is found only on a complete and exact match: if the field’s content is “blue moon”, it will be found when you search for “blue moon”, but not when you search only for “blue”.
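To make the difference concrete, here is a minimal sketch in plain Python. It is not Solr's API: the helper names (`tokenize`, `matches_tokenized`, `matches_untokenized`) are hypothetical, and splitting on whitespace plus lowercasing is only an assumed stand-in for a real analyzer.

```python
def tokenize(text):
    """Lowercase the text and split it on whitespace (a simplified analyzer)."""
    return text.lower().split()


def matches_tokenized(field_value, query):
    """Tokenized field: every query word must occur somewhere in the field."""
    field_tokens = set(tokenize(field_value))
    return all(token in field_tokens for token in tokenize(query))


def matches_untokenized(field_value, query):
    """Untokenized field: the whole value is one term, so only an exact match counts."""
    return field_value == query


field = "blue moon"
print(matches_tokenized(field, "blue"))         # a single word is enough
print(matches_untokenized(field, "blue"))       # a partial value does not match
print(matches_untokenized(field, "blue moon"))  # the full value matches
```

In this sketch the tokenized lookup finds “blue moon” for the query “blue”, while the untokenized lookup only succeeds for the full string.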

And what does it mean when fields are “normalized”?

This most likely refers to Unicode normalization. Unicode has separate code points for combining diacritics, e.g. U+0300 is the combining grave accent, so the accented letter è can be either a single code point (U+00E8) or a sequence of two (U+0065 followed by U+0300). Of course, you want both to be found when you search for è, so the field is normalized to one consistent form before it is indexed and searched.
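As a small illustration, the sketch below uses Python's standard `unicodedata` module to show that the precomposed è (U+00E8) and the sequence e + combining grave accent (U+0065, U+0300) are different strings until they are normalized. The helper name `normalize_field` and the choice of the NFC form are assumptions for the example, not something a search engine is guaranteed to use.

```python
import unicodedata

composed = "\u00e8"      # è as a single code point
decomposed = "e\u0300"   # e followed by the combining grave accent


def normalize_field(text):
    """Bring text into NFC (composed) form so equivalent spellings compare equal."""
    return unicodedata.normalize("NFC", text)


# Without normalization the two spellings are different strings.
print(composed == decomposed)
# After normalization they compare equal.
print(normalize_field(composed) == normalize_field(decomposed))
```

Applying the same normalization to both the indexed field and the query is what lets a search for è match either spelling.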
