I am using a sphinx search server.
Currently if I want to find words as “čebula”, I have to write “čebula” in the search.
I want to configure it to be able to find results with keywords “cebula” or “ćebula” etc.
I also want the same to work with for example “fällen”.
It should work both ways, so searching f.e. “cesnja” should also find a result “češnja”.
This is my current settings in sphinx.conf:
source src
{
type = mysql
sql_host = ****
sql_user = ****
sql_pass = ****
sql_db = ****
sql_port = 3306
sql_query = \
SELECT id, Title, Content \
FROM Post
sql_ranged_throttle = 0
sql_query_pre = SET CHARACTER_SET_RESULTS=utf8
sql_query_pre = SET NAMES utf8
}
index idx1
{
source = src
path = /usr/local/sphinx/var/data/idx1
docinfo = extern
mlock = 0
morphology = metaphone
expand_keywords = 1
enable_star = 1
min_prefix_len = 3
min_word_len = 1
charset_type = utf-8
charset_table = 0..9, a..z, _, A..Z->a..z,-, U+002C, \
U+010C->U+010D, U+0106->U+0107, U+0160->U+0161, U+017D->U+017E, \
U+010D->c,U+0107->c, U+0161->s, U+017E->z, \
U+010D, U+0107, U+0161, U+017E
}
I’ve tried using different morphologys but no luck
index idx2: idx1
{
morphology = metaphone, stem_en, soundex
}
The solution to your problem is the
charset_tableconfiguration variable. The documentation for that can be found here:http://sphinxsearch.com/docs/current.html#conf-charset-table
Here is the
charset_tableI use: