Editorial Team
Asked: May 30, 2026

I am looking for a stemming algorithm for the Slovenian language that I can use with Sphinx

I am looking for a stemming algorithm for Slovenian that I can use with Sphinx search.

What I’m trying to achieve: when searching for ‘jabolka’, for example, I also want results for documents containing ‘jabolko’, ‘jabolki’, ‘jabolk’, etc.

I found some references to the existence of a Slovenian stemmer, but I can’t find where to download it; it’s not even for sale anywhere…

Another option I’ve come across is the wordforms option in the Sphinx index config (http://sphinxsearch.com/docs/manual-0.9.9.html#conf-wordforms), but building my own dictionary would be too difficult, so I’m wondering whether any publicly accessible dictionaries already exist.
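
For reference, a wordforms dictionary is a plain text file with one `inflected > base` mapping per line. The sketch below writes a tiny one for the example word; the function name `write_wordforms` and the three mappings are illustrative assumptions, not an existing Slovenian dictionary.

```shell
#!/bin/sh
# Sketch: write a minimal Sphinx wordforms file to the path given as $1.
# The mappings are hand-picked examples only; a real dictionary would need
# to cover every inflected form you care about.
write_wordforms() {
    cat > "$1" <<'EOF'
jabolka > jabolko
jabolki > jabolko
jabolk > jabolko
EOF
}

# Example use (the path is hypothetical):
#   write_wordforms /tmp/wordforms.txt
# and then, in the index section of sphinx.conf:
#   wordforms = /tmp/wordforms.txt
```

Maintaining such a file by hand scales poorly for a heavily inflected language, which is why a stemmer is the more attractive option here.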


If no Slovenian stemmer is available, can somebody suggest another approach to achieving similar search results?

1 Answer

    Editorial Team
    Added an answer on May 30, 2026 at 6:52 pm

    I managed to compile a Slovenian stemmer with the following steps:

    1. Download http://snowball.tartarus.org/dist/snowball_code.tgz (the Snowball source code) and unpack it.
    2. Download the Slovenian algorithm from http://snowball.tartarus.org/archives/snowball-discuss/0725.html and save it in the unpacked project from step 1, in the folder /algorithms/slovene. The file must be named stem_ISO_8859_2.sbl.
    3. The algorithm is written for the ISO 8859-2 encoding, so I converted it to UTF-8 and saved it as stem_Unicode.sbl (you have to look up the Unicode character codes for the Slovenian special characters such as ČŠŽĆ).
    4. Edit both .txt files in the /libstemmer folder and add an entry for Slovenian:

      slovene         UTF_8,ISO_8859_2        slovene,sl,slv
      
    5. Edit /GNUmakefile and add slovene twice: once to the list of UTF-8 languages and once to ISO_8859_2_algorithms.
    6. Go to the /libstemmer folder and run:

      ./mkmodules.pl modules.h src_c modules.txt ../mkinc.mak
      ./mkmodules.pl modules_utf8.h src_c  modules_utf8.txt ../mkinc_utf8.mak
      

      This generates the files needed for compiling later.

    7. Run make (from the root of the unpacked files).
    8. If there were no errors during compilation, you should have a /src_c folder containing the code for the Slovenian stemmer (next to the others):

      stem_UTF_8_slovene.c
      stem_ISO_8859_2_slovene.c
      ...
      
    9. Unpack the latest Sphinx and copy all files from your Snowball project to the Sphinx /libstemmer_c folder (excluding libstemmer.o and GNUmakefile).

    10. Compile Sphinx:

      touch NEWS README AUTHORS ChangeLog
      autoreconf --force --install
      ./configure --with-libstemmer
      make
      make install
      
    11. If all went well, you should have a working Slovene stemmer for Sphinx. You just have to enable it in your Sphinx index configuration (on my Debian it is in /usr/local/etc/sphinx.conf):

      charset_type = utf-8
      morphology = libstemmer_slovene
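
For step 3, the character codes can be looked up from the shell rather than from a table. This is a minimal sketch, assuming `iconv` and `od` are available; the helper name `char_codes` is made up for illustration, and how the .sbl file spells these codes is not shown here.

```shell
#!/bin/sh
# Sketch: print a character with its Unicode code point and its ISO-8859-2 byte.
# $1 must be a single character encoded as UTF-8 that also exists in ISO-8859-2.
char_codes() {
    # UTF-16BE bytes of a BMP character are its code point, e.g. 01 0d -> 010d.
    utf16=$(printf '%s' "$1" | iconv -f UTF-8 -t UTF-16BE | od -An -tx1 | tr -d ' \n')
    # Single byte the same character occupies in ISO-8859-2.
    latin2=$(printf '%s' "$1" | iconv -f UTF-8 -t ISO-8859-2 | od -An -tx1 | tr -d ' \n')
    printf '%s U+%s ISO-8859-2:%s\n' "$1" "$utf16" "$latin2"
}

# Example use:
#   for c in č š ž; do char_codes "$c"; done
```

Comparing the two columns shows which single-byte codes in the ISO 8859-2 source correspond to which Unicode code points in stem_Unicode.sbl.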
      

    Hope this helps someone. I had no prior experience with autoconf, so it took me a while to figure this out.
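
If you are equally new to this toolchain, it may save time to confirm the build tools are installed before starting. The sketch below is only a convenience check; the helper name `missing_tools` is hypothetical, and the tool list in the comment is an assumption inferred from the commands in the steps above.

```shell
#!/bin/sh
# Sketch: print the name of every listed tool that is not found on PATH,
# one per line. Prints nothing when all of them are available.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Example use (assumed tool list): perl for mkmodules.pl, make and a C
# compiler for both builds, autoreconf for the Sphinx configure step.
#   missing_tools perl make cc autoreconf
```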

    This Slovene stemmer is not officially released on http://snowball.tartarus.org, but in my tests it works well enough for my project.
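
A quick way to run a similar test yourself is to feed several forms of one word through the stemmer and count how many distinct stems come back. This sketch assumes the `stemwords` binary that the Snowball build normally produces in the project root, reading words on standard input; the helper name `count_stems` and the `STEMMER` variable are made up for illustration.

```shell
#!/bin/sh
# Sketch: count the distinct stems produced for a list of words.
# STEMMER is any command that reads one word per line and prints one stem
# per line. The default is an assumption about where stemwords was built.
STEMMER="${STEMMER:-./stemwords -l slovene}"

count_stems() {
    # $STEMMER is left unquoted on purpose so the command and its
    # arguments are split into separate words.
    printf '%s\n' "$@" | $STEMMER | sort -u | wc -l | tr -d ' '
}

# Example use (run from the Snowball project root after step 7):
#   count_stems jabolka jabolko jabolki jabolk
# A result of 1 means all four forms were reduced to the same stem.
```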
