I have an ordered list (a dictionary – 100K words) and many words to

Question

0

Asked: May 16, 20262026-05-16T00:53:41+00:00 2026-05-16T00:53:41+00:00

I have an ordered list (a dictionary – 100K words) and many words to

0

I have an ordered list (a dictionary – 100K words) and many words to search on this list frequently. So performance is an issue. I know that a HashSet.contains(theWord) or Collections.binarySearch(sortedList, theWord) are very fast. But I am actually not looking for the whole word.

What I want is let’s say searching for "se" and getting all the words starts with "se". So is there a ready to use solution in Java or any libraries?

A better example: On a sorted list a quick solution for the following operation

List.subList (String beginIndex, String endIndex) // returns the interval 

myWordList.subList(“ab”, “bc”);

Note: Here is a very similar question but accepted answer is not satisfying.
Overriding HashSet's Contains Method

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-16T00:53:42+00:00

Editorial Team

2026-05-16T00:53:42+00:00Added an answer on May 16, 2026 at 12:53 am

What you’re looking for here is a data structure commanly called a ‘trie’:

http://en.wikipedia.org/wiki/Trie

It stores strings in a tree indexed by prefix, where the first level of the tree contains the first character of the string, the second level the second character, etc. The result is that it allows you to extract subsets of very large sets of strings by prefix extremely quickly.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have an ordered list (a dictionary – 100K words) and many words to

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply