- On a side I have a Java CMS providing a Set of keywords
- On the other side I have a JavaScript Bookmarklet to curate a Webpage
Is there a clever way to cross/match my set of keywords with the webpage content ?
John Resign explain in some articles how to compress then search in a Dictionary terms but it seems really complicated.
In fact, I’m looking for a clever Java or JavaScript algorithm to match a Set of String in a text efficiently.
My question is very close to this one:
Effective search on a small text
But,
- I want to do it on 1 text instead of a database of texts
- Indexing with lucene works great but I don’t know how to match efficiently indexed terms with all my keywords.
We have done some proof of concept based on Lucene indexed Set.
But a good answer needs a large relevant content. So results are sometimes stranges. And it’s server side…