I’m doing a web crawler and I want to use lucene to index while the streaming is progressing or completed.
I’ve seen that the example of lucene.net html library is good. however, I don’t want to keep download into disk. what i want and is just indexing while downloading the web or maybe index of a string of html content.
Is there any example that makes lucence.net html indexer working with memory stream or a string?
something like that?