My project uses the NLTK. How can I list the project’s corpus & model requirements so they can be automatically installed? I don’t want to click through the nltk.download() GUI, installing packages one by one.
Also, any way to freeze that same list of requirements (like pip freeze)?
The NLTK site does list a command line interface for downloading packages and collections at the bottom of this page :
http://www.nltk.org/data
The command line usage varies by which version of Python you are using, but on my Python2.6 install I noticed I was missing the ‘spanish_grammar’ model and this worked fine:
You mention listing the project’s corpus and model requirements and while I’m not sure of a way to automagically do that, I figured I would at least share this.