Stanford CoreNLP can be downloaded via the link below. This downloads a large (536 MB) zip file containing (1) the CoreNLP code jar, (2) the CoreNLP models jar (required in your classpath for most tasks), (3) the libraries required to run CoreNLP, and (4) documentation and source code for the project. This is everything you need to get going with English! Unzip the file, open the resulting folder, and you’re ready to use it.
Other languages: To work with another (human) language, you need additional model files. We have model files for several other languages. We also have more model files for English, including ones for dealing with uncased English (that is, English which is not conventionally capitalized, as in text messages or telegrams).
You can find the latest models in the table below. Versions for earlier releases are available on the release history page.
If you want to change the source code and recompile the files, see these instructions.
Previous releases can be found on the release history page.
Java: Stanford CoreNLP now requires Java 8. If you do not have it installed, you should first install Java 8. You probably want the JDK, but the JRE will do if you are only going to be a user.
GitHub: Here is the Stanford CoreNLP GitHub site.
Maven: You can find Stanford CoreNLP on Maven Central. The crucial thing to know is that CoreNLP needs its models to run (for most parts beyond the tokenizer), so you need to specify both the code jar and the models jar in your pom.xml, as follows:
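A minimal sketch of the two dependency entries, assuming the usual Maven Central coordinates for CoreNLP (group `edu.stanford.nlp`, artifact `stanford-corenlp`, with the models published under the `models` classifier). The `corenlp.version` property is a placeholder introduced here for illustration; set it to the release you actually want.

```xml
<!-- Assumed placeholder: define corenlp.version in <properties> to match your chosen release. -->
<dependencies>
  <!-- The CoreNLP code jar -->
  <dependency>
    <groupId>edu.stanford.nlp</groupId>
    <artifactId>stanford-corenlp</artifactId>
    <version>${corenlp.version}</version>
  </dependency>
  <!-- The CoreNLP models jar: same artifact, selected via the "models" classifier -->
  <dependency>
    <groupId>edu.stanford.nlp</groupId>
    <artifactId>stanford-corenlp</artifactId>
    <version>${corenlp.version}</version>
    <classifier>models</classifier>
  </dependency>
</dependencies>
```

Using one shared version property keeps the code jar and the models jar on the same release, which is the safer default since the models are built for a specific code version.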
(Note: Maven releases are made several days after the release on the website.)
If you want to get a language models jar from Maven for Arabic, Chinese, German, or Spanish, add this to your pom.xml:
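A minimal sketch of such an entry, using the Chinese models as the example and assuming the language models are published under the same Maven Central coordinates as the main artifact (`edu.stanford.nlp`, `stanford-corenlp`) with a language-specific classifier. As above, `corenlp.version` is a placeholder property introduced for illustration, not a value taken from this page.

```xml
<!-- Assumed placeholder: corenlp.version should match the version of the code jar you depend on. -->
<dependency>
  <groupId>edu.stanford.nlp</groupId>
  <artifactId>stanford-corenlp</artifactId>
  <version>${corenlp.version}</version>
  <!-- The classifier selects which language models jar is fetched -->
  <classifier>models-chinese</classifier>
</dependency>
```

To pull in models for several languages, repeat the `<dependency>` element once per classifier.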
Replace “models-chinese” with one or more of “models-english”, “models-english-kbp”, “models-arabic”, “models-french”, “models-german” or “models-spanish” for resources for other languages!