Stanford CoreNLP can be downloaded via the link below. The download is a large (536 MB) zip file containing (1) the CoreNLP code jar, (2) the CoreNLP models jar (required on your classpath for most tasks), (3) the libraries required to run CoreNLP, and (4) documentation and source code for the project. This is everything you need to get started with English. Unzip the file, open the resulting folder, and you’re ready to use it.

Download CoreNLP 3.8.0

Other languages: To work with another (human) language, you need additional model files. We have model files for several other languages, as well as additional model files for English, including models for uncased English (that is, English that is not conventionally capitalized, as in text messages or telegrams). You can find the latest models in the table below. Versions for earlier releases are available on the release history page.

Language         Model jar    Version
Arabic           download     3.8.0
Chinese          download     3.8.0
English          download     3.8.0
English (KBP)    download     3.8.0
French           download     3.8.0
German           download     3.8.0
Spanish          download     3.8.0

If you want to change the source code and recompile the files, see these instructions. Previous releases can be found on the release history page.

Java: Stanford CoreNLP now requires Java 8. If you do not have it installed, install Java 8 first. You will probably want the JDK, but the JRE is enough if you only intend to run CoreNLP rather than develop with it.
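If you are unsure which Java version a program will actually run under, a small check like the following can help. This is only a sketch: the class name `JavaVersionCheck` and its helper methods are made up for illustration and are not part of CoreNLP. It relies on the standard `java.specification.version` system property, which reports `1.8` on Java 8 and a plain major number (such as `11`) on later releases.

```java
// Illustrative helper (not part of CoreNLP): checks the running JVM version.
final class JavaVersionCheck {

    // Minimum major version stated in the text above.
    static final int REQUIRED_MAJOR = 8;

    /** Converts a specification version string such as "1.8" or "11" to its major number. */
    static int majorVersion(String specVersion) {
        String[] parts = specVersion.trim().split("\\.");
        int first = Integer.parseInt(parts[0]);
        // Java 8 and earlier report "1.x"; Java 9 and later report the major number directly.
        if (first == 1 && parts.length > 1) {
            return Integer.parseInt(parts[1]);
        }
        return first;
    }

    /** Returns true if the given specification version is at least the required major version. */
    static boolean meetsRequirement(String specVersion, int requiredMajor) {
        return majorVersion(specVersion) >= requiredMajor;
    }

    public static void main(String[] args) {
        String spec = System.getProperty("java.specification.version");
        if (meetsRequirement(spec, REQUIRED_MAJOR)) {
            System.out.println("Java " + spec + " is new enough.");
        } else {
            System.out.println("Java " + spec + " is too old; Java " + REQUIRED_MAJOR + " or later is needed.");
        }
    }
}
```

Running `java -version` on the command line gives the same information; the code form is mainly useful as a startup guard inside a larger application.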

GitHub: Here is the Stanford CoreNLP GitHub site.

Maven: You can find Stanford CoreNLP on Maven Central. Note that Maven releases are made several days after the release on the website. The crucial thing to know is that CoreNLP needs its models to run (this applies to most components beyond the tokenizer), so you need to specify both the code jar and the models jar in your pom.xml, as follows:

<dependencies>
    <dependency>
        <groupId>edu.stanford.nlp</groupId>
        <artifactId>stanford-corenlp</artifactId>
        <version>3.8.0</version>
    </dependency>
    <dependency>
        <groupId>edu.stanford.nlp</groupId>
        <artifactId>stanford-corenlp</artifactId>
        <version>3.8.0</version>
        <classifier>models</classifier>
    </dependency>
</dependencies>
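Because a missing models jar usually only shows up as a failure when a pipeline is first built, it can be handy to confirm that the jar really is on the classpath. The following is a minimal sketch, not an official CoreNLP utility: the class `ModelsJarCheck` is hypothetical, and the resource path it looks for is an assumption about where the English part-of-speech model sits inside the models jar, so adjust it to a file you know is in your jar.

```java
// Illustrative helper (not part of CoreNLP): looks for a resource on the classpath.
final class ModelsJarCheck {

    // ASSUMPTION: path of one model file inside the models jar; verify against your jar's contents.
    static final String SAMPLE_MODEL_RESOURCE =
        "edu/stanford/nlp/models/pos-tagger/english-left3words/english-left3words-distsim.tagger";

    /** Returns true if the named resource is visible to this class's class loader. */
    static boolean isOnClasspath(String resourcePath) {
        ClassLoader loader = ModelsJarCheck.class.getClassLoader();
        return loader != null && loader.getResource(resourcePath) != null;
    }

    public static void main(String[] args) {
        if (isOnClasspath(SAMPLE_MODEL_RESOURCE)) {
            System.out.println("Found the sample model resource; the models jar appears to be on the classpath.");
        } else {
            System.out.println("Sample model resource not found; check that the models dependency is declared.");
        }
    }
}
```

The check uses only the standard `ClassLoader.getResource` call, so it works without loading any CoreNLP classes and fails fast with a readable message.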

If you want to get a language-specific models jar from Maven (for example, for Arabic, Chinese, French, German, or Spanish), add a dependency like the following to your pom.xml:

<dependency>
    <groupId>edu.stanford.nlp</groupId>
    <artifactId>stanford-corenlp</artifactId>
    <version>3.8.0</version>
    <classifier>models-chinese</classifier>
</dependency>

To get resources for other languages, replace “models-chinese” with “models-english”, “models-english-kbp”, “models-arabic”, “models-french”, “models-german” or “models-spanish”. To use more than one of these, add a separate dependency block for each classifier.