Release History
Note that prior to version 1.0.0, the Stanza library was named as “StanfordNLP”. To install historical versions prior to to v1.0.0, you’ll need to run pip install stanfordnlp
.
Version | Date | Notes |
---|---|---|
1.7.0 | 2023‑12‑02 | Neural Coref model! (full release log) |
1.6.1 | 2023‑10‑05 | Patch for broken Arabic POS tagger. (full release log) |
1.6.0 | 2023‑10‑02 | Multiple default models and a combined EN NER model. (full release log) |
1.5.1 | 2023‑09‑08 | CharLM and transformer integration in depparse. (full release log) |
1.5.0 | 2023‑03‑13 | Ssurgeon interface, conparser experiments. (full release log) |
1.4.2 | 2022‑09‑14 | Minor version bump to improve (python) dependencies. (full release log) |
1.4.1 | 2022‑09‑14 | Improvements to pos, conparse, and sentiment, jupyter visualization, and wider language coverage. (full release log) |
1.4.0 | 2022‑04‑22 | Transformers added to NER and Constituency Parser. (full release log) |
1.3.0 | 2021‑10‑05 | Multilingual Pipeline, Constituency Parser. (full release log) |
1.2.3 | 2021‑08‑09 | Additional NER models for AF, IT. A couple minor bugfixes. Additional NER finetuning support. (full release log) |
1.2.2 | 2021-07-15 | Fix regression in NER results for non-VI languages, fix a couple other minor bugs (full release log) |
1.2.1 | 2021-06-17 | Additional NER models for BG, HU, FI, VI. bulk_process mechanism which works much faster in certain settings. All models updated to UD 2.8 data. (full release log) |
1.2.0 | 2021-01-29 | Rewriting of the training scripts to python for greater ease of use and system compatibility. Multiple fixes to tokenization models for common punctuation errors and typos. All models updated to UD 2.7 data. (full release log) |
1.1.1 | 2020-08-13 | This release features support for extending the capability of the Stanza pipeline with customized processors, a new sentiment analysis tool for English/German/Chinese, improvements to the CoreNLPClient functionality (including compatibility with CoreNLP 4.1.0), new models for a few languages (including Thai, which is supported for the first time in Stanza), new biomedical and clinical English packages, alternative servers for downloading resource files, and various improvements and bugfixes (full release log). |
1.0.1 | 2020-04-27 | This is a maintenance release of Stanza. It features new support for jieba as Chinese tokenizer, faster lemmatizer implementation, improved compatibility with CoreNLP v4.0.0, and several bugfixes including correct character offsets in NER output and correct Vietnamese tokenization outputs (full release log). |
1.0.0 | 2020-03-17 | This release introduces new multi-lingual named entity recognition (NER) support for 8 languages, expanded UD pipeline coverage of 66 languages, improved download and pipeline interfaces, improved document object interfaces, Anaconda installation support, improved neural lemmatizer, spaCy tokenization integration, and various other enhancements and bugfixes (full release log). |
0.2.0 | 2019-05-16 | This release introduces substantially reduced model size, substantial lemmatizer speed up and more options for customizing server start up and requests (full release log). |
0.1.2 | 2019-02-26 | This release introduces support for pretokenized text, speed ups in the POS/Feats tagger and various bug fixes (full release log). |
0.1.0 | 2019-01-29 | Initial release of StanfordNLP (full release log). |