Processors are units of the neural pipeline that create different annotations for a Document. The neural pipeline now supports the following processors:
  
    
      | Name | Annotator class name | Generated Annotation | Description | 
  
  
    
      | tokenize | TokenizeProcessor | Segments a DocumentintoSentences, each containing a list ofTokens. This processor also predicts which tokens are multi-word tokens, but leaves expanding them to the MWT expander. | Tokenizes the text and performs sentence segmentation. | 
    
      | mwt | MWTProcessor | Expands multi-word tokens into multiple words when they are predicted by the tokenizer. | Expands multi-word tokens (MWT) predicted by the tokenizer. | 
    
      | lemma | LemmaProcessor | Perform lemmatization on a Wordusing theWord.textandWord.uposvalue. The result can be accessed inWord.lemma. | Generates the word lemmas for all tokens in the corpus. | 
    
      | pos | POSProcessor | UPOS, XPOS, and UFeats annotations accessible through Word’s propertiespos,xpos, andufeats. | Labels tokens with their universal POS (UPOS) tags, treebank-specific POS (XPOS) tags, and universal morphological features (UFeats). | 
    
      | depparse | DepparseProcessor | Determines the syntactic head of each word in a sentence and the dependency relation between the two words that are accessible through Word’sgovernoranddependency_relationattributes. | Provides an accurate syntactic dependency parser. |