We use this repository to maintain our releases of pre-trained pipelines and models for the Spark NLP library. For more info please take a look at our releases.
Take a look at our official Spark NLP page: http://nlp.johnsnowlabs.com/ for user documentation and examples
Slack community channel
Join Slack
pretrained(name, lang)
function to use
Model |
Name |
en |
LemmatizerModel (Lemmatizer) |
lemma_antbnc |
Download |
PerceptronModel (POS) |
pos_anc |
Download |
NerCrfModel (NER with GloVe) |
ner_crf |
Download |
NerDLModel (NER with GloVe) |
ner_dl |
Download |
NerDLModel (NER with GloVe) |
ner_dl_contrib |
Download |
NerDLModel (NER with BERT) |
ner_dl_bert_base_cased |
Download |
NerDLModel (OntoNotes with GloVe 100d) |
onto_100 |
Download |
NerDLModel (OntoNotes with GloVe 300d) |
onto_300 |
Download |
WordEmbeddings (GloVe) |
glove_100d |
Download |
BertEmbeddings (base_uncased) |
bert_base_uncased |
Download |
BertEmbeddings (base_cased) |
bert_base_cased |
Download |
BertEmbeddings (large_uncased) |
bert_large_uncased |
Download |
BertEmbeddings (large_cased) |
bert_large_cased |
Download |
DeepSentenceDetector |
ner_dl_sentence |
Download |
ContextSpellCheckerModel (Spell Checker) |
spellcheck_dl |
Download |
SymmetricDeleteModel (Spell Checker) |
spellcheck_sd |
Download |
NorvigSweetingModel (Spell Checker) |
spellcheck_norvig |
Download |
ViveknSentimentModel (Sentiment) |
sentiment_vivekn |
Download |
DependencyParser (Dependency) |
dependency_conllu |
Download |
TypedDependencyParser (Dependency) |
dependency_typed_conllu |
Download |
Model |
Name |
fr |
LemmatizerModel (Lemmatizer) |
lemma |
Download |
PerceptronModel (POS UD) |
pos_ud_gsd |
Download |
NerDLModel (glove_840B_300) |
wikiner_840B_300 |
Download |
Feature |
Description |
Lemma |
Trained by Lemmatizer annotator on lemmatization-lists by Michal Měchura |
POS |
Trained by PerceptronApproach annotator on the Universal Dependencies |
NER |
Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus and supports the identification of PER , LOC , ORG and MISC entities |
Model |
Name |
de |
LemmatizerModel (Lemmatizer) |
lemma |
de |
PerceptronModel (POS UD) |
pos_ud_hdt |
de |
NerDLModel (glove_840B_300) |
wikiner_840B_300 |
de |
Feature |
Description |
Lemma |
Trained by Lemmatizer annotator on lemmatization-lists by Michal Měchura |
POS |
Trained by PerceptronApproach annotator on the Universal Dependencies |
NER |
Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus and supports the identification of PER , LOC , ORG and MISC entities |
Model |
Name |
it |
LemmatizerModel (Lemmatizer) |
lemma_dxc |
Download |
SentimentDetector (Sentiment) |
sentiment_dxc |
Download |
PerceptronModel (POS UD) |
pos_ud_isdt |
Download |
NerDLModel (glove_840B_300) |
wikiner_840B_300 |
Download |
Feature |
Description |
Lemma |
Trained by Lemmatizer annotator on DXC Technology dataset |
POS |
Trained by PerceptronApproach annotator on the Universal Dependencies |
NER |
Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus and supports the identification of PER , LOC , ORG and MISC entities |
Model |
Name |
xx |
WordEmbeddings (GloVe) |
glove_840B_300 |
Download |
WordEmbeddings (GloVe) |
glove_6B_300 |
Download |
BertEmbeddings (multi_cased) |
bert_multi_cased |
Download |
pretrained(name, lang)
function to use
It is required to specify 3rd argument to pretrained(name, lang, loc)
function (location) to add the location of these
Model |
name |
language |
loc |
NerDLModel |
ner_clinical |
en |
clinical/models |
AssertionLogRegModel |
assertion_ml |
en |
clinical/models |
AssertionDLModel |
assertion_dl |
en |
clinical/models |
NerDLModel |
deidentify_dl |
en |
clinical/models |
DeIdentificationModel |
deidentify_rb |
en |
clinical/models |
WordEmbeddingsModel |
embeddings_clinical |
en |
clinical/models |
BertEmbeddingsModel |
biobert_pubmed_cased |
en |
clinical/models |
BertEmbeddingsModel |
biobert_pmc_cased |
en |
clinical/models |
BertEmbeddingsModel |
biobert_pubmed_pmc_cased |
en |
clinical/models |
BertEmbeddingsModel |
biobert_clinical_cased |
en |
clinical/models |
BertEmbeddingsModel |
biobert_discharge_cased |
en |
clinical/models |
PerceptronModel |
pos_clinical |
en |
clinical/models |
EntityResolverModel |
resolve_icd10 |
en |
clinical/models |
EntityResolverModel |
resolve_icd10cm_cl_em |
en |
clinical/models |
EntityResolverModel |
resolve_icd10pcs_cl_em |
en |
clinical/models |
ContextSpellCheckerModel |
context_spell_med |
en |
clinical/models |
NOTE:
noncontrib
pipelines are compatible with Windows
operating systems.
Pipelines |
Name |
en |
Explain Document ML |
explain_document_ml |
Download |
Explain Document DL |
explain_document_dl |
Download |
Explain Document DL Win |
explain_document_dl_noncontrib |
Download |
Explain Document DL Fast |
explain_document_dl_fast |
Download |
Explain Document DL Fast Win |
explain_document_dl_fast_noncontrib |
Download |
Recognize Entities DL |
recognize_entities_dl |
Download |
Recognize Entities DL Win |
recognize_entities_dl_noncontrib |
Download |
OntoNotes Entities Small |
onto_recognize_entities_sm |
Download |
OntoNotes Entities Large |
onto_recognize_entities_lg |
Download |
Match Datetime |
match_datetime |
Download |
Match Pattern |
match_pattern |
Download |
Match Chunk |
match_chunks |
Download |
Match Phrases |
match_phrases |
Download |
Clean Stop |
clean_stop |
Download |
Clean Pattern |
clean_pattern |
Download |
Clean Slang |
clean_slang |
Download |
Check Spelling |
check_spelling |
Download |
Analyze Sentiment |
analyze_sentiment |
Download |
Dependency Parse |
dependency_parse |
Download |
Pipelines |
Name |
fr |
Explain Document Large |
explain_document_lg |
Download |
Explain Document Medium |
explain_document_md |
Download |
Entity Recognizer Large |
entity_recognizer_lg |
Download |
Entity Recognizer Medium |
entity_recognizer_md |
Download |
Pipelines |
Name |
it |
Explain Document Large |
explain_document_lg |
Download |
Explain Document Medium |
explain_document_md |
Download |
Entity Recognizer Large |
entity_recognizer_lg |
Download |
Entity Recognizer Medium |
entity_recognizer_md |
Download |
nlp@johnsnowlabs.com
http://johnsnowlabs.com