Pinned Repositories
commons-text
Mirror of Apache Commons Text
EDGAR
Scripts to get data on public company IPO filings from the SEC website
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
fever2018
System for Fact Extraction and Verification, for http://fever.ai FEVER shared task at EMNLP
Hash-Embeddings
PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
kafka
Mirror of Apache Kafka
NeuroNLP2
Deep neural models for core NLP tasks (Pytorch version)
nv
Notational Velocity: modeless, mouseless Mac OS X note-taking application
prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
redact-pii
Remove personally identifiable information from text.
CAPS50's Repositories
CAPS50/Hash-Embeddings
PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
CAPS50/commons-text
Mirror of Apache Commons Text
CAPS50/EDGAR
Scripts to get data on public company IPO filings from the SEC website
CAPS50/entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
CAPS50/fever2018
System for Fact Extraction and Verification, for http://fever.ai FEVER shared task at EMNLP
CAPS50/kafka
Mirror of Apache Kafka
CAPS50/NeuroNLP2
Deep neural models for core NLP tasks (Pytorch version)
CAPS50/nv
Notational Velocity: modeless, mouseless Mac OS X note-taking application
CAPS50/prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
CAPS50/redact-pii
Remove personally identifiable information from text.
CAPS50/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.