nlproc

There are 55 repositories under the nlproc topic.

huggingface/knockknock
🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Language: Python 2.8k 64 42236
MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
Language: Python 741 15 103107
jina-ai/agentchain
Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
Language: Python 598 16 652
feralvam/easse
Easier Automatic Sentence Simplification Evaluation
Language: Roff 160 5 5036
KennethEnevoldsen/augmenty
Augmenty is an augmentation library based on spaCy for augmenting texts.
Language: Python 151 4 4811
ahmedbesbes/media-agent
Scrape data from social media and chat with it using LangChain
Language: Python 134 3 818
majumderb/recipe-personalization
EMNLP 2019: Generating Personalized Recipes from Historical User Preferences
Language: Python 61 8 618
kasnerz/reffix
A tool for fixing a BibTeX reference list using DBLP API
Language: Python 56 1 85
UBC-NLP/turjuman
TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).
Language: Python 53 3 312
StatguyUser/TextFeatureSelection
Python library for feature selection on text features. It provides a filter method, a genetic algorithm, and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models.
Language: Python 51 2 55
HendrikStrobelt/LMdiff
A diff tool for language models
Language: Python 42 1 154
thunlp/HiddenKiller
Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"
Language: Python 41 7 89
ahmedbesbes/keywords-extractor-with-bert
A Streamlit app to extract keywords using KeyBERT
Language: Jupyter Notebook 35 3 012
yoseflaw/nerindo
Named Entity Recognition with BiLSTM, CRF, and Attention-based models implemented in PyTorch for Indonesian News.
Language: Python 32 1 110
ahmedbesbes/anonymizer
Text anonymization app with Streamlit and spaCy
Language: Python 26 3 010
RichardLitt/thesis
My thesis on "Open Source Code and Low Resource Languages" for an MSc in Language Science and Technology at Saarland University
Language: TeX 20 6 414
ahmedbesbes/multi-label-sentiment-classifier
How to build a multi-label sentiment classifier with Tez and PyTorch
Language: Jupyter Notebook 19 4 25
coastalcph/lexlms
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
Language: Python 19 3 13
thunlp/BkdAtk-LWS
Code and data of the ACL 2021 paper "Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution"
Language: Python 16 8 37
CharlyWargnier/S4_wiki_topic_grapher
Leverage the power of the Google Natural Language API to retrieve entity relationships from Wikipedia URLs or topics! Get interactive networkx graphs of connected entities!
Language: Python 14 2 07
Yangyi-Chen/MAYA
Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".
Language: Python 13 1 10
michelecafagna26/cider
Pythonic wrappers for the CIDEr and CIDEr-D evaluation metrics. Provides CIDEr as well as CIDEr-D (CIDEr Defended), which is more robust to gaming effects. We also add the option to replace the original PTBTokenizer with the spaCy tokenizer (no Java dependency, but slower).
Language: Python 12 1 00
Lingwars/GAPLEN
GAPLEN (Grupo de Aprendizaje de Procesamiento del Lenguaje Natural), a natural language processing learning group launched by Lingwars
Language: Python 10 12 38
vgtomahawk/Charmanteau-CamReady
Code for "CharManteau: Character Embedding Models For Portmanteau Creation" (EMNLP 2017). Varun Gangal*, Harsh Jhamtani*, Graham Neubig, Eduard Hovy, Eric Nyberg
Language: Python 10 4 23
bubblspace/AIOne
MLOne Powered by AIEdX. Machine Learning Course for Everyone. Tier1 Basic
Language: Jupyter Notebook 9 2 06
elleros/DSHealth2019_loinc_embeddings
Code and Word2Vec embeddings of LOINC codes for KDD 2019 DSHealth paper "Evaluation of Embeddings of Laboratory Test Codes for Patients at a Cancer Center": https://arxiv.org/abs/1907.09600
Language: Jupyter Notebook 9 2 01
gsarti/svevo-letters-analysis
Topic Modeling and Sentiment Analysis on Italo Svevo Epistolary Corpus
Language: Jupyter Notebook 7 2 04
dsfsi/vukuzenzele-nlp
The dataset contains editions of the South African government magazine Vuk'uzenzele. Data was scraped from PDFs that have been placed in the data/raw folder. The PDFs were obtained from the Vuk'uzenzele website.
Language: Jupyter Notebook 6 0 224
jmacwan/POSPair
Simplifying representation for Natural Language Processing
Language: Python 5 0 11
DemoVersion/nlp_common_codes
Some of my code for Natural Language Processing
Language: Python 4 1 01
dsfsi/gov-za-multilingual
The dataset contains cabinet statements from the South African government. Data was scraped from the government's website: https://www.gov.za/cabinet-statements
Language: Jupyter Notebook 4 3 80
dsfsi/PuoBERTa
A RoBERTa-based language model specially designed for Setswana, using the new PuoData dataset.
Language: Makefile 4 1 00
diyclassics/latincy-book
An always-a-work-in-progress combination of documentation and demo notebooks for working with the LatinCy models
Language: HTML 3 2 01
dsfsi/za-mavito
DSFSI South African Terminology Lists and Lexicon Project
Language: HTML 3 1 10
jplasser/CNEP
CNEP (Contrastive Notes Events Pre-training), Contrastive Learning with Clinical Notes and Events Data Pre-training from MIMIC-III
Language: Jupyter Notebook 3 2 00
mhmdsabry/BERT_with_Residual_vs_Highway
Comparing the residual stream and the highway stream in transformers (BERT).
Language: Python 3 3 00
