nlproc

There are 55 repositories under nlproc topic.

  • huggingface/knockknock

    🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code

    Language:Python2.8k6442236
  • OCTIS

    MIND-Lab/OCTIS

    OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

    Language:Python74115103107
  • agentchain

    jina-ai/agentchain

    Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

    Language:Python59816652
  • feralvam/easse

    Easier Automatic Sentence Simplification Evaluation

    Language:Roff16055036
  • KennethEnevoldsen/augmenty

    Augmenty is an augmentation library based on spaCy for augmenting texts.

    Language:Python15144811
  • ahmedbesbes/media-agent

    Scrape data from social media and chat with it using Langchain

    Language:Python1343818
  • majumderb/recipe-personalization

    EMNLP 2019: Generating Personalized Recipes from Historical User Preferences

    Language:Python618618
  • kasnerz/reffix

    A tool for fixing a BibTeX reference list using DBLP API

    Language:Python56185
  • UBC-NLP/turjuman

    TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).

    Language:Python533312
  • StatguyUser/TextFeatureSelection

    Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models

    Language:Python51255
  • HendrikStrobelt/LMdiff

    A diff tool for language models

    Language:Python421154
  • thunlp/HiddenKiller

    Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"

    Language:Python41789
  • ahmedbesbes/keywords-extractor-with-bert

    A Streamlit app to extract keywords using KeyBert

    Language:Jupyter Notebook353012
  • yoseflaw/nerindo

    Named Entity Recognition with BiLSTM, CRF, and Attention-based models implemented in PyTorch for Indonesian News.

    Language:Python321110
  • anonymizer

    ahmedbesbes/anonymizer

    Text Anonymization app with Streamlit and Spacy

    Language:Python263010
  • RichardLitt/thesis

    My thesis on "Open Source Code and Low Resource Languages" for an MSc in Language Science and Technology at Saarland University

    Language:TeX206414
  • multi-label-sentiment-classifier

    ahmedbesbes/multi-label-sentiment-classifier

    How to build a multi-label sentiment classifiers with Tez and PyTorch

    Language:Jupyter Notebook19425
  • coastalcph/lexlms

    LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development

    Language:Python19313
  • thunlp/BkdAtk-LWS

    Code and data of the ACL 2021 paper "Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution"

    Language:Python16837
  • CharlyWargnier/S4_wiki_topic_grapher

    Leverage the power of the Google Natural Language API NLP to retrieve entity relationships from Wikipedia URLs or topics! Get interactive networkx graphs of connected entities!

    Language:Python14207
  • Yangyi-Chen/MAYA

    Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".

    Language:Python13110
  • cider

    michelecafagna26/cider

    Pythonic wrappers for Cider/CiderD evaluation metrics. Provides CIDEr as well as CIDEr-D (CIDEr Defended) which is more robust to gaming effects. We also add the possibility to replace the original PTBTokenizer with the Spacy tekenizer (No java dependincy but slower)

    Language:Python12100
  • Lingwars/GAPLEN

    Grupo de Aprendizaje de Procesamiento del Lenguaje Natural, lanzado por Lingwars

    Language:Python101238
  • vgtomahawk/Charmanteau-CamReady

    Code for "CharManteau: Character Embedding Models For Portmanteau Creation. EMNLP 2017. Varun Gangal*, Harsh Jhamtani*, Graham Neubig, Eduard Hovy, Eric Nyberg"

    Language:Python10423
  • bubblspace/AIOne

    MLOne Powered by AIEdX. Machine Learning Course for Everyone. Tier1 Basic

    Language:Jupyter Notebook9206
  • DSHealth2019_loinc_embeddings

    elleros/DSHealth2019_loinc_embeddings

    Code and Word2Vec embeddings of LOINC codes for KDD 2019 DSHealth paper "Evaluation of Embeddings of Laboratory Test Codes for Patients at a Cancer Center": https://arxiv.org/abs/1907.09600

    Language:Jupyter Notebook9201
  • gsarti/svevo-letters-analysis

    Topic Modeling and Sentiment Analysis on Italo Svevo Epistolary Corpus

    Language:Jupyter Notebook7204
  • dsfsi/vukuzenzele-nlp

    The dataset contains editions from the South African government magazine Vuk'uzenzele. Data was scraped from PDFs that have been placed in the data/raw folder. The PDFS were obtained from the Vuk'uzenzele website.

    Language:Jupyter Notebook60224
  • POSPair

    jmacwan/POSPair

    Simplifying representation for Natural Language Processing

    Language:Python5011
  • DemoVersion/nlp_common_codes

    Some of My Codes for Natural Language Processing

    Language:Python4101
  • dsfsi/gov-za-multilingual

    The data set contains cabinet statements from the South African government. Data was scraped from the governments website: https://www.gov.za/cabinet-statements

    Language:Jupyter Notebook4380
  • dsfsi/PuoBERTa

    A Roberta-based language model specially designed for Setswana, using the new PuoData dataset.

    Language:Makefile4100
  • diyclassics/latincy-book

    An always-a-work-in-progress combination of documentation and demo notebooks for working with the LatinCy models

    Language:HTML3201
  • dsfsi/za-mavito

    DSFSI South African Terminlogy Lists and Lexicon Project

    Language:HTML3110
  • CNEP

    jplasser/CNEP

    CNEP (Contrastive Notes Events Pre-training), Contrastive Learning with Clinical Notes and Events Data Pre-training from MIMIC-III

    Language:Jupyter Notebook3200
  • mhmdsabry/BERT_with_Residual_vs_Highway

    Comparing between residual stream and highway stream in transformers(BERT) .

    Language:Python3300