nlp-library

There are 370 repositories under nlp-library topic.

  • transformers

    huggingface/transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Language:Python127k1.1k15k25.2k
  • explosion/spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    Language:Python29k5575.6k4.3k
  • bharathgs/Awesome-pytorch-list

    A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

  • thunlp/OpenPrompt

    An Open-Source Framework for Prompt-Learning.

    Language:Python4.2k42253436
  • fastnlp/fastNLP

    fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

    Language:Python3k82216451
  • FudanNLP/fnlp

    中文自然语言处理工具包 Toolkit for Chinese natural language processing

    Language:Java2.6k25457727
  • FARM

    deepset-ai/FARM

    :house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

    Language:Python1.7k54406245
  • xavier-zy/Awesome-pytorch-list-CNVersion

    Awesome-pytorch-list 翻译工作进行中......

    Language:Jupyter Notebook1.7k660396
  • chrismattmann/tika-python

    Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

    Language:Python1.4k38278233
  • underthesea

    undertheseanlp/underthesea

    Underthesea - Vietnamese NLP Toolkit

    Language:Python1.3k76244270
  • MilaNLProc/contextualized-topic-models

    A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

    Language:Python1.2k17107140
  • thunlp/OpenDelta

    A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

    Language:Python947176177
  • atilika/kuromoji

    Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search

    Language:Java9346538127
  • PyThaiNLP/pythainlp

    Thai Natural Language Processing in Python.

    Language:Python93447346272
  • NorskRegnesentral/skweak

    skweak: A software toolkit for weak supervision applied to NLP tasks

    Language:Python914267573
  • ashishpatel26/Treasure-of-Transformers

    💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️

    Language:Jupyter Notebook858281183
  • mocobeta/janome

    Japanese morphological analysis engine written in pure Python

    Language:Python833325249
  • ikawaha/kagome

    Self-contained Japanese Morphological Analyzer written in pure Go

    Language:Go792233453
  • WorksApplications/Sudachi

    A Japanese Tokenizer for Business

    Language:Java754447071
  • DataDreamer

    datadreamer-dev/DataDreamer

    DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

    Language:Python70482038
  • OCTIS

    MIND-Lab/OCTIS

    OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

    Language:Python6951410295
  • lingua

    pemistahl/lingua

    The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

    Language:Kotlin6621112760
  • cbaziotis/ekphrasis

    Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

    Language:Python660182892
  • wyounas/homer

    Homer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.

    Language:Python63214536
  • Ailln/cn2an

    📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)

    Language:Python61766880
  • taishi-i/awesome-japanese-nlp-resources

    A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

  • mindspore-lab/mindnlp

    Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.

    Language:Python55510234129
  • fhamborg/Giveme5W1H

    Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

    Language:HTML501256886
  • medspacy/medspacy

    Library for clinical NLP with spaCy.

    Language:Jupyter Notebook4851612986
  • proycon/pynlpl

    PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

    Language:Python477322567
  • pyarabic

    linuxscout/pyarabic

    pyarabic

    Language:Python425364884
  • CAMeL-Lab/camel_tools

    A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

    Language:Python388199870
  • nagisa

    taishi-i/nagisa

    A Japanese tokenizer based on recurrent neural networks

    Language:Python376122922
  • WorksApplications/SudachiPy

    Python version of Sudachi, a Japanese tokenizer.

    Language:Python376248248
  • hellohaptik/multi-task-NLP

    multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.

    Language:Python363191154
  • NLP-Natural-Language-Processing

    ElizaLo/NLP-Natural-Language-Processing

    Projects and useful articles / links

    Language:Jupyter Notebook32610069