nlp-library

There are 370 repositories under the nlp-library topic.

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Language: Python 127k 1.1k 15k 25.2k
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Language: Python 29k 557 5.6k 4.3k
bharathgs/Awesome-pytorch-list
A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.
15.1k 566 162.8k
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
Language: Python 4.2k 42 253436
fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Language: Python 3k 82 216451
FudanNLP/fnlp
Toolkit for Chinese natural language processing
Language: Java 2.6k 254 57727
deepset-ai/FARM
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Language: Python 1.7k 54 406245
xavier-zy/Awesome-pytorch-list-CNVersion
Awesome-pytorch-list: translation work in progress...
Language: Jupyter Notebook 1.7k 66 0396
chrismattmann/tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Language: Python 1.4k 38 278233
undertheseanlp/underthesea
Underthesea - Vietnamese NLP Toolkit
Language: Python 1.3k 76 244270
MilaNLProc/contextualized-topic-models
A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
Language: Python 1.2k 17 107140
thunlp/OpenDelta
A plug-and-play library for parameter-efficient tuning (Delta Tuning)
Language: Python 947 17 6177
atilika/kuromoji
Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search
Language: Java 934 65 38127
PyThaiNLP/pythainlp
Thai Natural Language Processing in Python.
Language: Python 934 47 346272
NorskRegnesentral/skweak
skweak: A software toolkit for weak supervision applied to NLP tasks
Language: Python 914 26 7573
ashishpatel26/Treasure-of-Transformers
💁 Awesome Treasure of Transformers Models for Natural Language Processing: contains papers, videos, blogs, and official repos along with Colab notebooks. 🛫☑️
Language: Jupyter Notebook 858 28 1183
mocobeta/janome
Japanese morphological analysis engine written in pure Python
Language: Python 833 32 5249
ikawaha/kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Language: Go 792 23 3453
WorksApplications/Sudachi
A Japanese Tokenizer for Business
Language: Java 754 44 7071
datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Language: Python 704 8 2038
MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at the EACL 2021 demo track)
Language: Python 695 14 10295
pemistahl/lingua
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Language: Kotlin 662 11 12760
cbaziotis/ekphrasis
Ekphrasis is a text processing tool geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags), and spell correction, using word statistics from two big corpora (English Wikipedia and Twitter, 330 million English tweets).
Language: Python 660 18 2892
wyounas/homer
Homer, a text analyser in Python, can help make your text clearer, simpler, and more useful for your readers.
Language: Python 632 14 536
Ailln/cn2an
📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, etc.)
Language: Python 617 6 6880
taishi-i/awesome-japanese-nlp-resources
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
605 17 421
mindspore-lab/mindnlp
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗 Hugging Face.
Language: Python 555 10 234129
fhamborg/Giveme5W1H
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
Language: HTML 501 25 6886
medspacy/medspacy
Library for clinical NLP with spaCy.
Language: Jupyter Notebook 485 16 12986
proycon/pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Language: Python 477 32 2567
linuxscout/pyarabic
pyarabic
Language: Python 425 36 4884
CAMeL-Lab/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Language: Python 388 19 9870
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
Language: Python 376 12 2922
WorksApplications/SudachiPy
Python version of Sudachi, a Japanese tokenizer.
Language: Python 376 24 8248
hellohaptik/multi-task-NLP
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
Language: Python 363 19 1154
ElizaLo/NLP-Natural-Language-Processing
Projects and useful articles / links
Language: Jupyter Notebook 326 10 069
