pos-tagging

There are 352 repositories under pos-tagging topic.

hankcs/HanLP
中文分词词性标注命名实体识别依存句法分析成分句法分析语义依存分析语义角色标注指代消解风格转换语义相似度新词发现关键词短语提取自动摘要文本分类聚类拼音简繁转换自然语言处理
Language:Python32.6k 1.1k 1.4k9.7k
mesolitica/NLP-Models-Tensorflow
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Language:Jupyter Notebook1.8k 97 29728
undertheseanlp/underthesea
Underthesea - Vietnamese NLP Toolkit
Language:Python1.3k 76 242270
winkjs/wink-nlp
Developer friendly Natural Language Processing ✨
Language:JavaScript1.2k 14 4657
roshan-research/hazm
Persian NLP Toolkit
Language:Python1.1k 23 227179
lionsoul2014/jcseg
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and search modules for lucene,solr,elasticsearch,opensearch
Language:Java907 92 57212
ikawaha/kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Language:Go792 23 3453
WorksApplications/Sudachi
A Japanese Tokenizer for Business
Language:Java753 44 7070
VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
630 22 4691
vncorenlp/VnCoreNLP
A Vietnamese natural language processing toolkit (NAACL 2018)
Language:Java562 31 45139
CogComp/cogcomp-nlp
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
Language:Java469 63 385144
mesolitica/malaya
Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
Language:Jupyter Notebook459 29 129126
Droidtown/ArticutAPI
API of Articut 中文斷詞 (兼具語意詞性標記)：「斷詞」又稱「分詞」，是中文資訊處理的基礎。Articut 不用機器學習，不需資料模型，只用現代白話中文語法規則，即能達到 SIGHAN 2005 F1-measure 94% 以上，Recall 96% 以上的成績。
Language:Python405 13 135
erickrf/nlpnet
A neural network architecture for NLP tasks, using cython for fast performance. Currently, it can perform POS tagging, SRL and dependency parsing.
Language:Python405 36 39104
CAMeL-Lab/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
Language:Python385 19 9770
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
Language:Python376 12 2822
WorksApplications/SudachiPy
Python version of Sudachi, a Japanese tokenizer.
Language:Python375 24 8248
ku-nlp/jumanpp
Juman++ (a Morphological Analyzer Toolkit)
Language:C++369 31 11044
sgrvinod/a-PyTorch-Tutorial-to-Sequence-Labeling
Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling
Language:Python362 14 782
yongzhuo/Pytorch-NLU
Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of spee
Language:Python299 4 1246
WorksApplications/sudachi.rs
Sudachi in Rust 🦀 and new generation of SudachiPy
Language:Rust276 7 13432
yohasebe/engtagger
English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger
Language:Ruby257 12 848
monpa-team/monpa
MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
Language:Python245 23 1626
ikegami-yukino/mecab
This repository is for building Windows 64-bit MeCab binary and improving MeCab Python binding.
Language:C++231 11 515
jidasheng/bi-lstm-crf
A PyTorch implementation of the BI-LSTM-CRF model.
Language:Python228 9 1448
WorksApplications/SudachiDict
A lexicon for Sudachi
Language:Python221 13 2019
kirralabs/indonesian-NLP-resources
data resource untuk NLP bahasa indonesia
220 10 050
vunb/vntk
Vietnamese NLP Toolkit for Node
Language:JavaScript210 22 3661
bnosac/udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Language:C++209 16 10834
janlukasschroeder/nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Language:Jupyter Notebook195 5 062
bentrevett/pytorch-pos-tagging
A tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.
Language:Jupyter Notebook177 3 927
datquocnguyen/jPTDP
Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)
Language:Python158 7 530
datquocnguyen/RDRPOSTagger
A fast and accurate POS and morphological tagging toolkit (EACL 2014)
Language:HTML138 13 1848
VinAIResearch/PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
Language:Python131 8 818
Qutuf/Qutuf
Qutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.
Language:Python128 7 118
simongray/datalinguist
Stanford CoreNLP in idiomatic Clojure.
Language:Clojure112 8 105

pos-tagging

hankcs/HanLP

mesolitica/NLP-Models-Tensorflow

undertheseanlp/underthesea

winkjs/wink-nlp

roshan-research/hazm

lionsoul2014/jcseg

ikawaha/kagome

WorksApplications/Sudachi

VinAIResearch/PhoBERT

vncorenlp/VnCoreNLP

CogComp/cogcomp-nlp

mesolitica/malaya

Droidtown/ArticutAPI

erickrf/nlpnet

CAMeL-Lab/camel_tools

taishi-i/nagisa

WorksApplications/SudachiPy

ku-nlp/jumanpp

sgrvinod/a-PyTorch-Tutorial-to-Sequence-Labeling

yongzhuo/Pytorch-NLU

WorksApplications/sudachi.rs

yohasebe/engtagger

monpa-team/monpa

ikegami-yukino/mecab

jidasheng/bi-lstm-crf

WorksApplications/SudachiDict

kirralabs/indonesian-NLP-resources

vunb/vntk

bnosac/udpipe

janlukasschroeder/nlp-cheat-sheet-python

bentrevett/pytorch-pos-tagging

datquocnguyen/jPTDP

datquocnguyen/RDRPOSTagger

VinAIResearch/PhoNLP

Qutuf/Qutuf

simongray/datalinguist