shigashiyama's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
fxsjy/jieba
结巴中文分词
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
arXivTimes/arXivTimes
repository to research & share the machine learning articles
google-research/electra
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
plasticityai/magnitude
A fast, efficient universal vector embedding utility package.
aozorabunko/aozorabunko
huggingface/naacl_transfer_learning_tutorial
Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA
mhagiwara/github-typo-corpus
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
WorksApplications/SudachiPy
Python version of Sudachi, a Japanese tokenizer.
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
PKSHATechnology-Research/camphr
Camphr - NLP libary for creating pipeline components
tanreinama/gpt2-japanese
Japanese GPT2 Generation Model
polm/cutlet
Japanese to romaji converter in Python
ikegami-yukino/neologdn
Japanese text normalizer for mecab-neologd
takapy0210/nlplot
Visualization Module for Natural Language Processing
WorksApplications/SudachiDict
A lexicon for Sudachi
soskek/bert-chainer
Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
BandaiNamcoResearchInc/DistilBERT-base-jp
himkt/awesome-bert-japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
chakki-works/Japanese-Company-Lexicon
yagays/kanjivg-radical
ncaq/dic-nico-intersection-pixiv
ニコニコ大百科とピクシブ百科事典の共通部分のIME辞書
musyoku/python-npylm
ベイズ階層言語モデルによる教師なし形態素解析
p-geon/DropoutCheatSheet
ideuchi/trans
translation tool
kmr-y/NTCIR14-QALab-PoliInfo-FormalRunDataset
eiennohito/jumanpp
JUMAN++ (a Japanese Morphological Analyzer)