shelleyyyyu's Stars
google-research/google-research
Google Research
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
ShangtongZhang/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error correction and works out of the box.
chatopera/Synonyms
:herb: Chinese synonyms: a toolkit for chatbots and intelligent question answering
MLNLP-World/Paper-Writing-Tips
Paper Writing Tips: a repository curated by the MLNLP community to help authors avoid minor mistakes in paper submissions.
RUCAIBox/RecBole
A unified, comprehensive and efficient recommendation library
jasonwei20/eda_nlp
Data augmentation for NLP, presented at EMNLP 2019
PrithivirajDamodaran/Gramformer
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
zhanlaoban/EDA_NLP_for_Chinese
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes.
iqiyi/FASPell
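FASPell Chinese Spell Checker: a 2019 state-of-the-art spell checker for Simplified and Traditional Chinese (Chinese spelling error detection / Chinese spelling error correction / Chinese spell check)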
ibm-aur-nlp/PubLayNet
grammarly/gector
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)
styfeng/DataAug4NLP
Collection of papers and resources for data augmentation for NLP.
nonamestreet/weixin_public_corpus
WeChat official account corpus
HillZhang1999/MuCGEC
Open-source release of the MuCGEC Chinese error correction dataset and state-of-the-art text correction models. Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"
mhagiwara/github-typo-corpus
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
wdimmy/Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
TaoMiner/joint-kg-recommender
xwhan/One-shot-Relational-Learning
Code for One-shot Relational Learning for Knowledge Graphs (EMNLP18)
nusnlp/m2scorer
MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.
jcyk/BERT
A simple yet complete implementation of the popular BERT model
gotutiyan/GEC-Info
Repository to collect and categorize Grammatical Error Correction papers.
lipiji/TtT
Code for the ACL 2021 paper "Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction"
HillZhang1999/CTC-Report
State-of-the-art solution and online demo for CTC2021, the Chinese Text Correction competition
bitallin/MiduCTC-competition
Baseline for the Intelligent Text Proofreading Competition (Chinese Text Correction)
Aolin-MIR/soft-masked-bert-for-spelling-error-correction
A third-party implementation of the paper "Spelling Error Correction with Soft-Masked BERT" using tensorflow==1.12.0
pkucoli/NLPCC2018_GEC
Data for NLPCC2018 Shared Task--Grammatical Error Correction (GEC).
getao/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.