gpengzhi's Stars
gpengzhi/Bi-SimCut
Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation"
rsennrich/subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
PeterTheOne/slideslive-slides-dl
slideslive slides downloading script
microsoft/MASS
MASS: Masked Sequence to Sequence Pre-training for Language Generation
uvavision/visual-pivoting
[EMNLP 2020] Using Visual Feature Space as a Pivot Across Languages
dinghanshen/Cutoff
The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".
dropreg/R-Drop
ImperialNLP/VTLM
Cross-lingual Visual Pre-training for Multimodal Machine Translation
petuum/adaptdl
Resource-adaptive cluster scheduler for deep learning training.
kahne/SpeechTransProgress
Tracking the progress in end-to-end speech translation
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
facebookresearch/flores
Facebook Low Resource (FLoRes) MT Benchmark
saffsd/langid.py
Stand-alone language identification system
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
QData/TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
thunlp/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
nxphi47/data_diversification
Instruction to data diversification
yaodongyu/TRADES
TRADES (TRadeoff-inspired Adversarial DEfense via Surrogate-loss minimization)
allenai/allennlp
An open-source NLP research library, built on PyTorch.
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
PaddlePaddle/models
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
facebookresearch/XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
paperswithcode/ai-deadlines
:alarm_clock: AI conference deadline countdowns
neubig/lowresource-nlp-bootcamp-2020
The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020
THUNLP-MT/MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
amusi/Deep-Learning-Interview-Book
深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
neubig/nlptutorial
A Tutorial about Programming for Natural Language Processing
asyml/aaai_tutorial
This site holds materials of the AAAI 2020 Tutorial on Modularizing Natural Language Processing