Pinned Repositories
Lunas
A Python-based data processing pipeline with minimal dependencies for machine learning.
NLLB-inference
thseq
PyTorch-based seq2seq learning toolkit that mainly focuses on Neural Machine Translation.
pluiez's Repositories
pluiez/NLLB-inference
pluiez/thseq
PyTorch-based seq2seq learning toolkit that mainly focuses on Neural Machine Translation.
pluiez/Lunas
A Python-based data processing pipeline with minimal dependencies for machine learning.
pluiez/Chinese-Mandarin-Dictionaries
Chinese dictionaries (Simplified and Traditional): Chinese and Chinese-English dictionaries.
pluiez/ChineseWiki
A curated Chinese Wikipedia corpus.
pluiez/Classical-Modern
A very comprehensive parallel corpus of Classical Chinese (ancient texts) and modern Chinese.
pluiez/CLUECorpus2020
Large-scale pre-training corpus for Chinese (100 GB).
pluiez/compare-mt
A tool for holistic analysis of language generation systems
pluiez/contiguous_pytorch_params
Accelerate training by storing parameters in one contiguous chunk of memory.
pluiez/dotfiles
pluiez/ECDICT
Free English to Chinese Dictionary Database
pluiez/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
pluiez/flores
Facebook Low Resource (FLoRes) MT Benchmark
pluiez/gradio
Create UIs for your machine learning model in Python in 3 minutes
pluiez/GTS-Engine
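GTS Engine: a powerful NLU training system. GTS-Engine is an out-of-the-box, high-performance natural language understanding engine that focuses on few-shot tasks and can automatically produce NLP models from only a small number of samples.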
pluiez/kdictionary-lingoes
A Lingoes dictionary file (LD2/LDX) reader/extractor, written in C++ with Qt
pluiez/lazynlp
Library to scrape and clean web pages to create massive datasets.
pluiez/learnxinyminutes-docs
Code documentation written as code! How novel and totally my idea!
pluiez/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
pluiez/nbnhhsh
😩 "Can't you just talk properly?" A tool for translating pinyin-initial abbreviations.
pluiez/neural_sequence_labeling
A TensorFlow implementation of a neural sequence labeling model that can tackle sequence labeling tasks such as POS tagging, chunking, NER, and punctuation restoration.
pluiez/Paper-Writing-Tips
Paper Writing Tips
pluiez/pluiez.github.io
pluiez/pysonar2
A type inferencer and indexer for Python
pluiez/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
pluiez/sd-1click-colab
pluiez/sentsplit
A flexible sentence segmentation library using CRF model and regex rules