ymcui
NLP Researcher. Mainly interested in Pre-trained Language Model, Machine Reading Comprehension, Question Answering, etc.
Joint Laboratory of HIT and iFLYTEK Research (HFL)Beijing, China
Pinned Repositories
Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
Chinese-ELECTRA
Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Chinese-LLaMA-Alpaca-3
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
Chinese-Mixtral
中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)
Chinese-XLNet
Pre-Trained Chinese XLNet(中文XLNet预训练模型)
cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
PERT
PERT: Pre-training BERT with Permuted Language Model
ymcui's Repositories
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
ymcui/Chinese-LLaMA-Alpaca-3
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
ymcui/Chinese-XLNet
Pre-Trained Chinese XLNet(中文XLNet预训练模型)
ymcui/Chinese-ELECTRA
Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)
ymcui/MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
ymcui/Chinese-Mixtral
中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)
ymcui/cmrc2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
ymcui/PERT
PERT: Pre-training BERT with Permuted Language Model
ymcui/Chinese-RC-Datasets
Collections of Chinese reading comprehension datasets
ymcui/LERT
LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)
ymcui/Chinese-Cloze-RC
A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)
ymcui/cmrc2019
A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)
ymcui/LAMB_Optimizer_TF
LAMB Optimizer for Large Batch Training (TensorFlow version)
ymcui/Chinese-MobileBERT
Chinese MobileBERT(中文MobileBERT模型)
ymcui/cmrc2017
The First Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2017)
ymcui/Eval-on-NN-of-RC
Empirical Evaluation on Current Neural Networks on Cloze-style Reading Comprehension
ymcui/ChatGPT-in-Academia
Policies of scientific publisher and conferences towards large language model (LLM), such as ChatGPT
ymcui/Cross-Lingual-MRC
Cross-Lingual Machine Reading Comprehension (EMNLP 2019)
ymcui/expmrc
ExpMRC: Explainability Evaluation for Machine Reading Comprehension
ymcui/NLP-Review-Scorer
Score your NLP paper review
ymcui/ACL2020-PC-Blogs-Chinese
Chinese Version of ACL 2020 PC Blogs (ACL 2020程序委员会博文中文版)
ymcui/mrc-model-analysis
Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models (iScience)
ymcui/xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
ymcui/llama.cpp
Port of Facebook's LLaMA model in C/C++
ymcui/VLE
VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)
ymcui/ARR-SAC-Tool
Helpful notebook for ARR SACs