Minghua5's Stars
paperswithcode/galai
Model API for GALACTICA
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese LLaMA + LoRA approach, with a structure modeled on Alpaca
meta-llama/llama
Inference code for Llama models
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
facebookresearch/MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
lencx/ChatGPT
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
interstellard/chatgpt-advanced
WebChatGPT: A browser extension that augments your ChatGPT prompts with web results.
berzak/celer
snastase/narratives
Narratives: fMRI data for evaluating models of naturalistic language comprehension
baidu/DuReader
Baseline Systems of DuReader Dataset
norahollenstein/reading-task-classification
NR/TSR
PhPeKe/OB1_SAM
Model that simulates the processes behind reading in the brain.
langcog/wordbank
open repository of children's vocabulary data
virginiakm1988/ML2022-Spring
**Official** 李宏毅 (Hung-yi Lee) Machine Learning 2022 Spring
graykode/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
leerumor/nlp_tutorial
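A comprehensive beginner's guide to NLP, including a roundup of SOTA models for each task (text classification, text matching, sequence labeling, text generation, language modeling), along with code and tips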
d0r1h/ML-University
Machine Learning Open Source University
lab-lab/nndb
Analysis scripts and movie annotations for NNDb
peaceiris/actions-gh-pages
GitHub Actions for GitHub Pages 🚀 Deploy static files and publish your site easily. Static-Site-Generators-friendly.
ctapweb/AutoSubClause
Automatic subordinate clause extractor
justjavac/free-programming-books-zh_CN
:books: Free Chinese-language computer programming books; contributions welcome
EbookFoundation/free-programming-books
:books: Freely available programming books
brightmart/roberta_zh
Pre-trained Chinese RoBERTa models: RoBERTa for Chinese
blculyn/The-spoken-L1-corpus
The spoken L1 corpus represents present-day spoken Chinese (Putonghua) as used in mainland China and is designed as a comparable corpus to the spoken L2 corpus. It comprises L1-L1 conversational interactions between L1 speakers of Chinese and a native Chinese speaker in informal settings. The corpus contains 228,306 words of transcribed interaction gathered in 2018, featuring 22 L1 speakers of Chinese in 26 audio recordings.
CLUEbenchmark/CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
amrisi/amr-guidelines
didi/ChineseNLP
Datasets and SOTA results for every field of Chinese NLP