Minghua5

Minghua5's Stars

paperswithcode/galai
Model API for GALACTICA
Language:Jupyter Notebook2.7k276
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Language:Python22.5k5.5k
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案，结构参考alpaca
Language:C4.1k421
meta-llama/llama
Inference code for Llama models
Language:Python56.3k9.6k
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Language:Python4.1k487
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Language:Python18.3k1.9k
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Language:C++10.2k1.2k
facebookresearch/MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
Language:Python3.2k552
lencx/ChatGPT
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
Language:Rust52.8k5.9k
interstellard/chatgpt-advanced
WebChatGPT: A browser extension that augments your ChatGPT prompts with web results.
Language:TypeScript6.5k835
berzak/celer
Language:Python141
snastase/narratives
Narratives: fMRI data for evaluating models of naturalistic language comprehension
Language:Python584
baidu/DuReader
Baseline Systems of DuReader Dataset
Language:Python1.1k308
norahollenstein/reading-task-classification
NR/TSR
Language:Python51
PhPeKe/OB1_SAM
Model that simulates the processes behind reading in the brain.
Language:Python15
langcog/wordbank
open repository of children's vocabulary data
Language:Python6410
virginiakm1988/ML2022-Spring
**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2022 Spring
Language:Jupyter Notebook2.1k485
graykode/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
Language:Jupyter Notebook14.2k3.9k
leerumor/nlp_tutorial
NLP超强入门指南，包括各任务sota模型汇总（文本分类、文本匹配、序列标注、文本生成、语言模型），以及代码、技巧
1.8k303
d0r1h/ML-University
Machine Learning Open Source University
854110
lab-lab/nndb
Analysis scripts and movie annotations for NNDb
Language:Shell131
peaceiris/actions-gh-pages
GitHub Actions for GitHub Pages 🚀 Deploy static files and publish your site easily. Static-Site-Generators-friendly.
Language:TypeScript4.7k374
ctapweb/AutoSubClause
Automatic subordinate clause extractor
Language:Java101
justjavac/free-programming-books-zh_CN
:books: 免费的计算机编程类中文书籍，欢迎投稿
112k28.2k
EbookFoundation/free-programming-books
:books: Freely available programming books
338k61.6k
brightmart/roberta_zh
RoBERTa中文预训练模型: RoBERTa for Chinese
Language:Python2.6k409
blculyn/The-spoken-L1-corpus
The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus to the spoken L2 corpus. It comprises L1-L1 conversational interactions between L1 speakers of Chinese and a native Chinese speaker in informal settings. This corpus contains 228,306 words of transcribed interaction gathered in 2018, featuring 22 L1 speakers of Chinese in 26 audio recordings.
174
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Language:Python4k539
amrisi/amr-guidelines
24887
didi/ChineseNLP
Datasets, SOTA results of every fields of Chinese NLP
Language:HTML1.8k273