uloveqian2021's Stars
txsun1997/MOSS
MOSS is a conversational language model like ChatGPT.
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
AIWintermuteAI/Speech-to-Intent-Micro
An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs
pengzhendong/welm
One command to build TLG.fst for WeNet.
NielsEscarfail/FeatureExtractionKWS
SP: Feature Extraction for Speech Recognition using OFA
wenet-e2e/WenetSpeech
A 10000+ hours dataset for Chinese speech recognition
hulucky1102/Task-Oriented-Dialogue-Systems
My first repository on Github