Pinned Repositories
Explore-Instruct
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
Awesome-Incremental-Learning
Awesome Incremental Learning
Deep-Learning
Image processing
heaven
ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"
MyML
My Machine Learning
how-to-train-tokenizer
怎么训练一个LLM分词器
wangchunlin's Repositories
wangchunlin/Awesome-Incremental-Learning
Awesome Incremental Learning
wangchunlin/Deep-Learning
Image processing
wangchunlin/heaven
wangchunlin/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
wangchunlin/mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"
wangchunlin/MyML
My Machine Learning