LucienJi's Stars
afshinea/stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
NLP-LOVE/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
kailashahirwar/cheatsheets-ai
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
A-make/awesome-control-theory
Awesome resources for learning control theory
amkatrutsa/optimization_course
A course on Optimization Methods
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
chauncygu/Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
singhaman1750/Legged-Robots
This repository contains papers in the field of legged robots.
AtsushiSakai/PythonRobotics
Python sample codes for robotics algorithms.
Skylark0924/Reinforcement-Learning-in-Robotics
This is a private learning repository for reinforcement learning techniques used in robotics.
eleurent/phd-bibliography
References on Optimal Control, Reinforcement Learning and Motion Planning
MJianM/ros_best_practices
Best practices, conventions, and tricks for ROS. Do you want to become a robotics master? Then consider graduating or working at the Robotics Systems Lab at ETH in Zürich!
opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
Improbable-AI/walk-these-ways
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.
rvl-lab-utoronto/lab_onboarding_recommended_reading
This repository is a collection of papers and research material that students need to be aware of when they are getting started with research in the lab
jackaduma/ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
karpathy/ng-video-lecture
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
X-PLUG/CValues
面向中文大模型价值观的评估与对齐研究
jianzhnie/awesome-instruction-datasets
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
OpenLMLab/MOSS-RLHF
Secrets of RLHF in Large Language Models Part I: PPO
KernelA/nds-py
A Python implementation of the non-dominated sorting.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible