flammingRaven's Stars
doocs/leetcode
🔥 LeetCode solutions in any programming language | Solutions in multiple programming languages to LeetCode, "剑指 Offer" (Coding Interviews, 2nd Edition), and "Cracking the Coding Interview" (6th Edition)
karpathy/LLM101n
LLM101n: Let's build a Storyteller
afatcoder/LeetcodeTop
A collection of high-frequency LeetCode problems commonly asked by major internet companies 🔥
ddbourgin/numpy-ml
Machine learning, in numpy
huggingface/trl
Train transformer language models with reinforcement learning.
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
OpenDriveLab/UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
OpenDriveLab/End-to-end-Autonomous-Driving
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
karpathy/nano-llama31
nanoGPT style version of Llama 3.1
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
WeThinkIn/Interview-for-Algorithm-Engineer
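[Three Years of Interviews, Five Years of Mock Exams] An interview guide for AI algorithm engineers, covering interview and written-test experience and practical knowledge across the AI industry: AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, SLAM, embodied AI, the metaverse, AGI, and more.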
matteocourthoud/awesome-causal-inference
A curated list of causal inference libraries, resources, and applications.
samwit/llm-tutorials
A set of LLM tutorials from my YouTube channel
kochbj/Deep-Learning-for-Causal-Inference
Extensive tutorials for learning how to build deep learning models for causal inference (HTE) using selection on observables in TensorFlow 2 and PyTorch.
yanring/jianzhi-Offer-Leetcode
Problems from 剑指Offer (Coding Interviews) matched to their LeetCode counterparts
cszhangzhen/DRL4Recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
LAION-AI/lucidrains-projects
A summary of all lucidrains repositories and links to training / research approaches by LAION or other communities.
fuxiAIlab/RL4RS
A Real-World Benchmark for Reinforcement Learning based Recommender System
johnjim0816/rl-tutorials
Basic algorithms of reinforcement learning
xbpeng/awr
Implementation of advantage-weighted regression.
James0618/unmas
The source code of UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios
hailun66/https-github.com-luwill-Machine_Learning_Code_Implementation
LAMDA-RL/ODIS
The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
MAS-anony/ASN