flammingRaven

flammingRaven's Stars

doocs/leetcode
🔥LeetCode solutions in any programming language | Solutions in multiple programming languages to LeetCode, 《剑指 Offer》 (Coding Interviews, 2nd Edition), and Cracking the Coding Interview (6th Edition)
Language: Java 32.4k 323 497.9k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
30.8k 2.6k 01.7k
afatcoder/LeetcodeTop
A collection of high-frequency LeetCode problems commonly asked by major internet companies 🔥
18.8k 291 652.7k
ddbourgin/numpy-ml
Machine learning, in numpy
Language: Python 15.8k 461 503.8k
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python 10.5k 77 1.3k1.4k
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Language: Python 8.2k 45 713900
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language: Python 8.1k 85 161822
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python 6k 37 186679
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python 4.9k 111 137420
OpenDriveLab/UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Language: Python 3.7k 38 191420
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.6k 61 4219
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
3.2k 99 13250
OpenDriveLab/End-to-end-Autonomous-Driving
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
2.6k 61 1246
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language: Python 2.4k 29 64456
karpathy/nano-llama31
nanoGPT style version of Llama 3.1
Language: Python 1.3k 23 667
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language: Python 1.1k 27 37190
WeThinkIn/Interview-for-Algorithm-Engineer
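[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AI algorithm engineers. Covers interview and written-test experience and practical knowledge from across the AI industry, including AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, SLAM, embodied intelligence, the metaverse, and AGI.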
954 8 0148
matteocourthoud/awesome-causal-inference
A curated list of causal inference libraries, resources, and applications.
888 41 1150
samwit/llm-tutorials
A set of LLM Tutorials from my youtube channel
Language: Jupyter Notebook 666 19 2191
kochbj/Deep-Learning-for-Causal-Inference
Extensive tutorials for learning how to build deep learning models for causal inference (HTE) using selection on observables in Tensorflow 2 and Pytorch.
305 11 968
yanring/jianzhi-Offer-Leetcode
Problems from 剑指Offer matched to their corresponding LeetCode problems
300 4 127
cszhangzhen/DRL4Recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
294 18 060
LAION-AI/lucidrains-projects
A summary of all lucidrains repositories and links to training / research approaches by LAION or other communities.
Language: Jupyter Notebook 263 10 014
fuxiAIlab/RL4RS
A Real-World Benchmark for Reinforcement Learning based Recommender System
Language: Python 222 6 1026
johnjim0816/rl-tutorials
basic algorithms of reinforcement learning
Language: Jupyter Notebook 200 6 654
xbpeng/awr
Implementation of advantage-weighted regression.
Language: Python 181 9 536
James0618/unmas
the source code of UNMAS: Multi-Agent Reinforcement Learning for Unshaped Cooperative Scenarios
Language: Python 48 8 15
hailun66/https-github.com-luwill-Machine_Learning_Code_Implementation
Language: Jupyter Notebook 4220
LAMDA-RL/ODIS
The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
Language: Python 38 1 55
MAS-anony/ASN
Language: Python 32 2 17