LLLiuTC's Stars
SingleZombie/DL-Demos
Demos for deep learning
thu-ml/DiffusionBridge
Official codebase for "Diffusion Bridge Implicit Models" (ICLR 2025)
rainmaker22/SMART
[NeurIPS 2024] SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction
bhyang/diffusion-es
volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
ZihanWang314/RAGEN
RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.
OscarHuangWind/Safe-Human-in-the-Loop-RL
[T-ITS'24] A safety-aware human-in-the-loop Reinforcment Learning (SafeHiL-RL) approach for end-to-end autonomous driving.
changchencc/Simple-Hierarchical-Planning-with-Diffusion
kengz/awesome-deep-rl
A curated list of awesome Deep Reinforcement Learning resources.
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
huawei-noah/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
Ola-Omni/Ola
Ola: Pushing the Frontiers of Omni-Modal Language Model
datawhalechina/easy-rl
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
Deep-Agent/R1-V
Witness the aha moment of VLM with less than $3.
ZhengYinan-AIR/FISOR
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
LLVM-AD/MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
PKU-Alignment/aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
ZhengYinan-AIR/Diffusion-Planner
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
deepseek-ai/DeepSeek-R1
BerkeleyLearnVerify/VerifAI
VerifAI is a software toolkit for the formal design and analysis of systems that include artificial intelligence (AI) and machine learning (ML) components.
metadriverse/SimGen
Simulator-conditioned Driving Scene Generation
openai/openai-realtime-agents
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
zhaohengyin/EfficientImitate
Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''
OpenDriveLab/DriveLM
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
aailabkaist/DiffRS
Official PyTorch implementation for Diffusion Rejection Sampling (DiffRS) in ICML 2024.
KTH-RPL/DeFlow
[ICRA'24] DeFlow: Decoder of Scene Flow Network in Autonomous Driving
happy-yan/DACER-Diffusion-with-Online-RL
NeurIPS 2024 DACER