LLLiuTC

LLLiuTC's Stars

SingleZombie/DL-Demos
Demos for deep learning
Language:Python507116
thu-ml/DiffusionBridge
Official codebase for "Diffusion Bridge Implicit Models" (ICLR 2025)
Language:Python202
rainmaker22/SMART
[NeurIPS 2024] SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction
Language:Python10015
bhyang/diffusion-es
Language:Python887
volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Language:Python3.2k275
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
21.5k1.8k
ZihanWang314/RAGEN
RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.
Language:Python83458
OscarHuangWind/Safe-Human-in-the-Loop-RL
[T-ITS'24] A safety-aware human-in-the-loop Reinforcment Learning (SafeHiL-RL) approach for end-to-end autonomous driving.
Language:Python181
changchencc/Simple-Hierarchical-Planning-with-Diffusion
Language:Python14
kengz/awesome-deep-rl
A curated list of awesome Deep Reinforcement Learning resources.
72172
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Language:Python1.8k227
huawei-noah/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
Language:Python987192
Ola-Omni/Ola
Ola: Pushing the Frontiers of Omni-Modal Language Model
Language:Python1656
datawhalechina/easy-rl
强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook10.2k1.9k
Deep-Agent/R1-V
Witness the aha moment of VLM with less than $3.
Language:Python2.5k184
ZhengYinan-AIR/FISOR
[ICLR 2024] The official implementation of "Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model"
Language:Python905
hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Language:Python2.7k203
LLVM-AD/MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
Language:Python1213
PKU-Alignment/aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
Language:Python1528
ZhengYinan-AIR/Diffusion-Planner
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
Language:Python21225
deepseek-ai/DeepSeek-R1
76.1k9.8k
BerkeleyLearnVerify/VerifAI
VerifAI is a software toolkit for the formal design and analysis of systems that include artificial intelligence (AI) and machine learning (ML) components.
Language:Python18250
metadriverse/SimGen
Simulator-conditioned Driving Scene Generation
Language:Python919
openai/openai-realtime-agents
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
Language:TypeScript5k512
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.8k490
zhaohengyin/EfficientImitate
Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''
Language:Python393
OpenDriveLab/DriveLM
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
Language:HTML96963
aailabkaist/DiffRS
Official PyTorch implementation for Diffusion Rejection Sampling (DiffRS) in ICML 2024.
Language:Python192
KTH-RPL/DeFlow
[ICRA'24] DeFlow: Decoder of Scene Flow Network in Autonomous Driving
Language:Python1109
happy-yan/DACER-Diffusion-with-Online-RL
NeurIPS 2024 DACER
Language:Python709