Pinned Repositories
air-dream-website
🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
D2C
D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
DecisionNCE
[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
Diffusion-Planner
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
H2Oplus
[ICRA 2025] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps.
ODICE-Pytorch
official implementation of ODICE
OMIGA
The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
TSRL
UniAct
Universal Actions for Enhanced Embodied Foundation Models
AIR-DREAM's Repositories
AIR-DI/D2C
D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
AIR-DI/air-dream-website
🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
AIR-DI/OMIGA
The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
AIR-DI/H2O
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
AIR-DI/H2Oplus
[ICRA 2025] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps.
AIR-DI/ODICE-Pytorch
official implementation of ODICE
AIR-DI/TSRL
AIR-DI/.github
AIR-DI/AIDC
AIR-DI/DecisionNCE
[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
AIR-DI/Diffusion-Planner
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
AIR-DI/DOGE
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
AIR-DI/FISOR
[ICLR 2024] The official implementation of "Feasibility-Guided Safe Offline Reinforcement Learning"
AIR-DI/IVM
The offical Implementation of "Instruction-Guided Visual Masking"
AIR-DI/onerl
One RL Platform is all you need -- Event-driven fully distributed reinforcement learning framework
AIR-DI/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
AIR-DI/UniAct
Universal Actions for Enhanced Embodied Foundation Models
AIR-DI/BigFiles
AIR-DI/CPQ
Author's implementation of Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
AIR-DI/d4rl
A benchmark for offline reinforcement learning.
AIR-DI/DWBC
Author's implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
AIR-DI/IVR
Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
AIR-DI/LBP
[ICML 2025] The official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning"
AIR-DI/POR
Author's implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
AIR-DI/PROTO
AIR-DI/PSEC
[ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilitate efficient and flexible skill expansion and composition, iteratively evolve the agents' capabilities and efficiently address new challenges
AIR-DI/QPA
AIR-DI/RGM
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
AIR-DI/Robo_MUTUAL
The official implementation of "Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning"
AIR-DI/RSP_JAX
[AAAI'25] Are Expressive Models Truly Necessary for Offline RL?