AIR-DREAM

China

Pinned Repositories

air-dream-website
🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
Language:TeX4 0 01
D2C
D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
Language:Python28 1 02
DecisionNCE
[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
Language:Python0 0 00
Diffusion-Planner
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
Language:Python00
H2Oplus
[ICRA 2025] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps.
Language:Python10
ODICE-Pytorch
official implementation of ODICE
Language:Python1 0 00
OMIGA
The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
Language:Python2 0 01
openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language:Jupyter Notebook0 0 00
TSRL
Language:Python1 0 00
UniAct
Universal Actions for Enhanced Embodied Foundation Models
Language:Python00

AIR-DREAM's Repositories

AIR-DI/D2C
D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
Language:Python28 1 02
AIR-DI/air-dream-website
🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
Language:TeX4 0 01
AIR-DI/OMIGA
The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
Language:Python2 0 01
AIR-DI/H2O
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
Language:Python1 0 00
AIR-DI/H2Oplus
[ICRA 2025] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps.
Language:Python10
AIR-DI/ODICE-Pytorch
official implementation of ODICE
Language:Python1 0 00
AIR-DI/TSRL
Language:Python1 0 00
AIR-DI/.github
0 1 00
AIR-DI/AIDC
0 1 00
AIR-DI/DecisionNCE
[ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
Language:Python0 0 00
AIR-DI/Diffusion-Planner
[ICLR 2025 Oral] The official implementation of "Diffusion-Based Planning for Autonomous Driving with Flexible Guidance"
Language:Python00
AIR-DI/DOGE
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
Language:Python0 0 00
AIR-DI/FISOR
[ICLR 2024] The official implementation of "Feasibility-Guided Safe Offline Reinforcement Learning"
Language:Python0 0 00
AIR-DI/IVM
The offical Implementation of "Instruction-Guided Visual Masking"
Language:Jupyter Notebook0 0 00
AIR-DI/onerl
One RL Platform is all you need -- Event-driven fully distributed reinforcement learning framework
Language:Python0 0 00
AIR-DI/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language:Jupyter Notebook0 0 00
AIR-DI/UniAct
Universal Actions for Enhanced Embodied Foundation Models
Language:Python00
AIR-DI/BigFiles
AIR-DI/CPQ
Author's implementation of Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Language:Python0 0
AIR-DI/d4rl
A benchmark for offline reinforcement learning.
Language:Python0 0
AIR-DI/DWBC
Author's implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
Language:Python0 0
AIR-DI/IVR
Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
Language:Python0 0
AIR-DI/LBP
[ICML 2025] The official Implementation of "Efficient Robotic Policy Learning via Latent Space Backward Planning"
AIR-DI/POR
Author's implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
Language:Python0 0
AIR-DI/PROTO
Language:Python0 0
AIR-DI/PSEC
[ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilitate efficient and flexible skill expansion and composition, iteratively evolve the agents' capabilities and efficiently address new challenges
Language:Python0 0
AIR-DI/QPA
Language:Python0 0
AIR-DI/RGM
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
Language:Python0 0
AIR-DI/Robo_MUTUAL
The official implementation of "Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning"
AIR-DI/RSP_JAX
[AAAI'25] Are Expressive Models Truly Necessary for Offline RL?