Pinned Repositories
air-dream-website
🎓 Hugo Academic Theme: create an academic website. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
D2C
D2C (Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
DecisionNCE
[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
FISOR
[ICLR 2024] The official implementation of "Feasibility-Guided Safe Offline Reinforcement Learning"
H2O
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
IVM
The official implementation of "Instruction-Guided Visual Masking"
ODICE-Pytorch
Official implementation of ODICE
OMIGA
The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
TSRL
AIR-DREAM's Repositories
AIR-DI/D2C
D2C (Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
AIR-DI/air-dream-website
🎓 Hugo Academic Theme: create an academic website. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
AIR-DI/OMIGA
The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
AIR-DI/H2O
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
AIR-DI/TSRL
AIR-DI/.github
AIR-DI/DecisionNCE
[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
AIR-DI/DeepThermal
Author's implementation of "DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning"
AIR-DI/DOGE
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR 2023)
AIR-DI/FISOR
[ICLR 2024] The official implementation of "Feasibility-Guided Safe Offline Reinforcement Learning"
AIR-DI/IVM
The official implementation of "Instruction-Guided Visual Masking"
AIR-DI/ODICE-Pytorch
Official implementation of ODICE
AIR-DI/onerl
One RL Platform is all you need -- Event-driven fully distributed reinforcement learning framework
AIR-DI/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
AIR-DI/CPQ
Author's implementation of Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
AIR-DI/d4rl
A benchmark for offline reinforcement learning.
AIR-DI/DWBC
Author's implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
AIR-DI/IVR
Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
AIR-DI/MOPP
Official codebase of "Model-Based Offline Planning with Trajectory Pruning (MOPP)"
AIR-DI/POR
Author's implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
AIR-DI/PROTO
AIR-DI/QPA
AIR-DI/RGM
The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR 2023)