Pinned Repositories
batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
chainer
A flexible framework of neural networks for deep learning
d4rl
Fixes to D4RL datasets for them to be compatible with recent Gymnasium apis.
d4rl_evaluations
learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
m3ddpg
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
multi_source_behavior_modeling
Code for our AAAI 2023 paper: Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
pbrl_with_state_importance
Altriaex's Repositories
Altriaex/multi_source_behavior_modeling
Code for our AAAI 2023 paper: Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
Altriaex/pbrl_with_state_importance
Altriaex/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
Altriaex/chainer
A flexible framework of neural networks for deep learning
Altriaex/d4rl
Fixes to D4RL datasets for them to be compatible with recent Gymnasium apis.
Altriaex/d4rl_evaluations
Altriaex/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
Altriaex/m3ddpg
Altriaex/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Altriaex/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Altriaex/optnet
OptNet: Differentiable Optimization as a Layer in Neural Networks
Altriaex/payoff_learning
Altriaex/qpth
A fast and differentiable QP solver for PyTorch.
Altriaex/query_for_rank_difference