Altriaex

Pinned Repositories

batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
Language:Python0 1 00
chainer
A flexible framework of neural networks for deep learning
Language:Python0 2 00
d4rl
Fixes to D4RL datasets for them to be compatible with recent Gymnasium apis.
Language:Python0 1 01
d4rl_evaluations
Language:Python0 1 00
learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
Language:Python0 1 00
m3ddpg
Language:Python0 1 00
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python0 2 00
multi_source_behavior_modeling
Code for our AAAI 2023 paper: Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
Language:Python5 2 00
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python0 2 00
pbrl_with_state_importance
Language:Python1 1 01

Altriaex's Repositories

Altriaex/multi_source_behavior_modeling
Code for our AAAI 2023 paper: Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
Language:Python5 2 00
Altriaex/pbrl_with_state_importance
Language:Python1 1 01
Altriaex/batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
Language:Python0 1 00
Altriaex/chainer
A flexible framework of neural networks for deep learning
Language:Python0 2 00
Altriaex/d4rl
Fixes to D4RL datasets for them to be compatible with recent Gymnasium apis.
Language:Python0 1 01
Altriaex/d4rl_evaluations
Language:Python0 1 00
Altriaex/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
Language:Python0 1 00
Altriaex/m3ddpg
Language:Python0 1 00
Altriaex/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python0 2 00
Altriaex/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python0 2 00
Altriaex/optnet
OptNet: Differentiable Optimization as a Layer in Neural Networks
Language:Python0 2 00
Altriaex/payoff_learning
Language:Jupyter Notebook
Altriaex/qpth
A fast and differentiable QP solver for PyTorch.
Language:Python
Altriaex/query_for_rank_difference
Language:Python1 0