matthewlanders's Stars
chongminggao/EasyRL4Rec
koulanurag/ma-gym
A collection of multi-agent environments based on OpenAI Gym.
DaRL-LibSignal/LibSignal
uoe-agents/smaclite
The StarCraft Multi-Agent Challenge lite
cityflow-project/CityFlow
A Multi-Agent Reinforcement Learning Environment for Large-Scale City Traffic Scenarios
AI4Finance-Foundation/FinRL
FinRL: Financial Reinforcement Learning. 🔥
BY571/Implicit-Q-Learning
PyTorch implementation of the implicit Q-learning algorithm (IQL)
corl-team/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
MLD3/OfflineRL_FactoredActions
[NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare.
gwthomas/IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
ikostrikov/implicit_q_learning
clinicalml/gumbel-max-scm
Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)
nikhil3456/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).
jimkon/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
ChangyWen/wolpertinger_ddpg
Wolpertinger Training with DDPG (PyTorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Single-GPU/CPU compatible.
YerevaNN/mimic3-benchmarks
Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.
automl/CARL
Benchmarking RL generalization in an interpretable way.
liuzuxin/FSRL
🚀 A fast safe reinforcement learning library in PyTorch
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
facebookresearch/minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
learnables/learn2learn
A PyTorch Library for Meta-learning Research
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
tristandeleu/pytorch-maml-rl
Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch
mike-gimelfarb/deep-successor-features-for-transfer
A reusable framework for successor features for transfer in deep reinforcement learning using Keras.
Farama-Foundation/D4RL-Evaluations
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning