yubryanj's Stars
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
cbfinn/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
facebookresearch/mbrl-lib
Library for Model Based RL
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
rlworkgroup/metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
google-research/recsim
A Configurable Recommender Systems Simulation Platform
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
deepmind/meltingpot
A suite of test scenarios for multi-agent reinforcement learning.
marlbenchmark/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
abides-sim/abides
ABIDES: Agent-Based Interactive Discrete Event Simulation
rll-research/url_benchmark
gsartoretti/PRIMAL
PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
xbresson/CS4243_2022
Computer Vision and Pattern Recognition, NUS CS4243, 2022
forestagostinelli/DeepCubeA
Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.
marmotlab/PRIMAL2
Training code PRIMAL2 - Public Repo
facebookresearch/CollaQ
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
wjh720/QPLEX
pvjosue/pytorch_convNd
Functional N-dimensional convolution in Pytorch, recursively calling convNd until reaching conv3d.
hijkzzz/noisy-mappo
Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)
brandontrabucco/design-baselines
Baselines for Model-Based Optimization
eli-b/mapf
A MAPF framework in C#, with implementations for MA-CBS, ICBS, CBSH, ID, A*, A*+OD, and EPEA*
proroklab/rllib_differentiable_comms
This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralized critics can be realized in RLLib. This example serves as a reference implementation and starting point for making RLLib more compatible with such architectures.
xtma/simple-pytorch-rl
Reinforcement Learning Methods with PyTorch
ZhengyuLiang24/Conv4d-PyTorch
Implementation of Conv4d for PyTorch
facebookresearch/measuring-emergent-comm
On the pitfalls of measuring emergent communication
proroklab/adversarial_comms
asappresearch/emergent-comms-negotiation
Reproduce ICLR2018 submission "Emergent Communication through Negotiation"