This repository involves a few experiments on Warehouse environment as well as implementations of Graph-DQN and Relational-RL. Environment is taken from the Graph-DQN paper. We used A2C as baseline on all of the experiments.
TolgaOk/Graph-Warehouse
Reinforcement Learning experiments with graphs on warehouse environment
Python