D4PG-pytorch

PyTorch implementation of Distributed Distributional Deterministic Policy Gradients (https://arxiv.org/abs/1804.08617).

Supported environments

  • Pendulum-v0
  • LunarLanderContinuous-v2
  • BipedalWalker-v2

Usage

Run train.py to start the experiment specified in config.yaml.
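From a shell this usually amounts to running `python train.py` from the repository root. For scripted runs, the same step can be wrapped in a small launcher. The sketch below is only an illustration: `run_training` is a hypothetical helper that is not part of this repository, and it assumes that train.py takes no required command-line arguments and reads config.yaml from the directory it is started in.

```python
import subprocess
import sys
from pathlib import Path


def run_training(repo_root="."):
    """Launch train.py from repo_root and return its exit code.

    Hypothetical helper, not part of the repository. Assumes train.py
    needs no command-line arguments and picks up config.yaml from the
    working directory.
    """
    root = Path(repo_root)

    # Fail early with a clear message if we are not in the repository root.
    missing = [name for name in ("train.py", "config.yaml")
               if not (root / name).is_file()]
    if missing:
        raise FileNotFoundError(
            f"{', '.join(missing)} not found in {root.resolve()}"
        )

    # Use the current interpreter so the active virtual environment is reused.
    completed = subprocess.run([sys.executable, "train.py"], cwd=root)
    return completed.returncode
```

Calling `run_training()` from the repository root then starts whichever experiment config.yaml currently describes; a non-zero return value means the training script exited with an error.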

Tests

The tests are still in progress; for now, they can be used to reproduce the results.

Demo

Detailed results of training can be found at

Acknowledgements

This project is partly based on Mark Sinton's TensorFlow implementation, which helped greatly with the difficult parts.