Reimplementing DDPG from Continuous Control with Deep Reinforcement Learning based on OpenAI Gym and Tensorflow
http://arxiv.org/abs/1509.02971
It is still a problem to implement Batch Normalization on the critic network. However the actor network works well with Batch Normalization.
Some Mujoco environments are still unsolved on OpenAI Gym.
git clone https://github.com/songrotek/DDPG.git
cd DDPG
python gym_ddpg.py
If you want to change the Gym environment, change ENV_NAME in gym_ddpg.py.
If you want to change the Network type, change import in ddpg.py such as
from actor_network_bn import ActorNetwork
to
from actor_network import ActorNetwork
1 https://github.com/rllab/rllab