RLTF: Reinforcement Learning in TensorFlow

RLTF is a research framework that provides high-quality implementations of common Reinforcement Learning algorithms. It also allows fast-prototyping and benchmarking of new methods.

Status: This work is under active development (breaking changes might occur).

Implemented Algorithms

Algorithm	Model	Agent
DQN	DQN	AgentDQN
Double DQN	DDQN	AgentDQN
Dueling DQN	next	next
Prioritized Experience Replay	next	next
C51	C51	AgentDQN
QR-DQN	QRDQN	AgentDQN
Bootstrapped DQN	BstrapDQN	AgentDQN
Bootstrapped UCB	BstrapDQN_UCB	AgentDQN
DQN Ensemble	DQN_Ensemble	AgentDQN
BDQN	BDQN	AgentBDQN
DQN-IDS	DQN-IDS	AgentDQN
C51-IDS	C51-IDS	AgentDQN
DDPG	DDPG	AgentDDPG
REINFORCE	REINFORCE	AgentPG
PPO	PPO	AgentPPO
TRPO	TRPO	AgentTRPO

Coming additions:

Official release for DQN-IDS and C51-IDS
MPI support for policy gradients
Dueling DQN
Prioritized Experience Replay
n-step returns
Rainbow

Reproducibility and Known Issues

Implemented models are able to achieve comparable results to the ones reported in the corresponding papers. With tiny exceptions, all implementations should be equivalent to the ones described in the original papers.

Implementations known to misbehave:

QR-DQN (in progress)

About

The goal of this framework is to provide stable implementations of standard RL algorithms and simultaneously enable fast prototyping of new methods. Some important features include:

Exact reimplementation and competitive performance of original papers
Unified and reusable modules
Clear hierarchical structure and easy code control
Efficient GPU utilization and fast training
Detailed logs of hyperparameters, train and eval scores, git diff, TensorBoard visualizations
Episode video recordings with plots of network outputs
Compatible with OpenAI gym, MuJoCo, PyBullet and Roboschool
Restoring the training process from where it stopped, retraining on a new task, fine-tuning

Installation

Dependencies

Python >= 3.5
Tensorflow >= 1.6.0
OpenAI gym >= 0.9.6
opencv-python (either pip package or OpenCV library with python bindings)
matplotlib (with TkAgg backend)
pybullet (optional)
roboschool (optional)

Install

git clone https://github.com/nikonikolov/rltf.git

pip package coming soon

Documentation

For brief documentation see docs/.

If you use this repository for you research, please cite:

@misc{baselines,
  author = {Nikolay Nikolov},
  title = {RLTF: Reinforcement Learning in TensorFlow},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/nikonikolov/rltf}},
}

rdednl/rltf