/td3_her_rlbench_reacher

A implementation for soving reach target task based on TD3 with HER using PaddlePaddle.

Primary LanguagePythonBSD 2-Clause "Simplified" LicenseBSD-2-Clause

td3_her_rlbench_reacher

Author: CAO RUI

A implementation for solving reach target task based on Twin Delayed DDPG(TD3) with Hindsight Experience Replay(HER) using PaddlePaddle.

image

Prerequisites

Python: Python 3.6+

PaddlePaddle : Deep learning framework

PARL : Reinforcement learning toolbox based on PaddlePaddle

gym : Universal environment builder for RL tasks

RLBench: RL tasks extension for robotics researches.

Install

First, create a virtual environment by virtualenv, in it, install PaddlePaddle, gym and PARL by

pip install requirements.txt

Then install RLBench via RLBench.

Train

python rlbench_reach_td3_train.py

Evaluate

python rlbench_reach_td3_eval.py

Result

4 stages (initial model, after 40000-episode trained model, after 80000-episode trained model, fianl model) of training model are uploaded and the corresponded render results are recorded in the folder records.

The success rate as shown in the following figure, here every epoch equals to 200 training episodes. image