Deep Reinforcement Learning for robotic manuplation

Tensorflow implementation of DDPG for our manuplation system

A WIP Repo

This implementation contains:

DDPG with input of image(use CNN to extract state presentation)
Experience replay memory and history for consecutive 4 frames
- to reduce the correlations between consecutive updates
V-Rep simulation for robotic grasping

What Remains

Pause since 2019.3.30
left things:

action definition and how to use ounoise
what to print out and what to inject to summary and when to save both two networks
the replay memory function need to be adapted to our new model(different action dimension)
about the sess.run and .eval()
main function(where to add session graph)
debug with simulation

And to be honest, although things remaining are not too much and actually the framework is established, maybe recently I'm not going to supplement this algorithm for our grasping system. But it still can provide some ideas on how to use image as input of DDPG.

About V-rep Simulation(environment of deep RL)

please ref to our DQN version

License

MIT License.

QITAOHOU/DDPG_Grasping

Deep Reinforcement Learning for robotic manuplation

What Remains

About V-rep Simulation(environment of deep RL)

License