- The arm is given random control inputs, and the current box location, next box location, and corresponding control input values are saved.
- If the box goes out of the camera frame, the arm is reinitialized at a random location from which the box is visible again.
- The collected data is used to fit the weights of a bilinear model, which predicts the next box location given the current box location and control input (see the sketch after this list).
- Weights for the Q function are initialized to `w = 1, lambda_1 = 1, lambda_2 = 1`.
- Performance of the arm is visualized with these initialized weights.
- The arm tracks the box in most cases, but noticeable oscillations are seen.
- State-action space (in camera frame):
- Fitted Q-iteration is performed with `iterations = 10, batch_samples = 2000, gamma = 0.9, exploration_policy ~ N(current_policy, 0.2)` (see the fitted Q-iteration sketch after this list).
- No oscillations are seen; the arm tracks the box smoothly.
- State-action space (in camera frame):
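The bilinear model mentioned above can be fit with ordinary least squares on the logged (current box location, control input, next box location) triples. The sketch below is an illustration only, not the project's actual script; the array shapes, the exact bilinear parameterization, and the helper names `fit_bilinear_model` / `predict_next` are assumptions.

```python
import numpy as np

def fit_bilinear_model(Y, U, Y_next):
    """Least-squares fit of the change in box location as a
    bilinear + linear + bias function of (box location, control).

    Y      : (N, dy) current box locations in the image
    U      : (N, du) control inputs sent to the arm
    Y_next : (N, dy) box locations observed after applying U
    """
    N, dy = Y.shape
    du = U.shape[1]
    # Features: bilinear terms y_i * u_j, linear terms u_j, and a bias.
    feats = np.hstack([
        (Y[:, :, None] * U[:, None, :]).reshape(N, dy * du),
        U,
        np.ones((N, 1)),
    ])
    # Regress the change in box location onto the features.
    W, _, _, _ = np.linalg.lstsq(feats, Y_next - Y, rcond=None)
    return W

def predict_next(W, y, u):
    """Predict the next box location for a single (y, u) pair."""
    feats = np.concatenate([np.outer(y, u).ravel(), u, [1.0]])
    return y + feats @ W
```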
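Given such a one-step predictor, the fitted Q-iteration setup above (weights initialized to `w = 1, lambda_1 = 1, lambda_2 = 1`, 10 iterations, 2000 samples per batch, `gamma = 0.9`, Gaussian exploration around the current policy) can be sketched as follows. The cost definition, the exact Q-function features, and every name below are assumptions for illustration only; the actual formulation follows the Lee et al. servoing paper listed in the references.

```python
import numpy as np

GAMMA = 0.9        # discount factor
N_ITERS = 10       # fitted Q-iteration rounds
BATCH = 2000       # transitions regressed per round
EXPLORE_STD = 0.2  # std of the Gaussian exploration noise

def q_features(y, u, target, predict_next):
    """Assumed features matching the weights (w, lambda_1, lambda_2):
    predicted squared tracking error plus two action penalties."""
    err = predict_next(y, u) - target
    return np.array([err @ err, u @ u, np.abs(u).sum()])

def q_value(theta, y, u, target, predict_next):
    # theta = (w, lambda_1, lambda_2); Q is linear in these weights.
    return q_features(y, u, target, predict_next) @ theta

def greedy_action(theta, y, target, predict_next, candidates):
    # candidates: list of candidate control vectors (numpy arrays).
    vals = [q_value(theta, y, u, target, predict_next) for u in candidates]
    return candidates[int(np.argmin(vals))]  # Q models cost-to-go, so minimize

def explore_action(theta, y, target, predict_next, candidates):
    """Exploration policy: Gaussian noise (std 0.2) around the current greedy action."""
    u = greedy_action(theta, y, target, predict_next, candidates)
    return u + np.random.normal(0.0, EXPLORE_STD, size=u.shape)

def fitted_q_iteration(transitions, target, predict_next, candidates):
    """transitions: list of (y, u, cost, y_next) tuples, e.g. with cost equal to
    the squared distance of the next box location from the desired image location."""
    theta = np.array([1.0, 1.0, 1.0])  # initial (w, lambda_1, lambda_2)
    for _ in range(N_ITERS):
        idx = np.random.choice(len(transitions), BATCH)
        X, bellman_targets = [], []
        for i in idx:
            y, u, cost, y_next = transitions[i]
            u_next = greedy_action(theta, y_next, target, predict_next, candidates)
            # Bellman target: immediate cost + discounted cost-to-go at the next state.
            bellman_targets.append(
                cost + GAMMA * q_value(theta, y_next, u_next, target, predict_next))
            X.append(q_features(y, u, target, predict_next))
        # Refit the Q-function weights on this batch.
        theta, _, _, _ = np.linalg.lstsq(np.array(X), np.array(bellman_targets), rcond=None)
    return theta
```

Here Q models a cost-to-go rather than a reward, so the greedy policy minimizes it; with a reward convention the `argmin` would become an `argmax`.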
Dependencies:
- scipy
- pyyaml
- rospkg
- defusedxml
- theano
- netifaces
To build and run:
- git clone
- catkin_make
- source devel/setup.bash
- roscore
- cd ./catkin_ws_py/src/simple_arm/scripts/
- python testing_visual_servoing.py
- Data Collection:
https://www.youtube.com/watch?v=fxEXsMEiook&feature=emb_title
- Results with Zero Iterations:
https://www.youtube.com/watch?v=f0ve1BEQK2k&feature=emb_title
- Results with Fitted Q-Iterations:
https://www.youtube.com/watch?v=u14srrIapOg&feature=emb_title
To launch Robot Arm Nodes Alone (without Q Learning Node):
- source devel/setup.bash
- roslaunch simple_arm robot_spawn.launch
Python scripts located at:
catkin_ws_py/src/simple_arm/scripts
- Richard S. Sutton, Andrew Barto, "Reinforcement Learning: An Introduction"
- Alex Lee, et al., "Learning Visual Servoing with Deep Features and Fitted Q-Iteration" (paper, code/notes/slides)
- Udacity Simple Arm URDF (link)