Exercise to implement DQN on Pytorch Maybe I will write a blog post to document stuff.
On cartpole ver 1. Seperate branch working on invert pendulm problem. Planning to compare to LQR as well.
Reference and credit:
https://www.youtube.com/watch?v=UlJzzLYgYoE
https://www.youtube.com/watch?v=2pWv7GOvuf0&list=PLqYmG7hTraZDM-OYHWgPebj2MfCFzFObQ