iitimii/Gradient-Based-Reinforcement-Learning-Exploration
Experimental exploration of gradient-based methods in RL. Features a simple naive algorithm derivation, REINFORCE implementation, in the CartPole environment. Bridges supervised and reinforcement learning paradigms
Jupyter Notebook