vanilla-policy-gradient
There are 7 repositories under vanilla-policy-gradient topic.
Ehsan2754/lunarlander_gym
This project is implementation of multiple AI agents based on different Reinforcement Learning methods to OpenAI Gymnasium Lunar-Lander environment which is classic rocket landing trajectory optimization problem.
kanji95/Topics-in-Machine-Learning-CS7.502
Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)
cbanerji/RL_implementations
Pytorch implementation of classic and latest Model-Free RL algorithms.
Directorman9/Gym-minigrid-games
This notebook trains an agent to navigate a maze and reach a desired destination. It uses the Gym-MiniGrid's fourRoom-v0 environment as the maze. The agent is trained by using reiforcement learning's vanilla policy gradient (REINFORCE) algorithm.
bay3s/reinforce-rl
Vanilla Policy Gradient (REINFORCE) implementation with PyTorch
dodoseung/vpg-vanilla-policy-gradient-pytorch
The pytorch implementation of vpg
nslyubaykin/relax_vpg_example
Example VPG implementation with ReLAx