vanilla-policy-gradient

There are 7 repositories under vanilla-policy-gradient topic.

Ehsan2754/lunarlander_gym
This project is implementation of multiple AI agents based on different Reinforcement Learning methods to OpenAI Gymnasium Lunar-Lander environment which is classic rocket landing trajectory optimization problem.
Language:Python3 2 00
kanji95/Topics-in-Machine-Learning-CS7.502
Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)
Language:Jupyter Notebook1 1 01
cbanerji/RL_implementations
Pytorch implementation of classic and latest Model-Free RL algorithms.
Language:Python0 1 00
Directorman9/Gym-minigrid-games
This notebook trains an agent to navigate a maze and reach a desired destination. It uses the Gym-MiniGrid's fourRoom-v0 environment as the maze. The agent is trained by using reiforcement learning's vanilla policy gradient (REINFORCE) algorithm.
0 1 00
bay3s/reinforce-rl
Vanilla Policy Gradient (REINFORCE) implementation with PyTorch
Language:Jupyter Notebook2 0
dodoseung/vpg-vanilla-policy-gradient-pytorch
The pytorch implementation of vpg
Language:Python1 0
nslyubaykin/relax_vpg_example
Example VPG implementation with ReLAx
Language:Jupyter Notebook1 0