ppo-pytorch
There are 59 repositories under ppo-pytorch topic.
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Lizhi-sjtu/DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
CherryPieSexy/imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
akjayant/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
faildeny/Multi_Agent_PPO
Multi agent PPO implementation in Pytorch for Unity ML Agents environments.
philtabor/ProtoRL
A Torch Based RL Framework for Rapid Prototyping of Research Papers
jatinarora2702/gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
LittleWebCat/DRL-Base-EMS
DRL-Base-EMS for HEVs
davide97l/PPO-GAIL-cartpole
GAIL learning to imitate PPO playing CartPole.
rvdweerd/simmodel
Solving pursuit-evasion problems on graphs using Reinfocement Learning and GNNs
francofgp/Tic-Tac-Toe-Gym
This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning
paulchen2713/RIS-MISO-HWI-DRL
Worst-case MSE Minimization for RIS-assisted mmWave MU-MISO Systems with Hardware Impairments and CSI Imperfection
wegfawefgawefg/wegs-drl-baselines
Minimum viable reinforcement learning algorithms for your educational convenience.
CherryPieSexy/rl_mario
Reinforcement learning (PPO) plays Mario.
houssameehsain/CutnFill_DeepRL
Positioning a building mass on topography while minimizing the required cut and fill excavation volume using actor critic methods.
Git-123-Hub/reinforcement-learning-algorithm
implementation of reinforcement learning algorithm that is easy to read and understand
nkoorty/rl_parking
Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The project was implemented using Python, and used PyGame, OpenAI Gym, and the Stable Baselines-3 libraries in order to implement a Proximal Policy Optimisation (PPO) algorithm.
SchweizerischeBundesbahnen/flatland-torchrl
An adaption of the Flatland environment for TorchRL.
faildeny/PPO_pytorch_implementation
Proximal Policy Optimization method in Pytorch
imoneoi/xrl-ppo
Automated & super fast PyTorch deep reinforcement learning platform for autonomous driving
rshnn/battleship
Agent trained to play battleship using reinforcement learning (PPO) and openAI gym
akashe/DeepReinforcementLearning
Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.
c2d08y/LearningBot
A deep reinforcement learning Bot for https://kana.byha.top:444/
leonjovanovic/drl-ppo-bipedal-walker
PyTorch application of reinforcement learning Advanced Policy Gradient algorithms in OpenAI BipedalWalker- PPO
Nikunj-Gupta/HAMMER
HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging (Paper: https://ala2021.vub.ac.be/papers/ALA2021_paper_35.pdf)
anshdavid/pytorch-driving-torcs
self driving car using Torcs-1.3.7 simulator with server-patch
GuillermoVR92/Deep-RL-Pong_with_PPO_Agent
Deep RL Agent using Proximal Policy Optimization for solving the Pong game.
leonjovanovic/drl-ml-agents-3dball
PyTorch application of reinforcement learning DDPG and PPO algorithms in Unity 3D-Ball
steph-koopmanschap/PyLife2
The Improved version of PyLife (now with AI)
tomasspangelo/proximal-policy-optimization
An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.
bantu-4879/Atari_Games-Deep_Reinforcement_Learning
This repository hosts Jupyter notebooks showcasing the training of Atari games using a variety of Deep Reinforcement Learning (RL) algorithms such as Proximal Policy Optimization (PPO), Deep Deterministic Policy Gradient (DDPG), Deep Q-Networks (DQN), Advantage Actor-Critic (A2C), and more.
DataRohit/AI-Mario-Game
This is a Deep-Q Learning [Stable Baseline] based AI Mario Game where the Model Incrementally Learns and Improves to Play the Game.
EnriqManComp/smart-disks-PPO
This project aims to find a possible solution to a search problem in a given environment with two players using Proximal Policy Optimization as AI algorithm.
Icyfiremario/PPO-Jumpstart
Basic PPO based AI template