ppo-pytorch

There are 59 repositories under the ppo-pytorch topic.

nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language: Python 1.5k 9 61330
Lizhi-sjtu/DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.
Language: Python 934 5 12153
CherryPieSexy/imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Language: Python 129 5 314
akjayant/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
Language: Python 25 2 110
faildeny/Multi_Agent_PPO
Multi-agent PPO implementation in PyTorch for Unity ML-Agents environments.
Language: Python 22 4 13
philtabor/ProtoRL
A Torch Based RL Framework for Rapid Prototyping of Research Papers
Language: Python 22 2 40
jatinarora2702/gail-pytorch
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
Language: Python 17 2 06
LittleWebCat/DRL-Base-EMS
DRL-Base-EMS for HEVs
Language: HTML 16 1 51
davide97l/PPO-GAIL-cartpole
GAIL learning to imitate PPO playing CartPole.
Language: Jupyter Notebook 12 1 04
rvdweerd/simmodel
Solving pursuit-evasion problems on graphs using Reinforcement Learning and GNNs
Language: Python 12 3 01
francofgp/Tic-Tac-Toe-Gym
This is the Tic-Tac-Toe game made with Python using the PyGame library and the Gym library to implement the AI with Reinforcement Learning
Language: Python 8 1 01
paulchen2713/RIS-MISO-HWI-DRL
Worst-case MSE Minimization for RIS-assisted mmWave MU-MISO Systems with Hardware Impairments and CSI Imperfection
Language: Python 8 2 01
wegfawefgawefg/wegs-drl-baselines
Minimum viable reinforcement learning algorithms for your educational convenience.
Language: Python 8 3 00
CherryPieSexy/rl_mario
Reinforcement learning (PPO) plays Mario.
Language: Python 7 1 00
houssameehsain/CutnFill_DeepRL
Positioning a building mass on topography while minimizing the required cut and fill excavation volume using actor-critic methods.
Language: Python 7 1 00
Git-123-Hub/reinforcement-learning-algorithm
Implementations of reinforcement learning algorithms that are easy to read and understand
Language: Python 6 1 00
nkoorty/rl_parking
Repository with all source files relating to the 6CCE3EEP Final Year Project titled "Self Parking with Reinforcement Learning." The project was implemented using Python, and used PyGame, OpenAI Gym, and the Stable Baselines-3 libraries in order to implement a Proximal Policy Optimisation (PPO) algorithm.
Language: Python 6 1 00
SchweizerischeBundesbahnen/flatland-torchrl
An adaption of the Flatland environment for TorchRL.
Language: Python 6 2 0
alirezakazemipour/Mario-PPO
Language: Python 4 2 00
faildeny/PPO_pytorch_implementation
Proximal Policy Optimization method in PyTorch
Language: Python 4 3 01
imoneoi/xrl-ppo
Automated & super fast PyTorch deep reinforcement learning platform for autonomous driving
Language: Python 4 3 01
rshnn/battleship
Agent trained to play Battleship using reinforcement learning (PPO) and OpenAI Gym
Language: Jupyter Notebook 4 2 01
akashe/DeepReinforcementLearning
Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in PyTorch. Tested Env: LunarLander-v2 and Pendulum-v0.
Language: Python 3 1 11
c2d08y/LearningBot
A deep reinforcement learning bot for https://kana.byha.top:444/
Language: Python 3 1 00
leonjovanovic/drl-ppo-bipedal-walker
PyTorch application of the advanced policy gradient reinforcement learning algorithm PPO in OpenAI BipedalWalker
Language: Python 3 2 00
Nikunj-Gupta/HAMMER
HAMMER: Multi-Level Coordination of Reinforcement Learning Agents via Learned Messaging (Paper: https://ala2021.vub.ac.be/papers/ALA2021_paper_35.pdf)
Language: Python 3 3 00
alex-nooj/champion_league
Language: Python 2 1 50
anshdavid/pytorch-driving-torcs
Self-driving car using the Torcs-1.3.7 simulator with server-patch
Language: Python 2 1 00
GuillermoVR92/Deep-RL-Pong_with_PPO_Agent
Deep RL Agent using Proximal Policy Optimization for solving the Pong game.
Language: Jupyter Notebook 2 1 00
leonjovanovic/drl-ml-agents-3dball
PyTorch application of reinforcement learning DDPG and PPO algorithms in Unity 3D-Ball
Language: Python 2 2 00
steph-koopmanschap/PyLife2
The improved version of PyLife (now with AI)
Language: Python 2 3 00
tomasspangelo/proximal-policy-optimization
An implementation of Proximal Policy Optimization, a state-of-the-art family of reinforcement learning algorithms, using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.
Language: Python 2 2 00
bantu-4879/Atari_Games-Deep_Reinforcement_Learning
This repository hosts Jupyter notebooks showcasing the training of Atari games using a variety of Deep Reinforcement Learning (RL) algorithms such as Proximal Policy Optimization (PPO), Deep Deterministic Policy Gradient (DDPG), Deep Q-Networks (DQN), Advantage Actor-Critic (A2C), and more.
Language: Jupyter Notebook 10
DataRohit/AI-Mario-Game
This is a Deep Q-Learning [Stable Baseline] based AI Mario game where the model incrementally learns and improves to play the game.
Language: Jupyter Notebook 1 1 00
EnriqManComp/smart-disks-PPO
This project aims to find a possible solution to a search problem in a given environment with two players, using Proximal Policy Optimization as the AI algorithm.
Language: Python 10
Icyfiremario/PPO-Jumpstart
Basic PPO-based AI template
Language: Python 10