ppo

There are 871 repositories under the ppo topic.

datawhalechina/easy-rl
Chinese-language reinforcement learning tutorial (the Mushroom Book 🍄); read online at https://datawhalechina.github.io/easy-rl/
Language: Jupyter Notebook 12.4k 86 1602.1k
MorvanZhou/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python (Morvan Python) Chinese-language AI teaching
Language: Python 9.3k 291 1955k
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language: Python 8.8k 92 7711.2k
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python 7.9k 38 195843
udacity/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
Language: Jupyter Notebook 5.1k 176 352.4k
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language: Python 4.4k 35 35890
andri27-ts/Reinforcement-Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Language: Jupyter Notebook 4.4k 245 8628
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python 4.2k 52 273942
simoninithomas/Deep_reinforcement_learning_Course
Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch
Language: Jupyter Notebook 3.9k 131 741.2k
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python 3.8k 65 233840
ShangtongZhang/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Language: Python 3.3k 89 92696
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
Language: Python 3.1k 49 41478
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Language: Python 2.9k 12 10365
AI4Finance-Foundation/FinRL-Trading
For trading. Please star.
Language: Jupyter Notebook 2.4k 98 42817
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language: Python 2.2k 6 62398
marlbenchmark/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Language: Python 1.7k 9 102345
kengz/SLM-Lab
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Language: Python 1.3k 46 72277
Khrylx/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language: Python 1.2k 26 37190
vietnh1009/Super-mario-bros-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Language: Python 1.2k 28 26209
ericyangyu/PPO-for-Beginners
A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.
Language: Python 1.1k 12 9134
qfettes/DeepRL-Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Language: Jupyter Notebook 1.1k 30 10328
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language: Python 946 13 2569
agi-brain/xuance
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
Language: Python 944 16 110135
Rafael1s/Deep-Reinforcement-Learning-Algorithms
32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
Language: Jupyter Notebook 940 15 3208
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Language: Python 885 7 2451
sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language: Jupyter Notebook 858 43 3325
rohanpsingh/LearningHumanoidWalking
Training a humanoid robot for locomotion using Reinforcement Learning
Language: Python 823 2 2897
lcswillems/rl-starter-files
RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
Language: Python 702 13 58187
TianhongDai/reinforcement-learning-algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
Language: Python 683 14 10109
cpnota/autonomous-learning-library
A PyTorch library for building deep reinforcement learning agents.
Language: Python 651 21 10471
jianzhnie/LLamaTuner
Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon) Efficient quantized training and deployment of large models.
Language: Python 613 9 4365
archsyscall/DeepRL-TensorFlow2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
Language: Python 609 18 8140
ChenglongChen/pytorch-DRL
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
Language: Python 598 11 7107
Omegastick/pytorch-cpp-rl
PyTorch C++ Reinforcement Learning
Language: C++ 524 19 2087
dongminlee94/deep_rl
PyTorch implementation of deep reinforcement learning algorithms
Language: Python 496 12 459
RLE-Foundation/rllte
Long-Term Evolution Project of Reinforcement Learning
Language: Python 473 54 1486
