on-policy
There are 15 repositories under on-policy topic.
MarcoMeter/episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
MarcoMeter/recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
wisnunugroho21/reinforcement_learning_truly_ppo
Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch
wisnunugroho21/reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
narjesno/Reinforcement-Learning
This repository contains all of the Reinforcement Learning-related projects I've worked on. The projects are part of the graduate course at the University of Tehran.
amirhosein-mesbah/Reinforcement_learning
This repository contains the implementation of a wide variety of Reinforcement Learning Projects in different applications of Bandit Algorithms, MDPs, Distributed RL and Deep RL. These projects include university projects and projects implemented due to interest in Reinforcement Learning.
kristogj/on-policy-mcts
Monte Carlo Search Tree for training shared Actor-Critic-Network on the game Hex🏋️
nima-siboni/simplest-world-Actor-Critic
Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world
OpenRL-Lab/RL_Tutorial
Reinforcement Learning Tutorial (强化学习教程)
BY571/pytorch-vmpo
PyTorch implementation of V-MPO
TheUnsolvedDev/ReinforcementLearning
Repository containing basic algorithm applied in python.
fardinabbasi/Tabulated_RL
Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili
mabirck/CS294-DeepRL
My content of CS294 Deep Reinforcement Learning course, conduced by Sergey Levine from UC Berkeley.
SPozder3/RLFinanceProject
Stock Portfolio Management using tabular and deep Q-learning methods - extension of FinRL repo
srefsland/deep-rl-mcts
On-policy MCTS combined with deep learning to train an actor-critic neural network that plays Hex (Con-tac-tix).