on-policy

There are 15 repositories under on-policy topic.

MarcoMeter/episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
Language:Python128 4 1415
MarcoMeter/recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
Language:Jupyter Notebook115 4 1115
wisnunugroho21/reinforcement_learning_truly_ppo
Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch
Language:Python17 2 11
wisnunugroho21/reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
Language:Python16 2 01
narjesno/Reinforcement-Learning
This repository contains all of the Reinforcement Learning-related projects I've worked on. The projects are part of the graduate course at the University of Tehran.
Language:HTML5 1 00
amirhosein-mesbah/Reinforcement_learning
This repository contains the implementation of a wide variety of Reinforcement Learning Projects in different applications of Bandit Algorithms, MDPs, Distributed RL and Deep RL. These projects include university projects and projects implemented due to interest in Reinforcement Learning.
Language:Jupyter Notebook4 2 00
kristogj/on-policy-mcts
Monte Carlo Search Tree for training shared Actor-Critic-Network on the game Hex🏋️
Language:Python4 1 00
nima-siboni/simplest-world-Actor-Critic
Reinforcement learning, Policy Gradient, Actor-Critic, AC, Agent-based Simulation, Simple-world
Language:Python4 1 01
OpenRL-Lab/RL_Tutorial
Reinforcement Learning Tutorial (强化学习教程)
4 1 00
BY571/pytorch-vmpo
PyTorch implementation of V-MPO
Language:Python3 1 0
TheUnsolvedDev/ReinforcementLearning
Repository containing basic algorithm applied in python.
Language:Jupyter Notebook2 1 01
fardinabbasi/Tabulated_RL
Interactive Learning [ECE 641] - Fall 2023 - University of Tehran - Prof. Nili
1 1 0
mabirck/CS294-DeepRL
My content of CS294 Deep Reinforcement Learning course, conduced by Sergey Levine from UC Berkeley.
Language:Python1 2 0
SPozder3/RLFinanceProject
Stock Portfolio Management using tabular and deep Q-learning methods - extension of FinRL repo
Language:Jupyter Notebook1
srefsland/deep-rl-mcts
On-policy MCTS combined with deep learning to train an actor-critic neural network that plays Hex (Con-tac-tix).
Language:Python0 2 00

on-policy

MarcoMeter/episodic-transformer-memory-ppo

MarcoMeter/recurrent-ppo-truncated-bptt

wisnunugroho21/reinforcement_learning_truly_ppo

wisnunugroho21/reinforcement_learning_v_mpo

narjesno/Reinforcement-Learning

amirhosein-mesbah/Reinforcement_learning

kristogj/on-policy-mcts

nima-siboni/simplest-world-Actor-Critic

OpenRL-Lab/RL_Tutorial

BY571/pytorch-vmpo

TheUnsolvedDev/ReinforcementLearning

fardinabbasi/Tabulated_RL

mabirck/CS294-DeepRL

SPozder3/RLFinanceProject

srefsland/deep-rl-mcts