trust-region-policy-optimization

There are 15 repositories under the trust-region-policy-optimization topic.

TianhongDai/reinforcement-learning-algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
Language: Python 649 15 10104
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Language: Python 416 13 2092
funnydman/BFGS-NelderMead-TrustRegion
Python implementation of some numerical (optimization) methods
Language: Python 29 1 03
GioStamoulos/BTC_RL_Trading_Bot
A Bitcoin trading agent built with deep reinforcement learning.
Language: Jupyter Notebook 27 2 06
MahanFathi/TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Language: Python 18 2 29
Akella17/Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Language: Python 16 4 17
hcnoh/rl-collection-pytorch
A collection of Reinforcement Learning implementations with PyTorch
Language: Python 13 1 01
khansel01/nes-npg
Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies
Language: Python 9 0 00
RLOpensource/spinning_up_kr
Language: Python 6 3 03
YixiongRen/Dynamics
Work on solving nonlinear dynamic systems
Language: MATLAB 5 3 02
kparnis3/Final-Year-Project
Undergraduate Dissertation (University of Malta) 2020-2023 - 'Autonomous Drone Control using Reinforcement Learning'
Language: Jupyter Notebook 4 2 00
waynemystir/deep-RL-bootcamp
My solutions to the labs from this bootcamp:
Language: Jupyter Notebook 3 3 00
LihangLiu/CS395T-Numerical-Optimization
Course projects of CS395T Numerical Optimization, UT Austin
Language: Python 2 2 02
dodoseung/trpo-trust-region-policy-optimization-pytorch
A PyTorch implementation of TRPO
Language: Python 1 0
nslyubaykin/trpo_schedule_kl
Scheduling TRPO's KL Divergence Constraint
Language: Jupyter Notebook 1 0