trust-region-policy-optimization
There are 15 repositories under the trust-region-policy-optimization topic.
TianhongDai/reinforcement-learning-algorithms
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
funnydman/BFGS-NelderMead-TrustRegion
Python implementation of some numerical (optimization) methods
GioStamoulos/BTC_RL_Trading_Bot
A Bitcoin trading agent built with deep reinforcement learning implementations.
MahanFathi/TRPO-TensorFlow
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Akella17/Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
hcnoh/rl-collection-pytorch
A collection of Reinforcement Learning implementations with PyTorch
khansel01/nes-npg
Benchmarking the Natural Gradient in Policy Gradient Methods and Evolution Strategies
YixiongRen/Dynamics
Work on solving nonlinear dynamic systems
kparnis3/Final-Year-Project
Undergraduate Dissertation (University of Malta) 2020-2023 - 'Autonomous Drone Control using Reinforcement Learning'
waynemystir/deep-RL-bootcamp
My solutions to the labs from this bootcamp:
LihangLiu/CS395T-Numerical-Optimization
Course projects of CS395T Numerical Optimization, UT Austin
dodoseung/trpo-trust-region-policy-optimization-pytorch
A PyTorch implementation of TRPO
nslyubaykin/trpo_schedule_kl
Scheduling TRPO's KL Divergence Constraint