Yuantian013

Pinned Repositories

DDPG-CARTPOLE
Stable and robust control of a cartpole with DDPG in continuous actions
Language:Python93
Deep-Policy-Compression
Bayesian Policy Network Reduction in Deep Reinforcement Learning
Language:Python10
E2GAN
[ECCV 2020] "Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search" by Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink
Language:Python41 5 111
GLC-abandon
Guaranteed Learning Control
Language:Python11
Guarantee_Learning_Control
Model Free Reinforcement Learning with Control Theoretic Guarantee
Language:Python31
Kronecker_Product
Kronecker_Product in TensorFlow
Language:Python21
Project
Language:Matlab10
RL_COMPRESSION
Language:Python3 2 00
RL_QUADROTOR
Language:Python10
TDOM-AC
Multi-agent Actor-Critic with Time Dynamical Opponent Model
Language:Python7 2 12

Yuantian013's Repositories

Yuantian013/E2GAN
[ECCV 2020] "Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search" by Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink
Language:Python41 5 111
Yuantian013/DDPG-CARTPOLE
Stable and robust control of a cartpole with DDPG in continuous actions
Language:Python93
Yuantian013/TDOM-AC
Multi-agent Actor-Critic with Time Dynamical Opponent Model
Language:Python7 2 12
Yuantian013/Guarantee_Learning_Control
Model Free Reinforcement Learning with Control Theoretic Guarantee
Language:Python31
Yuantian013/RL_COMPRESSION
Language:Python3 2 00
Yuantian013/Kronecker_Product
Kronecker_Product in TensorFlow
Language:Python21
Yuantian013/Deep-Policy-Compression
Bayesian Policy Network Reduction in Deep Reinforcement Learning
Language:Python10
Yuantian013/GLC-abandon
Guaranteed Learning Control
Language:Python11
Yuantian013/Project
Language:Matlab10
Yuantian013/RL_QUADROTOR
Language:Python10
Yuantian013/Sym-Q
An official PyTorch implementation for the paper "Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making".
1
Yuantian013/AGTF30
Yuantian013/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python
Yuantian013/cpo
Constrained Policy Optimization
Language:Python
Yuantian013/DDPG
Reimplementation of DDPG (Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + TensorFlow
Language:Python
Yuantian013/deep-symbolic-optimization
Source code for deep symbolic optimization.
Language:Python
Yuantian013/E2GAN_Industrial
Language:Python
Yuantian013/gym-soccer
Language:Python
Yuantian013/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Yuantian013/maddpg-mpe
Transplants an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs).
Yuantian013/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Language:Python
Yuantian013/pytorch-maddpg
A PyTorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Yuantian013/SAC
Soft-Actor-Critic
Yuantian013/SCLSAC
SCLSAC
Language:Python
Yuantian013/v139
Proceedings of ICML 2021
