Pinned Repositories
DDPG-CARTPOLE
Stable and robust control of a cartpole with DDPG in continuous action spaces
Deep-Policy-Compression
Bayesian Policy Network Reduction in Deep Reinforcement Learning
E2GAN
[ECCV 2020] "Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search" by Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink
GLC-abandon
Guaranteed Learning Control
Guarantee_Learning_Control
Model-Free Reinforcement Learning with Control-Theoretic Guarantee
Kronecker_Product
Kronecker_Product in TensorFlow
Project
RL_COMPRESSION
RL_QUADROTOR
TDOM-AC
Multi-agent Actor-Critic with Time Dynamical Opponent Model
Yuantian013's Repositories
Yuantian013/E2GAN
[ECCV 2020] "Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search" by Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink
Yuantian013/DDPG-CARTPOLE
Stable and robust control of a cartpole with DDPG in continuous action spaces
Yuantian013/TDOM-AC
Multi-agent Actor-Critic with Time Dynamical Opponent Model
Yuantian013/Guarantee_Learning_Control
Model-Free Reinforcement Learning with Control-Theoretic Guarantee
Yuantian013/RL_COMPRESSION
Yuantian013/Kronecker_Product
Kronecker_Product in TensorFlow
Yuantian013/Deep-Policy-Compression
Bayesian Policy Network Reduction in Deep Reinforcement Learning
Yuantian013/GLC-abandon
Guaranteed Learning Control
Yuantian013/Project
Yuantian013/RL_QUADROTOR
Yuantian013/Sym-Q
An official PyTorch implementation for the paper "Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making".
Yuantian013/AGTF30
Yuantian013/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Yuantian013/cpo
Constrained Policy Optimization
Yuantian013/DDPG
Reimplementation of DDPG (Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + TensorFlow
Yuantian013/deep-symbolic-optimization
Source code for deep symbolic optimization.
Yuantian013/E2GAN_Industrial
Yuantian013/gym-soccer
Yuantian013/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Yuantian013/maddpg-mpe
Port of an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs).
Yuantian013/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Yuantian013/pytorch-maddpg
A PyTorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Yuantian013/SAC
Soft Actor-Critic
Yuantian013/SCLSAC
SCLSAC
Yuantian013/v139
Proceedings of ICML 2021