xtma
Ph.D. of Tsinghua University. Interested in Reinforcement Learning and Agent.
Tsinghua University
Pinned Repositories
apo
Average-Reward Reinforcement Learning with Trust Region Methods
dsac
Distributional Soft Actor Critic
msvpo
The official implementation of "Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning"
PGPortfolio
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
pytorch_car_caring
Reinforcement Learning for Gym CarRacing-v0 with PyTorch
ray-maddpg
MADDPG implementation with Ray
simple-pytorch-rl
Reinforcement Learning Methods with PyTorch
vimrc
The ultimate Vim configuration: vimrc
xtma.github.io
xtma's Repositories
xtma/pytorch_car_caring
Reinforcement Learning for Gym CarRacing-v0 with PyTorch
xtma/dsac
Distributional Soft Actor Critic
xtma/simple-pytorch-rl
Reinforcement Learning Methods with PyTorch
xtma/apo
Average-Reward Reinforcement Learning with Trust Region Methods
xtma/msvpo
The official implementation of "Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning"
xtma/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
xtma/ray-maddpg
MADDPG implementation with Ray
xtma/vimrc
The ultimate Vim configuration: vimrc
xtma/xtma.github.io
xtma/PGPortfolio
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
xtma/rl-portfolio-management
Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)
xtma/rlpyt
Reinforcement Learning in PyTorch
xtma/self-play-pong
RoboSchool Pony in Self-Play Mode
xtma/VEM
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.09796)