Pinned Repositories
21stholmes.github.io
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
DunkCityDynasty
jekyll-theme-prologue
A Jekyll version of the "Prologue" theme by HTML5 UP
Machine-learning-learning-notes
周志华《机器学习》又称西瓜书是一本较为全面的书籍,书中详细介绍了机器学习领域不同类型的算法(例如:监督学习、无监督学习、半监督学习、强化学习、集成降维、特征选择等),记录了本人在学习过程中的理解思路与扩展知识点,希望对新人阅读西瓜书有所帮助!
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
21stholmes's Repositories
21stholmes/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
21stholmes/21stholmes.github.io
21stholmes/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
21stholmes/DunkCityDynasty
21stholmes/jekyll-theme-prologue
A Jekyll version of the "Prologue" theme by HTML5 UP
21stholmes/Machine-learning-learning-notes
周志华《机器学习》又称西瓜书是一本较为全面的书籍,书中详细介绍了机器学习领域不同类型的算法(例如:监督学习、无监督学习、半监督学习、强化学习、集成降维、特征选择等),记录了本人在学习过程中的理解思路与扩展知识点,希望对新人阅读西瓜书有所帮助!
21stholmes/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
21stholmes/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"