21stholmes

Pinned Repositories

21stholmes.github.io
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language: Python
DunkCityDynasty
Language: Python
jekyll-theme-prologue
A Jekyll version of the "Prologue" theme by HTML5 UP
Language: CSS
Machine-learning-learning-notes
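Zhou Zhihua's "Machine Learning", also known as the Watermelon Book, is a fairly comprehensive text that covers in detail the different types of algorithms in machine learning (for example: supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, ensemble learning, dimensionality reduction, and feature selection). These notes record my own understanding and supplementary knowledge gathered while studying it, in the hope that they help newcomers reading the Watermelon Book.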
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python
PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language: Python
random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
Language: Python

21stholmes's Repositories

21stholmes/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Language: Python
21stholmes/21stholmes.github.io
21stholmes/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language: Python
21stholmes/DunkCityDynasty
Language: Python
21stholmes/jekyll-theme-prologue
A Jekyll version of the "Prologue" theme by HTML5 UP
Language: CSS
21stholmes/Machine-learning-learning-notes
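Zhou Zhihua's "Machine Learning", also known as the Watermelon Book, is a fairly comprehensive text that covers in detail the different types of algorithms in machine learning (for example: supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, ensemble learning, dimensionality reduction, and feature selection). These notes record my own understanding and supplementary knowledge gathered while studying it, in the hope that they help newcomers reading the Watermelon Book.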
21stholmes/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python
21stholmes/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
Language: Python