Pinned Repositories
ARS
An implementation of the Augmented Random Search algorithm
Discount_as_Regularizer
Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" ICML 2020
IRL-Toolkit
IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)
learn500lines
500 Lines or Less
nash_q_learning
path_tracking_with_MPC-DDP-_and_parameter_least_square_matlab
Python-100-Days
Python - 100天从新手到大师
pytorch-handbook
pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行
Q-Learning-SARSA-Policy-and-Value-Iteration
Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs (GridWorld, SmallWorld and CliffWorld)
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
whisht120's Repositories
whisht120/ARS
An implementation of the Augmented Random Search algorithm
whisht120/Discount_as_Regularizer
Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" ICML 2020
whisht120/IRL-Toolkit
IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)
whisht120/learn500lines
500 Lines or Less
whisht120/nash_q_learning
whisht120/path_tracking_with_MPC-DDP-_and_parameter_least_square_matlab
whisht120/Python-100-Days
Python - 100天从新手到大师
whisht120/pytorch-handbook
pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行
whisht120/Q-Learning-SARSA-Policy-and-Value-Iteration
Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs (GridWorld, SmallWorld and CliffWorld)
whisht120/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
whisht120/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
whisht120/VFIToolkit-matlab
A Matlab Toolkit for Macroeconomic Models using Value Function Iteration