whisht120

Pinned Repositories

ARS
An implementation of the Augmented Random Search algorithm
Language:Python0 0 00
Discount_as_Regularizer
Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" ICML 2020
Language:Python0 0 00
IRL-Toolkit
IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)
Language:MATLAB0 0 00
learn500lines
500 Lines or Less
Language:JavaScript0 0 00
nash_q_learning
Language:Python0 0 00
path_tracking_with_MPC-DDP-_and_parameter_least_square_matlab
Language:MATLAB0 0 00
Python-100-Days
Python - 100天从新手到大师
Language:Jupyter Notebook0 0 00
pytorch-handbook
pytorch handbook是一本开源的书籍，目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门，其中包含的Pytorch教程全部通过测试保证可以成功运行
Language:Jupyter Notebook0 0 00
Q-Learning-SARSA-Policy-and-Value-Iteration
Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs (GridWorld, SmallWorld and CliffWorld)
Language:MATLAB0 0 00
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Language:Jupyter Notebook0 0 00

whisht120's Repositories

whisht120/ARS
An implementation of the Augmented Random Search algorithm
Language:Python0 0 00
whisht120/Discount_as_Regularizer
Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" ICML 2020
Language:Python0 0 00
whisht120/IRL-Toolkit
IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)
Language:MATLAB0 0 00
whisht120/learn500lines
500 Lines or Less
Language:JavaScript0 0 00
whisht120/nash_q_learning
Language:Python0 0 00
whisht120/path_tracking_with_MPC-DDP-_and_parameter_least_square_matlab
Language:MATLAB0 0 00
whisht120/Python-100-Days
Python - 100天从新手到大师
Language:Jupyter Notebook0 0 00
whisht120/pytorch-handbook
pytorch handbook是一本开源的书籍，目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门，其中包含的Pytorch教程全部通过测试保证可以成功运行
Language:Jupyter Notebook0 0 00
whisht120/Q-Learning-SARSA-Policy-and-Value-Iteration
Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs (GridWorld, SmallWorld and CliffWorld)
Language:MATLAB0 0 00
whisht120/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Language:Jupyter Notebook0 0 00
whisht120/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Language:Python0 0
whisht120/VFIToolkit-matlab
A Matlab Toolkit for Macroeconomic Models using Value Function Iteration
Language:MATLAB0 0