whisht120

whisht120's Stars

1c7/chinese-independent-developer
👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻**独立开发者项目列表 -- 分享大家都在做什么
37.3k 1.2k 1433.1k
timqian/chinese-independent-blogs
中文独立博客列表
Language:Python20.4k 292 812.5k
WeNeedHome/SummaryOfLoanSuspension
全国各省市停贷通知汇总
Language:HTML20.3k 325 02.1k
ustctug/ustcthesis
LaTeX template for USTC thesis
Language:TeX1.6k 35 337397
mlii/mfrl
Mean Field Multi-Agent Reinforcement Learning
Language:Python374 10 30100
kentsommer/pytorch-value-iteration-networks
Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)
Language:Python318 9 1162
avivt/VIN
Value Iteration Networks
Language:Python288 19 1469
yu-jiang/radpbook
Source code for examples in Book "Robust Adaptive Dynamic Programming"
Language:MATLAB118 7 745
savinay95n/Reinforcement-learning-Algorithms-and-Dynamic-Programming
Reinforcement learning Algorithms such as SARSA, Q learning, Actor-Critic Policy Gradient and Value Function Approximation were applied to stabilize an inverted pendulum system and achieve optimal control. So essentially, the concept of Reinforcement Learning Controllers has been established. The Reinforcement Learning Controllers have been compared on the basis of performance and efficiency and they are separately compared with the classical Linear Quadratic Regulator Controller. Each of the RL controller have been integrated with a Swing up controller. A virtual switch toggles between the Swing up controller and the RL controller automatically, based on the value of the angular deviation theta with respect to the vertical plane. My research paper and my undergraduate thesis have been uploaded for reference. All the codes have also been uploaded.
Language:MATLAB101 7 025
TSummersLab/Distributionally-robust-stochastic-OPF
Language:MATLAB78 2 228
vfitoolkit/VFIToolkit-matlab
A Matlab Toolkit for Macroeconomic Models using Value Function Iteration
Language:MATLAB77 11 751
FarnazAdib/Crash_course_on_RL
This is a self-contained repository to explain two basic Reinforcement (RL) algorithms.
Language:Jupyter Notebook73 4 117
kunqian2025/reinforcement-learning
Implementation of Single-Agent and Multi-Agent Reinforcement Learning Algorithms. MATLAB.
Language:Matlab57 1 027
MathewMithraNoel/Reinforcement-Learning-for-Nonlinear-Control
Control of a nonlinear liquid level system using a new artificial neural network based reinforcement learning approach
Language:MATLAB38 0 022
modestyachts/robust-adaptive-lqr
Implementation of robust adaptive control methods for the linear quadratic regulator
Language:Jupyter Notebook36 5 010
Riashat/Q-Learning-SARSA-Policy-and-Value-Iteration
Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs (GridWorld, SmallWorld and CliffWorld)
Language:MATLAB36 6 015
vroulet/ilqc
Iterative Linearized Control Toolbox
Language:Jupyter Notebook34 1 28
nuria95/O-RAAC
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
Language:Python33 2 24
Riashat/Policy-Gradient-Reinforcement-Learning
Language:Matlab33 4 010
sparisi/td-reg
TD-Regularized Actor-Critic Methods
Language:MATLAB32 1 26
sufengniu/GVIN
Generalized Value Iteration Network
Language:Python23 6 410
instadeepai/EGTA-NMARL
Experiments for performing empirical game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning.
Language:Python16 5 02
msareebhakak/Reinforcement-Learning-based-LQR
Reinforcement based gain calculation for a tracking LQR using actor-critic method
Language:MATLAB16 2 09
jgeisler0303/DDP-Generator
Generate taylored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time Optimal Control Problems (OCP)
Language:C15 3 32
idaohang/Linear-Quadratic-Regulator-with-KF
LQR combined with a Kalman Filter, example developed in Simulink/Matlab
Language:MATLAB8 2 03
ron-amit/Discount_as_Regularizer
Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" ICML 2020
Language:Python62
JapSethi/Linear-Quadratic-Gaussian-Control-Inverted-Pendulum-On-A-Cart
Mini Side Project to check observability and do optimal linear full state estimation of Inverted Pendulum on a Cart
Language:MATLAB4 3 03
EmmanuelOwusu/-Linear-Quadratic-Regulators
This is a presentation on Linear Quadratic Regulators
30
I2RLab/RegretMeasurement-GUI
This is a survey instrument coded in Matlab for quantitative measure of regret theory. Regret theory is a model for human-like decision-making which can describe the risk-seeking and risk-averse behaviors. The explanation of the HCI design is published in IFAC conference on Cyber Physical and Human System 2018. The preprint is available at https://arxiv.org/abs/1810.00462.
Language:MATLAB2 3 01
cvxgrp/extquadcontrol
Language:Python1 3 0