whisht120's Stars
1c7/chinese-independent-developer
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻 List of projects by independent developers -- sharing what everyone is working on
timqian/chinese-independent-blogs
List of independent Chinese-language blogs
WeNeedHome/SummaryOfLoanSuspension
Summary of loan suspension notices from provinces and cities nationwide
ustctug/ustcthesis
LaTeX template for USTC thesis
mlii/mfrl
Mean Field Multi-Agent Reinforcement Learning
kentsommer/pytorch-value-iteration-networks
PyTorch implementation of Value Iteration Networks (NIPS 2016 best paper)
avivt/VIN
Value Iteration Networks
yu-jiang/radpbook
Source code for examples in the book "Robust Adaptive Dynamic Programming"
savinay95n/Reinforcement-learning-Algorithms-and-Dynamic-Programming
Reinforcement learning algorithms such as SARSA, Q-learning, Actor-Critic Policy Gradient, and Value Function Approximation were applied to stabilize an inverted pendulum system and achieve optimal control, establishing the concept of reinforcement learning controllers. The RL controllers are compared with one another on performance and efficiency, and each is separately compared with the classical Linear Quadratic Regulator controller. Each RL controller is integrated with a swing-up controller. A virtual switch toggles between the swing-up controller and the RL controller automatically, based on the value of the angular deviation theta with respect to the vertical plane. My research paper and my undergraduate thesis have been uploaded for reference, along with all the code.
TSummersLab/Distributionally-robust-stochastic-OPF
vfitoolkit/VFIToolkit-matlab
A Matlab Toolkit for Macroeconomic Models using Value Function Iteration
FarnazAdib/Crash_course_on_RL
This is a self-contained repository to explain two basic Reinforcement Learning (RL) algorithms.
kunqian2025/reinforcement-learning
Implementation of Single-Agent and Multi-Agent Reinforcement Learning Algorithms. MATLAB.
MathewMithraNoel/Reinforcement-Learning-for-Nonlinear-Control
Control of a nonlinear liquid level system using a new artificial neural network based reinforcement learning approach
modestyachts/robust-adaptive-lqr
Implementation of robust adaptive control methods for the linear quadratic regulator
Riashat/Q-Learning-SARSA-Policy-and-Value-Iteration
Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy Iteration and Value Iteration) on benchmark RL MDPs (GridWorld, SmallWorld and CliffWorld)
vroulet/ilqc
Iterative Linearized Control Toolbox
nuria95/O-RAAC
Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting
Riashat/Policy-Gradient-Reinforcement-Learning
sparisi/td-reg
TD-Regularized Actor-Critic Methods
sufengniu/GVIN
Generalized Value Iteration Network
instadeepai/EGTA-NMARL
Experiments for performing empirical game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning.
msareebhakak/Reinforcement-Learning-based-LQR
Reinforcement based gain calculation for a tracking LQR using actor-critic method
jgeisler0303/DDP-Generator
Generate tailored code for Differential Dynamic Programming (DDP) aka Iterative Linear Quadratic Gaussian (iLQG) solvers for finite time Optimal Control Problems (OCP)
idaohang/Linear-Quadratic-Regulator-with-KF
LQR combined with a Kalman Filter, example developed in Simulink/Matlab
ron-amit/Discount_as_Regularizer
Code for the paper "Discount Factor as a Regularizer in Reinforcement Learning" ICML 2020
JapSethi/Linear-Quadratic-Gaussian-Control-Inverted-Pendulum-On-A-Cart
Mini side project to check observability and perform optimal linear full-state estimation of an inverted pendulum on a cart
EmmanuelOwusu/-Linear-Quadratic-Regulators
This is a presentation on Linear Quadratic Regulators
I2RLab/RegretMeasurement-GUI
This is a survey instrument coded in Matlab for quantitative measurement under regret theory. Regret theory is a model of human-like decision-making that can describe both risk-seeking and risk-averse behaviors. The explanation of the HCI design is published at the IFAC Conference on Cyber-Physical and Human Systems 2018. The preprint is available at https://arxiv.org/abs/1810.00462.
cvxgrp/extquadcontrol