ZhuoranYang

Pinned Repositories

Algorithms-1
Data Structures and Algorithms in Python
Language:Python1 2 00
awesome-courses
List of awesome university courses for learning Computer Science!
2 2 00
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python1 2 00
Python-for-Signal-Processing
Notebooks for "Python for Signal Processing" book
Language:Python3 2 01
soft_dqn
Soft DQN algorithm
Language:Python1 2 00

ZhuoranYang's Repositories

ZhuoranYang/Python-for-Signal-Processing
Notebooks for "Python for Signal Processing" book
Language:Python3 2 01
ZhuoranYang/awesome-courses
List of awesome university courses for learning Computer Science!
2 2 00
ZhuoranYang/Algorithms-1
Data Structures and Algorithms in Python
Language:Python1 2 00
ZhuoranYang/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python1 2 00
ZhuoranYang/soft_dqn
Soft DQN algorithm
Language:Python1 2 00
ZhuoranYang/algforopt-notebooks
Jupyter notebooks associated with the Algorithms for Optimization textbook
Language:Jupyter Notebook00
ZhuoranYang/algorithms
Algorithms & Data Structures in C++
Language:C++0 2 00
ZhuoranYang/algorithms-2
Algorithms & Data Structures in Go
Language:Go2 0
ZhuoranYang/cpo
Constrained Policy Optimization
Language:Python2 0
ZhuoranYang/Dshell
Dshell is a network forensic analysis framework.
Language:Python2 0
ZhuoranYang/few-shot-cot
Try few shot COT and ICL. Modified from "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)
ZhuoranYang/mmp
Implimentation of some Reinforcement Learning algorithms
Language:Python2 01
ZhuoranYang/neural-style
Torch implementation of neural style algorithm
Language:Lua2 0
ZhuoranYang/reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
Language:Python2 0
ZhuoranYang/Reinforcement-Learning-Algorithms
These implementatios shows Convergence and performance of policy and value iteration algorithms, how the convergence of these algorithms to the optimal value function depends on the number of iterations used. Furthermore, I have implemented on-policy SARSA and off-policy Q-learning algorithms and showed how the performance of these algorithms depends on the exploration-exploitation tradeoff, and on learning rates. My experiments were evaluted on benchmark reinforcement learning tasks such as a smallworld, gridworld and a cliffworld MDP to analyze the performance of our algorithms.
Language:MATLAB1 0
ZhuoranYang/Stein-Variational-Gradient-Descent
code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"
Language:Python2 0
ZhuoranYang/tdlearn
some common TD Learning algorithms
Language:Python2 0
ZhuoranYang/v120
Proceedings of Learning for Dynamics and Control
Language:TeX
ZhuoranYang/zero_shot_few_shot_cot
Zero-Shot and Few-Shot COT and ICL
Language:Python
ZhuoranYang/zhuoranyang.github.io
Academic Website of Zhuoran Yang
Language:HTML