Pinned Repositories
Algorithms-1
Data Structures and Algorithms in Python
awesome-courses
List of awesome university courses for learning Computer Science!
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Python-for-Signal-Processing
Notebooks for "Python for Signal Processing" book
soft_dqn
Soft DQN algorithm
ZhuoranYang's Repositories
ZhuoranYang/Python-for-Signal-Processing
Notebooks for "Python for Signal Processing" book
ZhuoranYang/awesome-courses
List of awesome university courses for learning Computer Science!
ZhuoranYang/Algorithms-1
Data Structures and Algorithms in Python
ZhuoranYang/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
ZhuoranYang/soft_dqn
Soft DQN algorithm
ZhuoranYang/algforopt-notebooks
Jupyter notebooks associated with the Algorithms for Optimization textbook
ZhuoranYang/algorithms
Algorithms & Data Structures in C++
ZhuoranYang/algorithms-2
Algorithms & Data Structures in Go
ZhuoranYang/cpo
Constrained Policy Optimization
ZhuoranYang/Dshell
Dshell is a network forensic analysis framework.
ZhuoranYang/few-shot-cot
Experiments with few-shot COT and ICL. Modified from "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned; more updates to come)
ZhuoranYang/mmp
Implementation of some Reinforcement Learning algorithms
ZhuoranYang/neural-style
Torch implementation of neural style algorithm
ZhuoranYang/reinforcement-learning
Minimal and Clean Reinforcement Learning Examples
ZhuoranYang/Reinforcement-Learning-Algorithms
These implementations show the convergence and performance of the policy iteration and value iteration algorithms, and how their convergence to the optimal value function depends on the number of iterations used. Furthermore, I have implemented the on-policy SARSA and off-policy Q-learning algorithms and shown how their performance depends on the exploration-exploitation tradeoff and on the learning rate. My experiments were evaluated on benchmark reinforcement learning tasks (smallworld, gridworld, and cliffworld MDPs) to analyze the performance of these algorithms.
ZhuoranYang/Stein-Variational-Gradient-Descent
code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"
ZhuoranYang/tdlearn
some common TD Learning algorithms
ZhuoranYang/v120
Proceedings of Learning for Dynamics and Control
ZhuoranYang/zero_shot_few_shot_cot
Zero-Shot and Few-Shot COT and ICL
ZhuoranYang/zhuoranyang.github.io
Academic Website of Zhuoran Yang