Yuhao-Wan

Pinned Repositories

ridesharing-gym
Language: Python 0 4 00
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language: Jupyter Notebook 2.5k 37 34124
FaceRecognitionPCA
Final Project for MATH295 Numerical Analysis at Carleton College, Winter 2018
Language: MATLAB 00
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language: Jupyter Notebook 0 0 00
deep-reinforcement-learning
Implementations of deep reinforcement learning algorithms in TensorFlow
Language: Python 21
Gaussian-processes
Python implementation of "A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning"
Language: Jupyter Notebook 1 0 00
Pairwise-combinatorial-learner
Python implementation of the algorithm in "Learning Combinatorial Functions from Pairwise Comparisons"
Language: Python 1 0 01
time-varying-discount
A practical method to reduce discounting-induced bias during training in deep Q-networks.
Language: Python 0 1 00
wise-ft
Language: Python 00
Yuhao-Wan.github.io
00

Yuhao-Wan's Repositories

Yuhao-Wan/deep-reinforcement-learning
Implementations of deep reinforcement learning algorithms in TensorFlow
Language: Python 21
Yuhao-Wan/Gaussian-processes
Python implementation of "A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning"
Language: Jupyter Notebook 1 0 00
Yuhao-Wan/Pairwise-combinatorial-learner
Python implementation of the algorithm in "Learning Combinatorial Functions from Pairwise Comparisons"
Language: Python 1 0 01
Yuhao-Wan/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language: Jupyter Notebook 0 0 00
Yuhao-Wan/time-varying-discount
A practical method to reduce discounting-induced bias during training in deep Q-networks.
Language: Python 0 1 00
Yuhao-Wan/wise-ft
Language: Python 00
Yuhao-Wan/Yuhao-Wan.github.io
00