Pinned Repositories
ridesharing-gym
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
FaceRecognitionPCA
Final Project for MATH295 Numerical Analysis at Carleton College, Winter 2018
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
deep-reinforcement-learning
Implementations of deep reinforcement learning algorithms in Tensorflow
Gaussian-processes
Python implementation of "A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning"
Pairwise-combinatorial-learner
Python implementation of algorithm in Learning Combinatorial Functions from Pairwise Comparisons
time-varying-discount
A practical method to reduce discounting-induced bias during training in deeep Q-networks.
wise-ft
Yuhao-Wan.github.io
Yuhao-Wan's Repositories
Yuhao-Wan/deep-reinforcement-learning
Implementations of deep reinforcement learning algorithms in Tensorflow
Yuhao-Wan/Gaussian-processes
Python implementation of "A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning"
Yuhao-Wan/Pairwise-combinatorial-learner
Python implementation of algorithm in Learning Combinatorial Functions from Pairwise Comparisons
Yuhao-Wan/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Yuhao-Wan/time-varying-discount
A practical method to reduce discounting-induced bias during training in deeep Q-networks.
Yuhao-Wan/wise-ft
Yuhao-Wan/Yuhao-Wan.github.io