Pinned Repositories
deepneighbor
DeepNeighbor is a High-level,Flexible and Extendible package for embedding-based information retrieval from user-item interaction logs
jaxGPT
building nanoGPT with JAX
LunarLander
RL project: Training the Lunar Lander Agent With Deep Q-Learning (DQN) and Double DQN
ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
Recs_Search_Ads_papers
SmartWater
I built a data labaling platform with active learing and human feedback in the loop.
TD_Lambda
Replication of Richard Sutton's 1988 results exploring temporal difference learning and the TD(lambda) algorithm.
tensor-puzzles
louiswang524's Repositories
louiswang524/deepneighbor
DeepNeighbor is a High-level,Flexible and Extendible package for embedding-based information retrieval from user-item interaction logs
louiswang524/Recs_Search_Ads_papers
louiswang524/jaxGPT
building nanoGPT with JAX
louiswang524/LunarLander
RL project: Training the Lunar Lander Agent With Deep Q-Learning (DQN) and Double DQN
louiswang524/ML-From-Scratch
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
louiswang524/SmartWater
I built a data labaling platform with active learing and human feedback in the loop.
louiswang524/TD_Lambda
Replication of Richard Sutton's 1988 results exploring temporal difference learning and the TD(lambda) algorithm.
louiswang524/tensor-puzzles