alecwangcq

Reinforcement learning ∩ LLMs, Generative models, Artificial intelligence

San Francisco, CA

Pinned Repositories

coco-caption-python3
Language:Jupyter Notebook01
Conv_LSTM
Language:Python15
EigenDamage-Pytorch
Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934
Language:Python111 5 119
f-divergence-dpo
Direct preference optimization with f-divergences.
Language:Python60
GraSP
Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH
Language:Python97 2 614
KFAC-Pytorch
Pytorch implementation of KFAC and E-KFAC (Natural Gradient).
Language:Python120 4 232
Prototypical-network
Language:Python0 1 01
show-attend-and-tell
Language:Jupyter Notebook25 2 911
Weight-Decay
Regularization, Neural Network Training Dynamics
Language:Python14 2 10
Neural-Kernel-Network
Code for "Differentiable Compositional Kernel Learning for Gaussian Processes" https://arxiv.org/abs/1806.04326
Language:Python69 9 23

alecwangcq's Repositories

alecwangcq/KFAC-Pytorch
Pytorch implementation of KFAC and E-KFAC (Natural Gradient).
Language:Python120 4 232
alecwangcq/EigenDamage-Pytorch
Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934
Language:Python111 5 119
alecwangcq/GraSP
Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH
Language:Python97 2 614
alecwangcq/show-attend-and-tell
Language:Jupyter Notebook25 2 911
alecwangcq/f-divergence-dpo
Direct preference optimization with f-divergences.
Language:Python60
alecwangcq/Conv_LSTM
Language:Python15
alecwangcq/coco-caption-python3
Language:Jupyter Notebook01
alecwangcq/Prototypical-network
Language:Python0 1 01