uclaml

The artificial general intelligence lab (formerly known as statistical machine learning lab) at UCLA is led by Prof. Quanquan Gu in the computer science dept.

Department of Computer Science, UCLA

Pinned Repositories

Frank-Wolfe-AdvML
A Frank-Wolfe Framework for Efficient and Effective Adversarial Attacks (AAAI'20)
Language:Python11 4 05
MoE
Towards Understanding the Mixture-of-Experts Layer in Deep Learning
Language:Jupyter Notebook19 3 14
NeuralUCB
Language:Python28 5 09
Padam
Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks" (accepted by IJCAI 2020)
Language:Python39 7 59
PDE
Official repo of Progressive Data Expansion: data, code and evaluation
Language:Jupyter Notebook26 2 01
RayS
RayS: A Ray Searching Method for Hard-label Adversarial Attack (KDD2020)
Language:Python54 5 56
Rephrase-and-Respond
Official repo of Respond-and-Respond: data, code, and evaluation
Language:Python94 3 110
SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python989 12 3088
SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
Language:Python474 29 1861
ucla-covid19-forecasts
Language:Python14 5 09

uclaml's Repositories

uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python989 12 3088
uclaml/SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
Language:Python474 29 1861
uclaml/Rephrase-and-Respond
Official repo of Respond-and-Respond: data, code, and evaluation
Language:Python94 3 110
uclaml/RayS
RayS: A Ray Searching Method for Hard-label Adversarial Attack (KDD2020)
Language:Python54 5 56
uclaml/Padam
Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks" (accepted by IJCAI 2020)
Language:Python39 7 59
uclaml/NeuralUCB
Language:Python28 5 09
uclaml/PDE
Official repo of Progressive Data Expansion: data, code and evaluation
Language:Jupyter Notebook26 2 01
uclaml/MoE
Towards Understanding the Mixture-of-Experts Layer in Deep Learning
Language:Jupyter Notebook19 3 14
uclaml/ucla-covid19-forecasts
Language:Python14 5 09
uclaml/Frank-Wolfe-AdvML
A Frank-Wolfe Framework for Efficient and Effective Adversarial Attacks (AAAI'20)
Language:Python11 4 05
uclaml/NeuralTS
Language:Python8 4 01
uclaml/CS269-Winter2019
4 2 00
uclaml/CS161-Winter2020
Fundamentals of Artificial Intelligence
2 4 03
uclaml/FedLinUCB
A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits
Language:Jupyter Notebook2 2 00
uclaml/GFA-RFE
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Language:Python2 2 1
uclaml/PhyGCN
Language:Python2 2 01
uclaml/VACDB
Variance-aware Contextual Dueling Bandits
Language:Python2 3 0
uclaml/Benign-Overfitting-CNN
Benign Overfitting in Two-layer Convolutional Neural Networks
Language:Jupyter Notebook1 2 0
uclaml/Benign_ReLU_CNN
Language:Python1 2 01
uclaml/CS260-Fall2022
1 3 03
uclaml/CS260-Spring2020
Machine Learning
1 3 01
uclaml/CW-OFUL
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions
Language:Jupyter Notebook1 2 00
uclaml/HF-UCRL-VTR
Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs
Language:Jupyter Notebook1 2 01
uclaml/LDP-UCRL-VTR
Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes
Language:Python1 2 0
uclaml/pretrain-finetune-SGD
The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift
Language:Jupyter Notebook1 2 0
uclaml/POWERS
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs
Language:Jupyter Notebook0 2 00
uclaml/RobustOFUL
Corruption-robust linear contextual bandits
Language:Python0 3 00
uclaml/multipass-SGD
Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime
2 01
uclaml/SSL_Pseudo_Labeler
Language:Python2 01
uclaml/SSLGC
Selective Sampling on Graphs for Classification
Language:MATLAB3 0