mayeechen's Stars
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
acmi-lab/cmu-10717-the-art-of-the-paper
Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".
modestyachts/CIFAR-10.1
Release of CIFAR-10.1, a new test set for CIFAR-10.
JieyuZ2/wrench
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
Weixin-Liang/Modality-Gap
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
bencw99/comparing-labeled-and-unlabeled-data
The code used for synthetic experiments and the real-world case study in "Comparing the Value of Labeled and Unlabeled Data in Method-of-Moments Latent Variable Estimation".