mayeechen

mayeechen's Stars

lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python7.7k 143 46672
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
1.5k 19 0116
acmi-lab/cmu-10717-the-art-of-the-paper
Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".
283 54 110
modestyachts/CIFAR-10.1
Release of CIFAR-10.1, a new test set for CIFAR-10.
Language:Jupyter Notebook215 7 218
JieyuZ2/wrench
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
Language:Python214 6 2730
Weixin-Liang/Modality-Gap
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
Language:Jupyter Notebook107 5 37
bencw99/comparing-labeled-and-unlabeled-data
The code used for synthetic experiments and the real-world case study in "Comparing the Value of Labeled and Unlabeled Data in Method-of-Moments Latent Variable Estimation".
Language:Jupyter Notebook2 2 00