yaodongyu's Stars
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture, first outlined in the CVPR paper "Self-supervised learning from images with a joint-embedding predictive architecture."
OliverRensu/D-iGPT
[ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Learners"
saprmarks/feature-circuits
pomonam/kronfluence
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
tysam-code/hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to larger models with one parameter change (feature currently in alpha).
yuanchenyang/smalldiffusion
Simple and readable code for training and sampling from diffusion models
centerforaisafety/wmdp
WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM performance on WMDP while retaining general capabilities.
aangelopoulos/ppi_py
A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.
sndnyang/Diffusion_ViT
PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"
formll/dog
DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
lucidrains/local-attention
An implementation of local windowed attention for language modeling
ethz-spylab/rlhf_trojan_competition
Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024.
chs20/RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
LargeWorldModel/LWM
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
lee-ny/teaching_arithmetic
wesg52/universal-neurons
Universal Neurons in GPT2 Language Models
openai/sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
HoagyC/sparse_coding
Using sparse coding to find distributed representations used by neural networks.
apple/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
taufeeque9/codebook-features
Sparse and discrete interpretability tool for neural networks
wangf3014/SCLIP
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
facebookresearch/UnifiedUncertaintyCalibration
UnifiedUncertaintyCalibration
penghao-wu/vstar
PyTorch Implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
4m4n5/CLIP-Lite
PyTorch Implementation of CLIP-Lite | Accepted at AISTATS 2023
tsb0601/MMVP
deeplearning-wisc/vit-spurious-robustness