Pinned Repositories
CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
CRATE-alpha
This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
DMAE
[CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
EVP
[TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
HQ-Edit
HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
MedTrinity-25M
This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
Recap-DataComp-1B
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
RobustCNN
[ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"
SwinMM
[MICCAI 2023] This repository includes the official implementation our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"
vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
VLAA@UCSC's Repositories
UCSC-VLAA/CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
UCSC-VLAA/MedTrinity-25M
This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
UCSC-VLAA/RobustCNN
[ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"
UCSC-VLAA/Recap-DataComp-1B
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
UCSC-VLAA/DMAE
[CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
UCSC-VLAA/SwinMM
[MICCAI 2023] This repository includes the official implementation our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"
UCSC-VLAA/HQ-Edit
HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
UCSC-VLAA/vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
UCSC-VLAA/CRATE-alpha
This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
UCSC-VLAA/EVP
[TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
UCSC-VLAA/o1_medical
UCSC-VLAA/MicroDiffusion
[CVPR 2024] This repository includes the official implementation our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections"
UCSC-VLAA/MixCon3D
[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
UCSC-VLAA/FedConv
[TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning"
UCSC-VLAA/Image-Pretraining-for-Video
[ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".
UCSC-VLAA/Sight-Beyond-Text
This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
UCSC-VLAA/AdvXL
[CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"
UCSC-VLAA/AttnGCG-attack
UCSC-VLAA/Redteaming_Challenge
UCSC-VLAA/AQA-Bench
Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability
UCSC-VLAA/vit_cert
[ECCV 2022] This repository includes the official implementation our paper "ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers"
UCSC-VLAA/Compress-Align
This repository includes the official implementation and dataset of our paper "Compress & Align: Curating Image-Text Data with Human Knowledge".
UCSC-VLAA/CLIPS
An Enhanced CLIP Framework for Learning with Synthetic Captions
UCSC-VLAA/o1_medicine
UCSC-VLAA/UCSC-VLAA.github.io