VLAA@UCSC

Pinned Repositories

CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
Language: Python 300 14 1113
CRATE-alpha
This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision"
Language: Python 45 3 21
DMAE
[CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
Language: Python 102 5 85
EVP
[TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
Language: Python 37 1 04
HQ-Edit
HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
Language: Python 75 6 83
MedTrinity-25M
This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"
Language: Python 215 2 1417
Recap-DataComp-1B
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
122 5 161
RobustCNN
[ICLR 2023] This repository includes the official implementation of our paper "Can CNNs Be More Robust Than Transformers?"
Language: Python 143 4 113
SwinMM
[MICCAI 2023] This repository includes the official implementation of our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"
Language: Python 100 4 86
vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Language: Python 69 4 13

VLAA@UCSC's Repositories

UCSC-VLAA/CLIPA
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
Language: Python 300 14 1113
UCSC-VLAA/MedTrinity-25M
This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"
Language: Python 215 2 1417
UCSC-VLAA/RobustCNN
[ICLR 2023] This repository includes the official implementation of our paper "Can CNNs Be More Robust Than Transformers?"
Language: Python 143 4 113
UCSC-VLAA/Recap-DataComp-1B
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
122 5 161
UCSC-VLAA/DMAE
[CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
Language: Python 102 5 85
UCSC-VLAA/SwinMM
[MICCAI 2023] This repository includes the official implementation of our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"
Language: Python 100 4 86
UCSC-VLAA/HQ-Edit
HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
Language: Python 75 6 83
UCSC-VLAA/vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Language: Python 69 4 13
UCSC-VLAA/CRATE-alpha
This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision"
Language: Python 45 3 21
UCSC-VLAA/EVP
[TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
Language: Python 37 1 04
UCSC-VLAA/o1_medical
Language: Python 36 1 11
UCSC-VLAA/MicroDiffusion
[CVPR 2024] This repository includes the official implementation of our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections"
Language: Python 35 4 70
UCSC-VLAA/MixCon3D
[CVPR 2024] The official implementation of the paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
Language: Python 30 2 52
UCSC-VLAA/FedConv
[TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning"
Language: Python 25 1 00
UCSC-VLAA/Image-Pretraining-for-Video
[ECCV 2022] This repository includes the official implementation of our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".
Language: Python 19 0 10
UCSC-VLAA/Sight-Beyond-Text
This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
Language: Python 19 2 11
UCSC-VLAA/AdvXL
[CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale"
Language: Python 17 3 41
UCSC-VLAA/AttnGCG-attack
Language: Python 130
UCSC-VLAA/Redteaming_Challenge
Language: Python 7 1 00
UCSC-VLAA/AQA-Bench
Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability
Language: Python 4 1 0
UCSC-VLAA/vit_cert
[ECCV 2022] This repository includes the official implementation of our paper "ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers"
Language: Python 3 0 00
UCSC-VLAA/Compress-Align
This repository includes the official implementation and dataset of our paper "Compress & Align: Curating Image-Text Data with Human Knowledge".
2 2 10
UCSC-VLAA/CLIPS
An Enhanced CLIP Framework for Learning with Synthetic Captions
Language: Python 1
UCSC-VLAA/o1_medicine
Language: JavaScript 1
UCSC-VLAA/UCSC-VLAA.github.io
Language: HTML 0 0 00