chilljudaoren

Pinned Repositories

TMM
Language:Python00
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Jupyter Notebook10.1k983
opencv-python
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
Language:Python4.6k863
X-VLM
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
Language:Python46251
VQA
Language:Python368140
UAP_VLP
Universal Adversarial Perturbations for Vision-Language Pre-trained Models
Language:Python120
TMM
Language:Python123
VQAttack
This is an official repository of ``VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models'' (AAAI 2024))
Language:HTML80
MLLM-Grounding-Robustness
[ICLR 2024 Workshop on Reliable and Responsible Foundation Models] Adversarial Robustness for Visual Grounding of Multimodal Large Language Models
Language:Python60
FGA
Feature Guidance attack for VLP models. The approach involves the ALBEF, TCL, CLIP, and BEiT3 models, as well as the VE (Visual Entailment), VG (Visual Grounding), VR (Visual Reasoning), VQA (Visual Question Answering), ZC (Zero-shot Classification), and ITR (Image-Text Retrieval) tasks.
30