vokhanhan25's Stars
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, plus a number of inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
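The library exposes a small, uniform CAM API across its methods. Below is a minimal, illustrative sketch of typical usage with a torchvision ResNet-50; the placeholder input and the ImageNet class index 281 ("tabby cat") are assumptions for demonstration, not part of the repository.

```python
import torch
from torchvision.models import resnet50, ResNet50_Weights
from pytorch_grad_cam import GradCAM
from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget

# Any torchvision classifier works; ResNet-50 is only an example.
model = resnet50(weights=ResNet50_Weights.DEFAULT).eval()
target_layers = [model.layer4[-1]]          # last residual block of the backbone

input_tensor = torch.rand(1, 3, 224, 224)   # placeholder; use a preprocessed image in practice

cam = GradCAM(model=model, target_layers=target_layers)
grayscale_cam = cam(input_tensor=input_tensor,
                    targets=[ClassifierOutputTarget(281)])[0]  # (H, W) heatmap in [0, 1]
```

Other CAM classes in the package (e.g. ScoreCAM, EigenCAM) follow the same construct-and-call pattern.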
THUDM/CogVLM
A state-of-the-art open visual language model | multimodal pre-trained model
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Harry24k/adversarial-attacks-pytorch
PyTorch implementation of adversarial attacks [torchattacks]
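Attacks in torchattacks share a wrap-and-call pattern: construct an attack around a classifier, then call it on a batch. A minimal sketch of a PGD attack follows, assuming a standard ImageNet classifier and a placeholder batch; the model and data here are illustrative, not from the repository.

```python
import torch
import torchattacks
from torchvision.models import resnet18, ResNet18_Weights

model = resnet18(weights=ResNet18_Weights.DEFAULT).eval()

images = torch.rand(4, 3, 224, 224)     # placeholder batch, pixel values in [0, 1]
labels = torch.randint(0, 1000, (4,))   # placeholder ground-truth labels

# PGD with an 8/255 L-infinity budget, 2/255 step size, 10 iterations
atk = torchattacks.PGD(model, eps=8/255, alpha=2/255, steps=10)
adv_images = atk(images, labels)        # adversarial examples, same shape as `images`
```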
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
facebookresearch/TorchRay
Understanding Deep Networks via Extremal Perturbations and Smooth Masks
TIGER-AI-Lab/Program-of-Thoughts
Data and Code for Program of Thoughts (TMLR 2023)
mattneary/attention
visualizing attention for LLM users
yunqing-me/AttackVLM
[NeurIPS 2023] Code for "On Evaluating Adversarial Robustness of Large Vision-Language Models"
veronica320/Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
kevinzakka/clip_playground
An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities
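For reference, zero-shot classification with CLIP boils down to comparing an image embedding against a set of text-prompt embeddings. The sketch below uses the Hugging Face transformers CLIP API rather than the playground's own notebooks; the checkpoint name, image path, and prompts are illustrative assumptions.

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("cat.jpg")  # hypothetical local image
prompts = ["a photo of a cat", "a photo of a dog", "a photo of a car"]

inputs = processor(text=prompts, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    logits = model(**inputs).logits_per_image   # image-text similarity scores
probs = logits.softmax(dim=-1)                  # zero-shot class probabilities
```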
gordonhu608/MQT-LLaVA
[NeurIPS 2024] Matryoshka Query Transformer for Large Vision-Language Models
Cogito2012/CarCrashDataset
[ACM MM 2020] CCD dataset for traffic accident anticipation.
zjysteven/VLM-Visualizer
Visualizing the attention of vision-language models
IBM/ZOO-Attack
Code for reproducing the black-box adversarial attacks in "ZOO: Zeroth Order Optimization based Black-box Attacks to Deep Neural Networks without Training Substitute Models" (ACM CCS Workshop on AI-Security, 2017)
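The core idea in ZOO is to estimate gradients of an attack loss from model queries alone, one coordinate at a time, via symmetric finite differences. The snippet below is a minimal sketch of that estimator under assumed names (`loss_fn`, `idx`), not the repository's implementation, which additionally uses coordinate-wise ADAM/Newton updates, importance sampling, and dimensionality reduction.

```python
import torch

def zoo_coordinate_grad(loss_fn, x, idx, h=1e-4):
    """Estimate d loss / d x[idx] using only forward queries (no backprop),
    via the symmetric difference (f(x + h*e_i) - f(x - h*e_i)) / (2h)."""
    e = torch.zeros_like(x)
    e.view(-1)[idx] = h
    return (loss_fn(x + e) - loss_fn(x - e)) / (2 * h)
```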
yu-rp/apiprompting
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
IntelLabs/lvlm-interpret
as791/ZOO_Attack_PyTorch
PyTorch implementation of the Zeroth Order Optimization based black-box adversarial attack (https://arxiv.org/abs/1708.03999)
euanong/image-hijacks
Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime
NY1024/BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
QUVA-Lab/PIN
Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
SiyuanWangw/ULogic
Ziwei-Zheng/LVLM-Stethoscope
A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).
xiangyu-mm/UniFashion
The official code for the paper "UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation"
RobbieHolland/SpecialistVLMs
Developing VLMs for expert-level performance in specific medical specialties
xuanmingcui/visual_adversarial_lmm
ChaduCheng/LVLMs_Exploring