WeinanGuan's Stars
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
BradyFU/Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
ldz666666/RiDDLE
Author implementation of RiDDLE: Reversible and Diversified De-identification with Latent Encryptor (CVPR 2023)
Yueming6568/DeltaEdit
ldz666666/Style-atk
Author implementation of Exploring Adversarial Fake Images on Face Manifold (CVPR 2021 oral)
TianxiangMa/MUST-GAN
PyTorch implementation of the CVPR 2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"
thinkerthinker/r-FACE
Reference Guided Face Component Editing
SXKDZ/arXiv-newsletter
A simple configurable bot for sending arXiv article alerts by email
fartashf/vsepp
PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"
karpathy/neuraltalk2
Efficient Image Captioning code in Torch, runs on GPU
taoxugit/AttnGAN
nouman-10/Classic-Convolutional-Models
Implementation of Classic Convolutional Models, i.e., LeNet-5, AlexNet, VGG-16 and ResNet34, using the PyTorch framework