WeinanGuan

WeinanGuan's Stars

VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Language:Python75735
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
36611
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
Language:Python47429
BradyFU/Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
Language:Python59429
ldz666666/RiDDLE
Author implementation of RiDDLE: Reversible and Diversified De-identification with Latent Encryptor (CVPR 2023)
Language:Python422
Yueming6568/DeltaEdit
Language:Python10210
ldz666666/Style-atk
Author implementation of Exploring Adversarial Fake Images on Face Manifold (CVPR 2021 oral)
Language:Python301
TianxiangMa/MUST-GAN
Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"
Language:Python7617
thinkerthinker/r-FACE
Reference Guided Face Component Editing
1
SXKDZ/arXiv-newsletter
A simple configurable bot for sending arXiv article alert by mail
Language:HTML267
fartashf/vsepp
PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"
Language:Python488125
karpathy/neuraltalk2
Efficient Image Captioning code in Torch, runs on GPU
Language:Jupyter Notebook5.5k1.3k
taoxugit/AttnGAN
Language:Python1.3k418
nouman-10/Classic-Convolutional-Models
Implementation of Classic Convolutional Models i.e, LeNet-5, AlexNet, VGG-16 and ResNet34 using PyTorch framework
Language:Python41