Pinned Repositories
SEED
Official implementation of SEED-LLaMA (ICLR 2024).
SEED-Bench
(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
SEED-X
Multimodal Models in Real World
DeepFashion2
DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf
frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
geyuying.github.io
Yuying Ge's Homepage
MetaDance
PF-AFN
Official code for "Parser-Free Virtual Try-on via Distilling Appearance Flows", CVPR 2021.
DeepFashion2
DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf
MCQ
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
geyuying's Repositories
geyuying/PF-AFN
Official code for "Parser-Free Virtual Try-on via Distilling Appearance Flows", CVPR 2021.
geyuying/MetaDance
geyuying/geyuying.github.io
Yuying Ge's Homepage
geyuying/DeepFashion2
DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf
geyuying/frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
geyuying/PAFF_code
Official code for "Policy Adaptation from Foundation Model Feedback", CVPR 2023
geyuying/all-in-one
[Arxiv2022] All in One: Exploring Unified Video-Language Pre-training
geyuying/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
geyuying/GAN_lecture
geyuying/PAFF
geyuying/Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
geyuying/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
geyuying/SEED-X
Multimodal Models in Real World
geyuying/SEED_test
[ICLR 2024] Empowers LLMs with the ability to see and draw.