Pinned Repositories
art2real
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation. CVPR 2019
dress-code
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
LLaVA-MORE
LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1
meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
safe-clip
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
mlnet
A Deep Multi-Level Network for Saliency Prediction. ICPR 2016
sam
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model. IEEE Transactions on Image Processing (2018)
ladi-vton
[ACM MM 2023] - LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
marcellacornia's Repositories
marcellacornia/sam
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model. IEEE Transactions on Image Processing (2018)
marcellacornia/mlnet
A Deep Multi-Level Network for Saliency Prediction. ICPR 2016