Pinned Repositories
AOT-GAN-for-Inpainting
[TVCG'2023] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)
img2poem
[MM'18] Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
LightTrack
[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search
MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
PEN-Net-for-Inpainting
[CVPR'2019] PEN-Net: Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
SiamDW
[CVPR'19 Oral] Deeper and Wider Siamese Networks for Real-Time Visual Tracking
Stark
[ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking
STTN
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
TracKit
[ECCV'20] Ocean: Object-aware Anchor-Free Tracking
TTSR
[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution
Multimedia Research's Repositories
researchmm/TTSR
[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution
researchmm/SiamDW
[CVPR'19 Oral] Deeper and Wider Siamese Networks for Real-Time Visual Tracking
researchmm/Stark
[ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking
researchmm/TracKit
[ECCV'20] Ocean: Object-aware Anchor-Free Tracking
researchmm/STTN
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
researchmm/AOT-GAN-for-Inpainting
[TVCG'2023] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)
researchmm/MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
researchmm/LightTrack
[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search
researchmm/PEN-Net-for-Inpainting
[CVPR'2019] PEN-Net: Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
researchmm/img2poem
[MM'18] Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
researchmm/tasn
Trilinear Attention Sampling Network for Fine-grained Image Recognition
researchmm/soho
[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
researchmm/TTVSR
[CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution
researchmm/FTVSR
[ECCV'22] FTVSR: Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
researchmm/DBTNet
Code for our NeurIPS'19 paper "Learning Deep Bilinear Transformation for Fine-grained Image Representation"
researchmm/generate-it
A collection of models for image<->text generation in ACM MM 2021.
researchmm/CKDN
[ICCV'21] CKDN: Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
researchmm/SariGAN
[NeurIPS'20] Learning Semantic-aware Normalization for Generative Adversarial Networks
researchmm/WSOD2
[ICCV'19] WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection
researchmm/VQD-SR
[ICCV'23] VQD-SR: Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
researchmm/CyDAS
Cyclic Differentiable Architecture Search
researchmm/NEAS
researchmm/2D-TAN
AAAI2020 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
researchmm/STTR
[ACCV'22] Fine-Grained Image Style Transfer with Visual Transformers
researchmm/AAST-pytorch
[MM'20] Aesthetic-Aware Image Style Transfer
researchmm/davinci-videofactory
researchmm/AI_Illustrator
[MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
researchmm/language-guided-animation
[TMM 2023] Language-Guided Face Animation by Recurrent StyleGAN-based Generator
researchmm/AutoML
AutoFormer, Cream
researchmm/2D-TAN-Microsoft
[AAAI‘20] - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language