ludanruan
Master's at Renmin University. Focusing on multi-modal understanding and multi-modal generation.
Renmin University of China, 59 Zhongguancun Street, Haidian District
Pinned Repositories
CLIP4VLA
The official code base of Accommodating Audio Modality in CLIP for Multimodal Processing
MCLIP4VLA
Multi-modal multi-lingual pre-trained model
TTVSR
[CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution
MM-Diffusion
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
ludanruan's Repositories
ludanruan/MCLIP4VLA
Multi-modal multi-lingual pre-trained model
ludanruan/CLIP4VLA
The official code base of Accommodating Audio Modality in CLIP for Multimodal Processing
ludanruan/TTVSR
[CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution