Pinned Repositories
CNVid-3.5M
This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale Public Chinese Video-text Dataset>, which has been accepted by CVPR 2023.
LDMVFI
[AAAI'2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull
diagramss
none
gemini-netlify-proxy
GeminiProChat
Minimal web UI for GeminiPro.
weapp-qrcode
weapp.qrcode.js 在 微信小程序 中,快速生成二维码
unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Cap4Video
【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?
Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
miumiuc's Repositories
miumiuc/diagramss
none
miumiuc/gemini-netlify-proxy
miumiuc/GeminiProChat
Minimal web UI for GeminiPro.
miumiuc/weapp-qrcode
weapp.qrcode.js 在 微信小程序 中,快速生成二维码