Pinned Repositories
opencv
Open Source Computer Vision Library
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
StableSR
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
AntiAliasing
REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
ComfyUI_EchoMimic
You can using EchoMimic in ComfyUI
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
11whitewater's Repositories
11whitewater/opencv
Open Source Computer Vision Library
11whitewater/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.