Pinned Repositories
opencv
Open Source Computer Vision Library
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
StableSR
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Rofunc
🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation
ComfyUI_EchoMimic
You can using EchoMimic in ComfyUI
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
11whitewater's Repositories
11whitewater/opencv
Open Source Computer Vision Library
11whitewater/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.