MyBeautiful-Fantasy

Less is more

Xiamen University, ChinaXiamen

MyBeautiful-Fantasy's Stars

jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。
Language:Python11.4k 71 6521.3k
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python11.1k 173 6722.3k
DanielSWolf/rhubarb-lip-sync
Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other project that requires animating mouths based on existing recordings.
Language:C++1.9k 55 128229
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Language:Jupyter Notebook1.5k 66 40140
ajay-sainy/Wav2Lip-GFPGAN
High quality Lip sync
Language:Python1.1k 15 30270
MRzzm/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Language:Python1k 19 122178
R3gm/SoniTranslate
Synchronized Translation for Videos. Video dubbing
Language:Python951 18 115179
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language:Python854 32 5797
awni/speech
A PyTorch Implementation of End-to-End Models for Speech-to-Text
Language:Python756 31 53175
Rudrabha/Lip2Wav
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
Language:Python700 26 39153
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Language:Jupyter Notebook567 23 31122
Sxjdwang/TalkLip
Language:Python414 16 5036
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language:Jupyter Notebook402 16 2655
guanjz20/StyleSync
Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
Language:Python304 32 1419
guanjz20/StyleSync_PyTorch
PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
Language:Python209 11 322
keonlee9420/DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
Language:Python206 8 313
KunZhou9646/Mixed_Emotions
Language:Python114 4 311
GalaxyCong/HPMDubbing
[CVPR 2023] Official code for paper: Learning to Dub Movies via Hierarchical Prosody Models.
Language:Python102 9 108
Chris10M/Lip2Speech
A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.
Language:Python79 4 319
amtsai96/Learning-Lip-Sync-from-Audio
Learning Lip Sync of Obama from Speech Audio
Language:Python67 3 724
GalaxyCong/StyleDubber
[ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"
Language:Python63 5 43
yochaiye/LipVoicer
Official Code implementation for the ICLR paper "LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading"
Language:Python51 2 68
tanjimin/unsupervised-video-dubbing
Unsupervised video dubbing project
Language:Python38 2 110
choijeongsoo/av2av
[CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
Language:Python27 2 72
IS2AI/KazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Dataset
Language:Python27 3 24
Ace-Pegasus/EasyDrag
Official code for EasyDrag (CVPR 2024)
Language:Python12 2 30
yochaiye/scene_agnostic_dereverberation
PyTorch implementation of the 2021 INTERSPEECH paper "Scene-Agnostic Multi-Microphone Speech Dereverberation"
Language:Python8 1 15
yochaiye/BIUREVgen
Creates the BIUREV and BIUREV-N datasets
Language:Python3 1 00
eust-w/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:JavaScript1 1 0
MyBeautiful-Fantasy/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Language:Jupyter Notebook1 0 00

MyBeautiful-Fantasy

MyBeautiful-Fantasy's Stars

jianchang512/pyvideotrans

Rudrabha/Wav2Lip

DanielSWolf/rhubarb-lip-sync

X-LANCE/AniTalker

ajay-sainy/Wav2Lip-GFPGAN

MRzzm/DINet

R3gm/SoniTranslate

gemelo-ai/vocos

awni/speech

Rudrabha/Lip2Wav

huawei-noah/Speech-Backbones

Sxjdwang/TalkLip

ivanvovk/WaveGrad

guanjz20/StyleSync

guanjz20/StyleSync_PyTorch

keonlee9420/DailyTalk

KunZhou9646/Mixed_Emotions

GalaxyCong/HPMDubbing

Chris10M/Lip2Speech

amtsai96/Learning-Lip-Sync-from-Audio

GalaxyCong/StyleDubber

yochaiye/LipVoicer

tanjimin/unsupervised-video-dubbing

choijeongsoo/av2av

IS2AI/KazEmoTTS

Ace-Pegasus/EasyDrag

yochaiye/scene_agnostic_dereverberation

yochaiye/BIUREVgen

eust-w/MockingBird

MyBeautiful-Fantasy/Speech-Backbones