Pinned Repositories
OpenSeq2Seq-forked
This repository has been archived! It cannot work with some modules.
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
MultiModal-DeepFake
[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond
SadTalker-Video-Lip-Sync
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
arash-aut's Repositories
arash-aut/OpenSeq2Seq-forked
This repository has been archived! It cannot work with some modules.
arash-aut/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation