hltaaron's Stars
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
comfyanonymous/ComfyUI_examples
Examples of ComfyUI workflows
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Hillobar/Rope
GUI-focused roop
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
6drf21e/ChatTTS_Speaker
ChatTTS 2K Speaker Stability Score 🥇 & Categorized by Gender and Age 👧 & Audio Preview 🔈
thiagorossener/jekflix-template
A Jekyll theme inspired by Netflix. 🎬
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
academicpages/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
daswer123/hallo-webui
Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
sdbds/hallo-for-windows
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
anothermartz/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
Paraworks/vits_with_chatgpt-gpt3
AliaksandrSiarohin/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
GuijiAI/duix.ai
OpenGVLab/Diffree
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
THUDM/CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
RVC-Boss/GPT-SoVITS
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
2noise/ChatTTS
A generative speech model for daily dialogue.
CCmahua/ChatTTS-Enhanced