hltaaron

hltaaron's Stars

Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
Language: Java, 42.3k stars, 3.4k forks
comfyanonymous/ComfyUI_examples
Examples of ComfyUI workflows
Language: HTML, 1.7k stars, 257 forks
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language: Python, 51k stars, 5.3k forks
Hillobar/Rope
GUI-focused roop
Language: Python, 4.4k stars, 679 forks
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
Language: Python, 35.4k stars, 5k forks
6drf21e/ChatTTS_Speaker
ChatTTS 2K Speaker Stability Score 🥇 & Categorized by Gender and Age 👧 & Audio Preview 🔈
Language: Python 46123
thiagorossener/jekflix-template
A Jekyll theme inspired by Netflix. 🎬
Language: HTML, 839 stars, 1.2k forks
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
Language: HTML, 10.6k stars, 11k forks
academicpages/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language: JavaScript, 11.7k stars, 42.1k forks
daswer123/hallo-webui
Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python 7726
sdbds/hallo-for-windows
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python 18129
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python, 9.2k stars, 1.3k forks
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language: Python, 2.4k stars, 299 forks
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language: Python, 1.2k stars, 87 forks
anothermartz/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
Language: Jupyter Notebook 60694
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language: Python, 6.4k stars, 944 forks
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Language: C++, 25.1k stars, 3.9k forks
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
8.9k stars, 1.8k forks
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language: Python, 11.7k stars, 2.2k forks
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
Language: Python, 10.3k stars, 2.2k forks
Paraworks/vits_with_chatgpt-gpt3
Language: Python 38952
AliaksandrSiarohin/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
Language: Jupyter Notebook, 14.5k stars, 3.2k forks
GuijiAI/duix.ai
Language: C++, 4.5k stars, 645 forks
OpenGVLab/Diffree
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Language: Python 21213
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
Language: Python, 4.6k stars, 368 forks
THUDM/CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python, 7.4k stars, 678 forks
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language: Python, 32.2k stars, 2.4k forks
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python, 32.7k stars, 3.8k forks
2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python, 30.8k stars, 3.3k forks
CCmahua/ChatTTS-Enhanced
Language: Python 45363