jiahello's Stars
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
kangyiwen/TTSlist
10000 chatTTS voices !chatTTS 音色库,再也不为音色抽卡烦恼啦。这是我第一个项目,熬夜龟速生产10000条音色并上传Github,给点鼓励呗哈!主域名:https://www.TTSlist.com 备用:http://ttslist.aiqbh.com/
libukai/Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
lenML/ChatTTS-Forge
🍦 ChatTTS-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
panyanyany/Awesome-ChatTTS
ChatTTS资源大全,免费体验地址,音色库等
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
aihacker111/Efficient-Live-Portrait
Fast running Live Portrait with TensorRT and ONNX models
AIFSH/ComfyUI-MimicBrush
a comfyui custom node for MimicBrush
harry0703/AudioNotes
快速提取音视频内容,整理成一份结构化的markdown笔记
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
landing-ai/vision-agent
Vision agent
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
shadowcz007/comfyui-liveportrait
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
kijai/ComfyUI-LivePortraitKJ
ComfyUI nodes for LivePortrait
KwaiVGI/LivePortrait
Bring portraits to life!
SamKhoze/ComfyUI-DeepFuze
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, and voice cloning.
DaoCloud/public-image-mirror
很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Gouryella/ChatTTS-webui
A Web UI developed based on ChatTTS, implemented using Nuxt 3 and Ant Design.
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
2noise/ChatTTS
A generative speech model for daily dialogue.
anothermartz/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
RaSan147/pixi-live2d-display
A PixiJS plugin to display Live2D models of any kind (With lip-sync from audio)
v3ucn/live2d-TTS-LLM-GPT-SoVITS-Vtuber
低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting