jiahello

jiahello's Stars

gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python2.3k237
kangyiwen/TTSlist
10000 chatTTS voices ！chatTTS 音色库，再也不为音色抽卡烦恼啦。这是我第一个项目，熬夜龟速生产10000条音色并上传Github，给点鼓励呗哈！主域名：https://www.TTSlist.com 备用：http://ttslist.aiqbh.com/
Language:HTML12612
libukai/Awesome-ChatTTS
官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
1k66
lenML/ChatTTS-Forge
🍦 ChatTTS-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Language:Python65382
panyanyany/Awesome-ChatTTS
ChatTTS资源大全，免费体验地址，音色库等
1.1k87
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python33.4k4.1k
aihacker111/Efficient-Live-Portrait
Fast running Live Portrait with TensorRT and ONNX models
Language:Python12210
AIFSH/ComfyUI-MimicBrush
a comfyui custom node for MimicBrush
Language:Python886
harry0703/AudioNotes
快速提取音视频内容，整理成一份结构化的markdown笔记
Language:Python984113
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Language:Python1.1k76
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python2.4k275
landing-ai/vision-agent
Vision agent
Language:Python1.2k124
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python5.9k642
modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
Language:Python4.8k294
shadowcz007/comfyui-liveportrait
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
Language:Python36932
kijai/ComfyUI-LivePortraitKJ
ComfyUI nodes for LivePortrait
Language:Python1.4k108
KwaiVGI/LivePortrait
Bring portraits to life!
Language:Python11.7k1.2k
SamKhoze/ComfyUI-DeepFuze
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, and voice cloning.
Language:Python28232
DaoCloud/public-image-mirror
很多镜像都在国外。比如 gcr 。国内下载很慢，需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
Language:Shell5.4k770
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Language:Python10.8k835
Gouryella/ChatTTS-webui
A Web UI developed based on ChatTTS, implemented using Nuxt 3 and Ant Design.
Language:Python6514
jianchang512/ChatTTS-ui
一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
Language:Python5.9k669
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python30.6k3.3k
anothermartz/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
Language:Jupyter Notebook59794
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Language:Python1.2k140
KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Language:Jupyter Notebook2.7k390
RaSan147/pixi-live2d-display
A PixiJS plugin to display Live2D models of any kind (With lip-sync from audio)
Language:TypeScript5915
v3ucn/live2d-TTS-LLM-GPT-SoVITS-Vtuber
低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案
Language:HTML13626
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
Language:Python1.8k293
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Language:Python2.4k292