abeiro

abeiro's Stars

SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python4.3k396
Haoming02/sd-forge-couple
An Extension for Forge Webui that implements Attention Couple
Language:JavaScript22011
MinLL/MinAI
Bridge between LLMs and various Skyrim Mods
Language:Papyrus95
abeiro/HerikaServer
Language:PHP1612
fishaudio/fish-speech
Brand new TTS solution
Language:Python13.4k1k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python34.3k3.9k
MackinationsAi/sd-webui-udav2
A1111 Extension integration for Upgraded-Depth-Anything-V2 - UDAV2
Language:Python371
AkiiLucky/sd-webui-animed-video-controlnet
sd-webui 扩展插件，用于实现可控动漫视频生成（开发中）
Language:Jupyter Notebook6
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python3.8k655
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diﬀusion Models for Consistent Human Image Animation".
Language:Python99654
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
Language:Python29.2k2.9k
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
Language:Python1.7k276
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language:Python2.4k253
Woolverine94/biniou
a self-hosted webui for 30+ generative ai
Language:Python45847
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
Language:HTML1k112
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
Language:Python38590
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7.6k740
pkuliyi2015/multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
Language:Python4.7k334
Elbios/HerikaAITools
Docker container with AI tools for Herika Skyrim mod
Language:Python3
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python4.5k391
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
Language:Python1.2k134
ImPavloh/VoiceIt
Change the voice of audios using pre-trained streamer voice models using AI.
Language:Python113
daswer123/xtts-finetune-webui
Slightly improved official version for finetune xtts
Language:Python21275
lxe/tts-server
A simple TTS server for generating speech using StyleTTS2
Language:Python266
s9roll7/ebsynth_utility
AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.
Language:Python1.2k127
CharmedBaryon/CommonLibSSE-NG
This is a reverse engineered library for Skyrim Special Edition and Skyrim VR.
Language:C++13731
coqui-ai/xtts-streaming-server
Language:Python28881
powerof3/CommonLibSF
A collaborative reverse-engineered library for Starfield
Language:C++2
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Language:Python5.1k411
trzy/llava-cpp-server
LLaVA server (llama.cpp).
Language:C++1779