abeiro's Stars
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Haoming02/sd-forge-couple
An Extension for Forge Webui that implements Attention Couple
MinLL/MinAI
Bridge between LLMs and various Skyrim Mods
abeiro/HerikaServer
fishaudio/fish-speech
Brand new TTS solution
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
MackinationsAi/sd-webui-udav2
A1111 Extension integration for Upgraded-Depth-Anything-V2 - UDAV2
AkiiLucky/sd-webui-animed-video-controlnet
sd-webui 扩展插件, 用于实现可控动漫视频生成(开发中)
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Woolverine94/biniou
a self-hosted webui for 30+ generative ai
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
pkuliyi2015/multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
Elbios/HerikaAITools
Docker container with AI tools for Herika Skyrim mod
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
ImPavloh/VoiceIt
Change the voice of audios using pre-trained streamer voice models using AI.
daswer123/xtts-finetune-webui
Slightly improved official version for finetune xtts
lxe/tts-server
A simple TTS server for generating speech using StyleTTS2
s9roll7/ebsynth_utility
AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth.
CharmedBaryon/CommonLibSSE-NG
This is a reverse engineered library for Skyrim Special Edition and Skyrim VR.
coqui-ai/xtts-streaming-server
powerof3/CommonLibSF
A collaborative reverse-engineered library for Starfield
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
trzy/llava-cpp-server
LLaVA server (llama.cpp).