coqui-tts

There are 21 repositories under coqui-tts topic.

  • mezbaul-h/june

    Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

    Language:Python73471046
  • BoltzmannEntropy/xtts2-ui

    A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech

    Language:Python29783547
  • karim23657/Persian-tts-coqui

    Persian/Farsi text to speech(TTS) training using coqui tts

    Language:Jupyter Notebook12552418
  • skshadan/TTS-RVC-API

    Text to Speech using Coqui TTS + RVC

    Language:Python956819
  • rowan-sl/coqui-rs

    Rust bindings to the https://github.com/coqui-ai TTS library

    Language:Rust18222
  • deepily/genie-in-the-box

    Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS

    Language:Jupyter Notebook15204
  • skshadan/WhisCall

    A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the WhatsApp Desktop App.

    Language:Python14103
  • nsourlos/voice_cloning_tools

    Various tools to clone a voice

    Language:Jupyter Notebook12201
  • anujsahani01/VoiceCloning-coqui-TTS

    Voice cloning using coqui-TTS

    Language:Jupyter Notebook10100
  • harmlessman/CoquiTTSGui

    Gui for users who use the coqui-TTS vits model.

    Language:Python10221
  • Aditya1Jhaveri/DoyenTalker

    DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The system utilizes Coqui TTS for text-to-speech generation, along with various face rendering and animation techniques to create a video where the given avatar articulates the speech.

    Language:Python8202
  • overcrash66/OpenTranslator

    Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features

    Language:Python5202
  • gusanmaz/echosight

    EchoSight is a tool that helps visually impaired individuals by audibly describing images taken with a Raspberry Pi Camera or inputted via image path or URL across different operating systems.

    Language:Python4200
  • Inc44/TheTTS

    Synthesize speech using state-of-the-art open and closed-source tools

    Language:Python3200
  • astrologos/py-speakeasy

    Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.

    Language:Jupyter Notebook2200
  • HappyBravo/chatGPT_4voice

    ChatGPT with Voice input and audio response.

    Language:Python2200
  • ayan4m1/autoytpoo

    Generate cursed videos with AI

    Language:Python120
  • thomassteinreiter/audiobook

    This is a proof of concept documentation for creating an audio book using free TTS tools

    Language:Python0100
  • dzsezer/children-stories

    OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.

    Language:Jupyter Notebook10
  • thebitanpaul/Linguistic

    This web app can clone anyone's voice and generate speech of given text and also convert any given text to speech using Google TTS.

    Language:Jupyter Notebook10
  • Webxspark/voxgenie

    Clone your voice with just a 10-second sample! This project allows users to generate personalized text-to-speech models that replicate their voice using Coqui TTS engine.

    Language:JavaScript101