KoljaB
Need ultra-responsive and robust voice AI? I develop scalable, low-latency real-time transcription and TTS pipelines on cloud platforms.
Pinned Repositories
ai_cli_tools
AI at your fingertips: powerful CLI tools for speech, text, and language processing
AIVoiceChat
Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
Linguflex
Command Your World with Voice
LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
LocalEmotionalAIVoiceChat
Simulates talk with an AI that can express emotions
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
RealtimeTTS
Converts text to speech in realtime
stream2sentence
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
TurnVoice
Voice Transformation for Videos. π€ππ¬
WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
KoljaB's Repositories
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
KoljaB/RealtimeTTS
Converts text to speech in realtime
KoljaB/Linguflex
Command Your World with Voice
KoljaB/LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
KoljaB/AIVoiceChat
Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
KoljaB/TurnVoice
Voice Transformation for Videos. π€ππ¬
KoljaB/WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
KoljaB/stream2sentence
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
KoljaB/LocalEmotionalAIVoiceChat
Simulates talk with an AI that can express emotions
KoljaB/ai_cli_tools
AI at your fingertips: powerful CLI tools for speech, text, and language processing
KoljaB/vector_companion_fork
A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow your computer journey wherever you go!
KoljaB/KoljaB
KoljaB/oi-fork
Oi is an open-source cli tool that works on top of codellama and generates code in any editor without extensions.
KoljaB/coqui-ai-TTS
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
KoljaB/openai_function_call
Helper functions to create openai function calls w/ pydantic
KoljaB/privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
KoljaB/ReadAloud
Mark text or url and let it read out loud
KoljaB/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
KoljaB/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
KoljaB/xtts-webui
Webui for using XTTS and for finetuning it