KoljaB
Please save your time and mine and do not offer me any jobs. I don't collaborate and I don't work for money.
KoljaB's Stars
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Vaibhavs10/insanely-fast-whisper
adamcohenhillel/ADeus
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.
flowtyone/flowty-realtime-lcm-canvas
A realtime sketch to image demo using LCM and the gradio library.
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Nutlope/notesGPT
Record voice notes & transcribe, summarize, and get tasks
collabora/WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
jianfch/stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
jasonacox/tinytuya
Python API for Tuya WiFi smart devices using a direct local area network (LAN) connection or the cloud (TuyaCloud API).
dscripka/openWakeWord
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
daswer123/xtts-webui
Webui for using XTTS and for finetuning it
KoljaB/LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
coqui-ai/xtts-streaming-server
virattt/financial-datasets
Financial datasets for LLMs 🧪
KoljaB/TurnVoice
Voice Transformation for Videos.
xue160709/Local-LLM-User-Guideline
virattt/financial-agent
A financial agent, built entirely with LangChain!
remyxai/FFMPerative
Chat to Compose Video
abacaj/openhermes-function-calling
KoljaB/WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
daswer123/deepspeed-windows-wheels
A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows
jxnl/noteGPT
Record voice notes & transcribe, summarize, and get tasks
S95Sedan/Deepspeed-Windows
Deepspeed windows information
nyno-ai/openai-token-counter
Count tokens for OpenAI accurately with support for all parameters like name, functions.
Rohit-Karki/WellBeingApp
Android app for personal well-being, with features such as a step counter, water reminder, etc.
izzyx6/Linguflex
Personal Assistant that enables voice-based conversation with custom AI personalities. Handles smart home control and music playback. Efficiently conducts Internet searches, retrieves emails, and presents weather updates and news. Assists with scheduling appointments. Capable of image search and generation.