skripnik's Stars
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
deepset-ai/haystack
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
BuilderIO/partytown
Relocate resource intensive third-party scripts off of the main thread and into a web worker. 🎉
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
webhintio/hint
💡 A hinting engine for the web
NVlabs/eg3d
vocodedev/vocode-python
🤖 Build voice-based LLM agents. Modular + open source.
RameenAbdal/StyleFlow
StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows (ACM TOG 2021)
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Vaibhavs10/open-tts-tracker
KoljaB/LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
svagcrew/binking
Get bank logo, colors, brand and etc. by card number
LAION-AI/natural_voice_assistant
optas/artemis
Learning to ground explanations of affect for visual art.
alphacep/vosk-tts
Text To Speech Synthesis with Vosk
NaNoGenMo/2020
National Novel Generation Month, 2020 edition.
tonythomas01/wikipedia-section-summaries
ChatGPT driven article and section summarizer for Wikipedia
ScaleVoice/vocode-python
🤖 Build voice-based LLM agents. Modular + open source.