nmstoker's Stars
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
danielmiessler/fabric
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
CopilotKit/CopilotKit
A framework for building custom AI Copilots 🤖 in-app AI chatbots, in-app AI Agents, & AI-powered Textareas.
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
Picovoice/porcupine
On-device wake word detection powered by deep learning
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
KoljaB/RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
b4rtaz/distributed-llama
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
n4ze3m/page-assist
Use your locally running AI models to assist you in your web browsing
arunsupe/semantic-grep
grep for words with similar meaning to the query
ddupont808/GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
fixie-ai/ultravox
A fast multimodal LLM for real-time voice
svpino/alloy-voice-assistant
facebookresearch/av_hubert
A self-supervised learning framework for audio-visual speech
ricky0123/vad
Voice activity detector (VAD) for the browser with a simple API
mezbaul-h/june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
segment-any-text/wtpsplit
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
dscripka/openWakeWord
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
distantmagic/paddler
Stateful load balancer custom-tailored for llama.cpp
microsoft/T-MAC
Low-bit LLM inference on CPU with lookup table
KoljaB/Linguflex
Command Your World with Voice
PacktPublishing/Building-LLM-Powered-Applications
Building Large Language Model Applications, Published by Packt
ShaShekhar/aaiela
edgedb/memhive
GrantCuster/gemini-spatial-example
How to use bounding boxes with the Gemini API
KoljaB/stream2sentence
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
shafkevi/lambda-bedrock-s3-streaming-rag
Fully serverless streaming RAG application
lvillasen/Spectrogram
Live sound spectrogram in JavaScript. It can be configured to change buffer size, FFT function, colormap, window type, minimum and maximum frequencies, loudness sensibility, scrolling direction, scrolling speed and pause scrolling.
vital-ai/vital-wakeword-js