kgmiller's Stars
google-research/google-research
Google Research
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
spotify/annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
uNetworking/uWebSockets.js
μWebSockets for Node.js back-ends :metal:
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
OpenInterpreter/01
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
sqlchat/sqlchat
Chat-based SQL Client and Editor for the next decade
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
h2oai/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
yahoojapan/NGT
Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data
nypublicradio/audiogram
Turn audio into a shareable video.
kyledrake/coinpunk
Open source, self-hosted DIY Bitcoin wallet service
open-dict-data/ipa-dict
Monolingual wordlists with pronunciation information in IPA
tanhakabir/SwiftAudioPlayer
Streaming and realtime audio manipulation with AVAudioEngine
alexklibisz/elastiknn
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
manojpamk/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
haoheliu/voicefixer_main
General Speech Restoration
ShivaliGoel/Quotes-500K
bayeru/chat-to-your-database
Chat to your database with AI. An experimental app to test the abilities of LLMs to query SQL databases using natural language.
barbulescualex/MetalAudioVisualizer
Tutorial on making your first Audio Visualizer in Swift using Metal, Accelerate, and AVAudioEngine!
dennlinger/TopicalChange
Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.
skywalker023/fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
fgnt/graph_pit
whrg/MLB_prediction
Predict Major League Baseball games (win/loss) with machine learning