randxie
Explore New Things Now. Led Robinhood ML infrastructure, Founding engineer of Vertex AI Feature Store, Kaggle Competition Master
Explore New Things, San Jose
randxie's Stars
getify/You-Dont-Know-JS
A book series on JavaScript. @YDKJS on twitter.
pouchdb/pouchdb
🦘 PouchDB is a pocket-sized database.
pion/webrtc
Pure Go implementation of the WebRTC API
mediar-ai/screenpipe
library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% local, developer friendly. 24/7 screen, mic, keyboard recording and control
yetone/avante.nvim
Use your Neovim like using Cursor AI IDE!
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
NeoVertex1/SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
huggingface/parler-tts
Inference and training library for high-quality TTS models.
klauspost/compress
Optimized Go Compression Packages
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
vocodedev/vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
Rikorose/DeepFilterNet
Noise suppression using deep filtering
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
VITA-MLLM/VITA
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
filipstrand/mflux
An MLX port of FLUX based on the Huggingface Diffusers implementation.
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
microsoft/BitBLAS
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
ociubotaru/transcripts
YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
alibaba/ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
y-crdt/ypy
Python bindings to y-crdt
livekit/python-sdks
LiveKit real-time and server SDKs for Python
OFA-Sys/AIR-Bench
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension