josephsuccar

josephsuccar's Stars

xir4n/dtcwt
Python port of the Dual-Tree Complex Wavelet Transform toolbox for MATLAB
Language:Python21
HigherOrderCO/Bend
A massively parallel, high-level programming language
Language:Rust17.2k426
andylolu2/simpleGEMM
The simplest but fast implementation of matrix multiplication in CUDA.
Language:Cuda273
Rikorose/DeepFilterNet
Noise supression using deep filtering
Language:Python2.4k218
mmathew23/improved_edm
Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"
Language:Python873
Rayhane-mamah/Efficient-VDVAE
Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more"
Language:Python19023
mosaicml/diffusion
Language:Python66867
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
Language:Python1.1k82
facebookresearch/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language:Python40847
eiz/SynchronousAudioRouter
Low latency application audio routing for Windows
Language:C++1k136
DioxusLabs/dioxus
Fullstack app framework for web, desktop, mobile, and more.
Language:Rust20.2k775
RustAudio/cpal
Cross-platform audio I/O library in pure Rust
Language:Rust2.6k347
LukeMathWalker/zero-to-production
Code for "Zero To Production In Rust", a book on API development using Rust.
Language:Rust5.7k486
state-spaces/mamba
Mamba SSM architecture
Language:Python12.5k1.1k
alesaccoia/VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
Language:Python64992
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python4.4k380
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.5k272
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python33.4k4.1k
bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
Language:Python98577
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language:Python1.1k101
jeanfeydy/geomloss
Geometric loss functions between point clouds, images and volumes
Language:Python58257
google/lyra
A Very Low-Bitrate Codec for Speech Compression
Language:C++3.8k356
francois-rozet/piqa
PyTorch Image Quality Assessement package
Language:Python40018
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4k394
libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
Language:C2.7k117
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
26.3k2.2k
zademn/mnist-mlops-learning
In this project I played with mlflow, streamlit and fastapi to create a training and prediction app on digits
Language:Python10220
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Language:Python7.8k1.4k