crlandsc
Audio ML engineer and researcher with a passion for music and spatial audio.
Whitebalance
Chicago, IL
crlandsc's Stars
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
KindXiaoming/pykan
Kolmogorov Arnold Networks
black-forest-labs/flux
Official inference repo for FLUX.1 models
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
NX-AI/xlstm
Official repository of the xLSTM.
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
k2-fsa/icefall
NVlabs/MambaVision
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
kyegomez/zeta
Build high-performance AI models with modular building blocks
kyegomez/VisionMamba
Implementation of Vision Mamba from the paper "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model". It is 2.8x faster than DeiT and saves 86.8% GPU memory when performing batch inference to extract features on high-resolution images.
soundata/soundata
Python library for downloading, loading & working with sound datasets
kkoutini/PaSST
Efficient Training of Audio Transformers with Patchout
muditbhargava66/PyxLSTM
Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.
JusperLee/LibriSpace
mcomunita/AFX-Research
Scientific literature about Audio Effects
sh-lee-prml/PeriodWave
The official Implementation of PeriodWave and PeriodWave-Turbo
pierreaubert/spinorama
A library to display and compare spinorama (speakers measurements) graphs.
lucidrains/adam-atan2-pytorch
Implementation of the proposed Adam-atan2 from Google DeepMind in PyTorch
YuHengsss/VSSD
Introduce Mamba2 to Vision.
RoyChao19477/PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
KimberleyJensen/Mel-Band-Roformer-Vocal-Model
nomonosound/fast-align-audio
A fast python library for aligning similar audio snippets passed in as NumPy arrays
xi-j/Mamba-ASR
ConMamba for Automatic Speech Recognition
kwatcharasupat/query-bandit
Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems
yamathcy/ISMIR-2024-Papers
kwatcharasupat/bandit-v2
Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"
kwatcharasupat/source-separation-landing
Landing Page for All Things Source Separation
kwatcharasupat/divide-and-remaster-v3
Landing Page for Divide and Remaster v3
YinPing-Cho/PCS-FIR-Filter
A time-domain extension to "Perceptual Contrast Stretching on Target Feature for Speech Enhancement"
MysticShadow427/simplistic-zipformer
Simplistic Implementation of Zipformer: A faster and better encoder for automatic speech recognition in PyTorch