duduOliver's Stars
beetbox/audioread
cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python
rongzhiy/LiTiaotiao
李跳跳软件及使用指南❤❤❤
maotoumao/MusicFree
插件化、定制化、无广告的免费音乐播放器
ddbourgin/numpy-ml
Machine learning, in numpy
zhuima/awesome-cloudflare
⛅️ 精选的 Cloudflare 工具、开源项目、指南、博客和其他资源列表。/ ⛅️ A curated list of Cloudflare tools, open source projects, guides, blogs and other resources.
ZFTurbo/Music-Source-Separation-Training
Repository for training models for music source separation.
qiuqiangkong/torchlibrosa
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
AudioLLMs/AudioLLM
Audio Large Language Models
mewcoder/SharedCourses
大学课程共享计划整理
emidan19/deep-tempest
Restoration for TEMPEST images using deep-learning
anusfoil/DExter
DExter: Learning and Controlling Performance Expression through Diffusion models
Salensoft/thu-cst-cracker
清华大学计算机系课程攻略
Yuan-ManX/audio-development-tools
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.
MusicLang/musiclang_predict
AI Prediction api of the MusicLang package
jaeyeonkim99/EnCLAP
Official Implementation of EnCLAP (ICASSP 2024)
braindecode/braindecode
Deep learning software to decode EEG, ECG or MEG signals
davidkant/mai
Music and Artificial Intelligence
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
lonce/util_notebooks
shengyp/doing_the_PhD
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
symphonynet/SymphonyNet
Symphony Generation with Permutation Invariant Language Model
AudioCommons/timbral_models
Python scripts for modelling timbral attributes
MalcolmSlaney/python_auditory_toolbox
This is a Python implementation of the Auditory Toolbox
XinhaoMei/WavCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
NetEase/Polyphonic-TrOMR
TrOMR:Transformer-based Polyphonic Optical Music Recognition
duduOliver/VLA-252A
FMInference/FlexiGen
Running large language models on a single GPU for throughput-oriented scenarios.