LXP-Never's Stars
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
XiaoMi/mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
libAudioFlux/audioFlux
A library for audio and music analysis and feature extraction.
magenta/ddsp
DDSP: Differentiable Digital Signal Processing
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
modelscope/ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
quic/aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
bytedance/piano_transcription
gmalivenko/pytorch2keras
PyTorch to Keras model converter
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with an implementation of the ITU-R BS.1770-4 loudness algorithm
ddlBoJack/Speech-Resources
Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome
python-acoustics/python-acoustics
A Python library aimed at acousticians.
Staok/Awesome-Embeded-AI
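A collection of progress on machine learning algorithm implementations for embedded systems, along with related papers, articles, and development libraries, to help beginners quickly understand, learn, and get started with embedded machine learning. CC-BY-NC-SA 4.0.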
shahules786/mayavoz
PyTorch-based speech enhancement toolkit.
busyyang/python_sound_open
Speech signal processing experiment tutorial, with Python code
vb000/Waveformer
A deep neural network architecture for low-latency audio processing
funcwj/conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https://github.com/funcwj/aps)
kkumatani/distant_speech_recognition
Spatial signal processing toolkit, a.k.a. beamforming toolkit 2.0 (BTK2.0)
sdercolin/vlabeler
Open source voice labeling application
jhauret/eben
Repo for source code of EBEN: Extreme Bandwidth Extension Network
YunyangZeng/TAPLoss
echocatzh/PFDKF
Partitioned-block-based frequency-domain Kalman filter
YoungJay0612/Single-Channel-Speech-Enhancement
Keeps track of good articles on speech processing, mainly speech enhancement, including speech denoising, speech dereverberation, AEC, and AGC.
SeventeenChen/Python_Speech_SZY
Python version of Song Zhiyong's "Applications of MATLAB in Speech Signal Analysis and Synthesis"
ASAP-Group/Multichannel-Enhancement
Block-Online Multi-Channel Speech Enhancement Using DNN-Supported Relative Transfer Function Estimates
steDamiano/pyroadacoustics
Python package for road acoustics simulation based on variable length delay lines.
CarmiShimon/Phase-Aware-Deep-Speech-Enhancement
Phase-Aware Deep Speech Enhancement in PyTorch
lwnn/lwnn
Lightweight Neural Network