Honee-W's Stars
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Andong-Li-speech/TaylorSENet
This is the implementation of the paper "Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement", which was accepted at IJCAI-ECAI 2022 (long oral)
audeering/opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
artetxem/undreamt
Unsupervised Neural Machine Translation
THUNLP-MT/THUMT
An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group
OpenNMT/OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
tencent-ailab/FRA-RIR
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
xanguera/BeamformIt
BeamformIt acoustic beamforming software
jaywcjlove/awesome-mac
This list has grown very large and now differs from the original idea. It collects premium software in various categories.
facebookresearch/AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
Woodsssss/fang-NWPUJiaoWuSystem
A collection of personal projects
EthicalML/awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
etzinis/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
sony/diffiner
google-research/sound-separation
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
neillu23/CDiffuSE
Conditional Diffusion Probabilistic Model for Speech Enhancement
gudgud96/frechet-audio-distance
A lightweight library for Frechet Audio Distance calculation.
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
microsoft/torchscale
Foundation Architecture for (M)LLMs
lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Rikorose/DeepFilterNet
Noise suppression using deep filtering
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
NaiboWang/EasySpider
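A visual no-code/code-free web crawler/spider. EasySpider is a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Alias: ServiceWrapper, an intelligent service wrapping system for web applications.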