gyq517's Stars
taurusxin/ncmdump
Convert Netease Cloud Music ncm files to mp3/flac files.
nuniz/blind_rt60
Algorithm for blind estimation of reverberation time
sungwon23/BSRNN
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
ZhaZhaFon/resource_speech
A collection of resources on speech algorithms (Resource for Speech Processing) || NEWS: the official VoxCeleb link has been failing recently, so an external download link has been added
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and its generalization abilities.
hongfeixue/KWS_pytorch
Keyword spotting (speech wake-up) in PyTorch: DNN, CNN, TDNN, DFSMN, LSTM
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
open-mmlab/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
sukumo28/vscode-audio-preview
VS Code extension that allows you to preview and play audio files.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
szagoruyko/pytorchviz
A small package to create visualizations of PyTorch execution graphs
jaakkopasanen/AutoEq
Automatic headphone equalization from frequency responses
rishikksh20/multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
YuanGongND/vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
qiuqiangkong/dcase2018_task1
haoheliu/torchsubband
PyTorch implementation of subband decomposition
cwitkowitz/lhvqt
Frontend filterbank learning module with HVQT initialization capabilities.
TeXworks/texworks
Main codebase for TeXworks, a simple interface for working with TeX documents
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
deeplyinc/Nonverbal-Vocalization-Dataset
deeplyinc/Parent-Child-Vocal-Interaction-Dataset
pytorch/examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
galgreshler/Catch-A-Waveform
Official PyTorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
patriceguyot/Yin
Fast Python implementation of the Yin algorithm: a fundamental frequency estimator
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
ttroy50/cmake-examples
Useful CMake Examples
deezer/spleeter
Deezer source separation library including pretrained models.
qiuqiangkong/panns_inference