gyq517

gyq517's Stars

taurusxin/ncmdump
Convert Netease Cloud Music ncm files to mp3/flac files.
Language:C++1.2k183
nuniz/blind_rt60
Algorithm for blind estimation of reverberation time
Language:Jupyter Notebook161
sungwon23/BSRNN
Language:Python9215
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Language:Python52076
ZhaZhaFon/resource_speech
A collection of resources for speech processing algorithms || NEWS: the official VoxCeleb link recently stopped working, so an external download link has been added
458
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python72.1k8.6k
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and its generalization abilities.
Language:Python1.7k303
hongfeixue/KWS_pytorch
Keyword spotting and speech wake-up in PyTorch: DNN, CNN, TDNN, DFSMN, LSTM
Language:Python409
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language:Python1.9k192
open-mmlab/mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Language:Python4.3k1.2k
sukumo28/vscode-audio-preview
VS Code extension that allows you to preview and play audio files.
Language:TypeScript14716
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9k1.4k
szagoruyko/pytorchviz
A small package to create visualizations of PyTorch execution graphs
Language:Jupyter Notebook3.2k279
jaakkopasanen/AutoEq
Automatic headphone equalization from frequency responses
Language:Python13.5k2.5k
rishikksh20/multiband-hifigan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python383
YuanGongND/vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
Language:Jupyter Notebook12510
qiuqiangkong/dcase2018_task1
Language:Python2610
haoheliu/torchsubband
Pytorch implementation of subband decomposition
Language:HTML8913
cwitkowitz/lhvqt
Frontend filterbank learning module with HVQT initialization capabilities.
Language:Python203
TeXworks/texworks
Main codebase for TeXworks, a simple interface for working with TeX documents
Language:C++699130
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Language:Python1.1k414
deeplyinc/Nonverbal-Vocalization-Dataset
Language:Jupyter Notebook284
deeplyinc/Parent-Child-Vocal-Interaction-Dataset
Language:Jupyter Notebook121
pytorch/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language:Python22.5k9.6k
galgreshler/Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Language:Python18835
patriceguyot/Yin
Fast Python implementation of the Yin algorithm: a fundamental frequency estimator
Language:Python9320
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python4.5k435
ttroy50/cmake-examples
Useful CMake Examples
Language:CMake12.4k2.5k
deezer/spleeter
Deezer source separation library including pretrained models.
Language:Python26k2.9k
qiuqiangkong/panns_inference
Language:Python20231