Pinned Repositories
AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
ambisonic_rt_estimation
Ambisonic Blind Reverberation Time Estimation
aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
athena
An open-source implementation of a sequence-to-sequence based speech processing engine
athena-signal
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
BLOOM-Net
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"
mtxing's Repositories
mtxing/AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
mtxing/aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
mtxing/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
mtxing/BLOOM-Net
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
mtxing/conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
mtxing/deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
mtxing/DeepLearning-500-questions
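Deep Learning 500 Questions: covers frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas in question-and-answer form, to help the author and any readers who need it. The book has 18 chapters and more than 500,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06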
mtxing/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
mtxing/espnet
End-to-End Speech Processing Toolkit
mtxing/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
mtxing/generative_inpainting
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
mtxing/HGCN
The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"
mtxing/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
mtxing/hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
mtxing/IguanaTexMac
IguanaTex for Mac
mtxing/MetricGAN
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awards)
mtxing/metrics
Machine learning metrics for distributed, scalable PyTorch applications.
mtxing/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
mtxing/Parrotron
mtxing/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in PyTorch
mtxing/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
mtxing/pytorch-inpainting-with-partial-conv
Unofficial PyTorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions' [Liu+, ECCV2018]
mtxing/SpecAugment
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
mtxing/speechbrain
A PyTorch-based Speech Toolkit
mtxing/StyleTransfer
Implementation of "Perceptual Losses for Real-Time Style Transfer and Super-Resolution" in PyTorch
mtxing/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
mtxing/The_C_Programming_Language
C_Programming_Language submit
mtxing/torch-dct
DCT (discrete cosine transform) functions for PyTorch
mtxing/vits_chinese
Best TTS based on BERT and VITS with some Natural Speech features from Microsoft; also for voice cloning!
mtxing/voicefixer_main
General Speech Restoration