mtxing

Pinned Repositories

AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Language:Python1 0 00
ambisonic_rt_estimation
Ambisonic Blind Reverberation Time Estimation
Language:Python00
aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
Language:Python0 0 00
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Language:Python00
athena
an open-source implementation of sequence-to-sequence base speech processing engine
Language:Python0 1 00
athena-signal
Language:C0 1 00
Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB0 0 00
BLOOM-Net
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
Language:Python0 0 00
conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python0 0 00
conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"
Language:Python10

mtxing's Repositories

mtxing/AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Language:Python1 0 00
mtxing/aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
Language:Python0 0 00
mtxing/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB0 0 00
mtxing/BLOOM-Net
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
Language:Python0 0 00
mtxing/conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python0 0 00
mtxing/deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
mtxing/DeepLearning-500-questions
深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为18个章节，50余万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系scutjy2015@163.com 版权所有，违权必究 Tan 2018.06
mtxing/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
mtxing/espnet
End-to-End Speech Processing Toolkit
Language:Python1 0
mtxing/FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
mtxing/generative_inpainting
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
mtxing/HGCN
The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"
mtxing/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
mtxing/hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
mtxing/IguanaTexMac
IguanaTex for mac
mtxing/MetricGAN
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awards)
mtxing/metrics
Machine learning metrics for distributed, scalable PyTorch applications.
mtxing/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
Language:HTML1 0
mtxing/Parrotron
mtxing/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
mtxing/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Language:Python1 0
mtxing/pytorch-inpainting-with-partial-conv
Unofficial pytorch implementation of 'Image Inpainting for Irregular Holes Using Partial Convolutions' [Liu+, ECCV2018]
Language:Python1 0
mtxing/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
mtxing/speechbrain
A PyTorch-based Speech Toolkit
mtxing/StyleTransfer
Implementation of "Perceptual Losses for Real-Time Style Transfer and Super-Resolution" in PyTorch
mtxing/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
mtxing/The_C_Programming_Language
C_Programming_Language submit
Language:C1 0
mtxing/torch-dct
DCT (discrete cosine transform) functions for pytorch
mtxing/vits_chinese
Best TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Also for voice clone!
mtxing/voicefixer_main
General Speech Restoration
Language:Python0 0