Hongjiang-Yu's Stars
huaweicloud/ModelArts-Lab
ModelArts-Lab是示例代码库。更多AI开发学习交流信息,请访问华为云AI开发者社区:huaweicloud.ai
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
NVIDIA/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
tts-tutorial/survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
cocosci/NSC
Neural Speech Codec
google/lyra
A Very Low-Bitrate Codec for Speech Compression
FrancescoSaverioZuppichini/Pytorch-how-and-when-to-use-Module-Sequential-ModuleList-and-ModuleDict
Code for my medium article
WenzheLiu-Speech/awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
xiph/LPCNet
Efficient neural speech synthesis
ZhihaoDU/speech_feature_extractor
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplitude Modulation Spectrum(AMS) and so on.
chmodsss/noizeus_corpora
Speech corpora for the speech recognition evaluation system
bentrevett/pytorch-seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
BYRTIMO/END-TO-END-SPEECH-ENHANCEMENT-BASED-ON-DISCRETE-COSINE-TRANSFORM
VITA-Group/TransGAN
[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
lRomul/argus-freesound
Kaggle | 1st place solution for Freesound Audio Tagging 2019
lRomul/argus
Lightweight library for training neural networks in PyTorch
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
santi-pdp/segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
sri-kankanahalli/autoencoder-speech-compression
Code for "End-to-End Optimized Speech Coding with Deep Neural Networks" (ICASSP 2018)
craigmacartney/Wave-U-Net-For-Speech-Enhancement
Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemented for the task of speech enhancement in the time-domain.
yiyang7/Super_Resolution_with_CNNs_and_GANs
Image Super-Resolution Using SRCNN, DRRN, SRGAN, CGAN in Pytorch
LoSealL/VideoSuperResolution
A collection of state-of-the-art video or single-image super-resolution architectures, reimplemented in tensorflow.
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
tensorflow/tfjs-models
Pretrained models for TensorFlow.js
geektutu/interview-questions
机器学习/深度学习/Python/Go语言面试题笔试题(Machine learning Deep Learning Python and Golang Interview Questions)
huyanxin/phasen
A unofficial Pytorch implementation of Microsoft's PHASEN
santi-pdp/pase
Problem Agnostic Speech Encoder
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
datawhalechina/competition-baseline
数据挖掘、计算机视觉、自然语言处理、推荐系统竞赛知识、代码、思路