Pinned Repositories
ASR_Syllable
采用音节建模构建语音识别声学模型
ASR_WORD
采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。
ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
athena
an open-source implementation of sequence-to-sequence based speech processing engine
audiotsm
A python library for real-time audio time-scale modification procedures
biaxial-rnn-music-composition
A recurrent neural network designed to generate classical music.
GCOT
Graph Convolutional Optimal Transport for Hyperspectral Image Spectral Clustering
POP909-Dataset
This is the dataset repository for the paper: POP909: A Pop-song Dataset for Music Arrangement Generation
prefix-beam-search
Code for prefix beam search tutorial by @labodk
psola
Python package implementing the TD-PSOLA algorithm for speech processing
suldier's Repositories
suldier/GCOT
Graph Convolutional Optimal Transport for Hyperspectral Image Spectral Clustering
suldier/POP909-Dataset
This is the dataset repository for the paper: POP909: A Pop-song Dataset for Music Arrangement Generation
suldier/ASR_Syllable
采用音节建模构建语音识别声学模型
suldier/ASR_WORD
采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。
suldier/ASRT_SpeechRecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
suldier/athena
an open-source implementation of sequence-to-sequence based speech processing engine
suldier/CDD
suldier/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
suldier/DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
suldier/DualSC
suldier/expressive_tacotron
Tensorflow Implementation of Expressive Tacotron
suldier/faust
Functional programming language for signal processing and sound synthesis
suldier/FloWaveNet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
suldier/gantts
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
suldier/intro2musictech
公众号“无痛入门音乐科技”开源代码
suldier/Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
suldier/LPCTron
Tacotron2 + LPCNET for complete End-to-End TTS System
suldier/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
suldier/MTTS
A Demo of Mandarin/Chinese TTS frontend
suldier/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
suldier/POT
POT : Python Optimal Transport
suldier/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
suldier/Sinsy-Remix
The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"
suldier/so-vits-svc
SoftVC VITS Singing Voice Conversion
suldier/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
suldier/suldier.github.io
suldier/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
suldier/Tacotron2-1
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
suldier/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)
suldier/wavenet_vocoder
WaveNet vocoder