Pinned Repositories
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
async_cosyvoice
使用vllm加速cosyvoice2的推理
beaqlejs
*BeaqleJS* provides a framework to create browser based listening tests and is purely based on open web standards like HTML5 and Javascript.
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
ClariNet
A Pytorch Implementation of ClariNet
Concatenate_wav
Concatenate wavs(for unit selection)
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
sunxh16's Repositories
sunxh16/beaqlejs
*BeaqleJS* provides a framework to create browser based listening tests and is purely based on open web standards like HTML5 and Javascript.
sunxh16/iaf
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
sunxh16/json-c
https://github.com/json-c/json-c is the official code repository for json-c. See the wiki for release tarballs for download.
sunxh16/Learn_Python
Learn_Python
sunxh16/P_wavenet_vocoder
working on parallel wavenet
sunxh16/Parallel-Wavenet
It is a Tutorial, not a complete implement
sunxh16/pixel-cnn
Python3 / Tensorflow implementation of PixelCNN++, as described in "PixelCNN++: A PixelCNN Implementation with Discretized Logistic Mixture Likelihood and Other Modifications"
sunxh16/speech-to-text-wavenet
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
sunxh16/tacotron2_v2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
sunxh16/tacotron_asr
Speech Recognition Using Tacotron
sunxh16/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
sunxh16/wavenet_vocoder
WaveNet vocoder