sunxh16

Pinned Repositories

AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language: Python
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python
beaqlejs
*BeaqleJS* provides a framework to create browser-based listening tests and is based purely on open web standards such as HTML5 and JavaScript.
Language: JavaScript
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
Language: TeX
ClariNet
A PyTorch implementation of ClariNet
Language: Python
Concatenate_wav
Concatenate WAV files (for unit selection)
Language: C++
CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Language: Python
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language: Python
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language: Python
FloWaveNet
A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Language: Python

sunxh16's Repositories

sunxh16/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language: Python
sunxh16/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python
sunxh16/book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
Language: TeX
sunxh16/ClariNet
A PyTorch implementation of ClariNet
Language: Python
sunxh16/Concatenate_wav
Concatenate WAV files (for unit selection)
Language: C++
sunxh16/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Language: Python
sunxh16/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language: Python
sunxh16/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language: Python
sunxh16/FloWaveNet
A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Language: Python
sunxh16/GPT-SoVITS
One minute of voice data can be enough to train a good TTS model (few-shot voice cloning).
Language: Python
sunxh16/NeuralVoicePuppetry
This GitHub repository contains the network architectures of NeuralVoicePuppetry.
Language: Python
sunxh16/NNPACK
Acceleration package for neural networks on multi-core CPUs
Language: C
sunxh16/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
Language: Python
sunxh16/onnxruntime
ONNX Runtime
Language: C++
sunxh16/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch
Language: Jupyter Notebook
sunxh16/Python-Wrapper-for-World-Vocoder
A Python wrapper for the high-quality vocoder "World"
Language: Python
sunxh16/rigl
End-to-end training of sparse deep neural networks with little-to-no performance loss.
Language: Python
sunxh16/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
Language: Python
sunxh16/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language: Python
sunxh16/sp2si-code
Contains code for our work on speech to singing conversion (ICASSP 2020)
Language: Python
sunxh16/SqueezeWave
Language: Python
sunxh16/tacotron2_v1
DeepMind's Tacotron-2 TensorFlow implementation
Language: Python
sunxh16/tacotron2_v2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language: Jupyter Notebook
sunxh16/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python
sunxh16/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python
sunxh16/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language: Python
sunxh16/voice_conversion
Language: Python
sunxh16/wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
Language: C++
sunxh16/waveglow
A Flow-based Generative Network for Speech Synthesis
Language: Python
sunxh16/World
A high-quality speech analysis, manipulation and synthesis system
Language: C++