Pinned Repositories
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
beaqlejs
*BeaqleJS* provides a framework for creating browser-based listening tests and is built purely on open web standards such as HTML5 and JavaScript.
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
ClariNet
A PyTorch implementation of ClariNet
Concatenate_wav
Concatenate WAV files (for unit selection)
CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
FloWaveNet
A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
GPT-SoVITS
As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).
sunxh16's Repositories
sunxh16 doesn’t have any repositories yet.