Pinned Repositories
2022MCM-C-problem
2022美赛C题(MCM/ICM)F奖源码数据公开
ControlSpeech
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec
Design-compiler
吉林大学编译原理课程设计,基于SNL语言完成词法分析,语法分析程序。
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
Nucleic-acid-detection-system
吉林大学软件工程软构件与中间件课设
SocketFTP
吉林大学计算机网络课设(实现FTP文件传输系统)
TextrolSpeech
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models (2024 ICASSP)
WavChat
A Survey of Spoken Dialogue Models (60 pages)
WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
jishengpeng's Repositories
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
jishengpeng/WavChat
A Survey of Spoken Dialogue Models (60 pages)
jishengpeng/Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
jishengpeng/ControlSpeech
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec
jishengpeng/TextrolSpeech
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models (2024 ICASSP)
jishengpeng/2022MCM-C-problem
2022美赛C题(MCM/ICM)F奖源码数据公开
jishengpeng/SocketFTP
吉林大学计算机网络课设(实现FTP文件传输系统)
jishengpeng/Nucleic-acid-detection-system
吉林大学软件工程软构件与中间件课设
jishengpeng/Design-compiler
吉林大学编译原理课程设计,基于SNL语言完成词法分析,语法分析程序。
jishengpeng/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
jishengpeng/jishengpeng.github.io
A Modern and Responsive Academic Personal Homepage
jishengpeng/libri-light
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
jishengpeng/mini-omni
open-source multimodel large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
jishengpeng/NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
jishengpeng/OpenVoice
Instant voice cloning by MyShell.
jishengpeng/parler-tts
Inference and training library for high-quality TTS models.