hcwu1993

Pinned Repositories

Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python
awesome-knowledge-distillation
Awesome Knowledge Distillation
ChatTTS
TTS
Language:Jupyter Notebook
CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Language:Python
DCGAN-LSGAN-WGAN-GP-DRAGAN-Tensorflow-2
DCGAN LSGAN WGAN-GP DRAGAN Tensorflow 2
Language:Python
deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Language:Python
encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python
forced-alignment-tools
A collection of links and notes on forced alignment tools
Language:Python
GenshinAudio
All audio extracted from Genshin Impact, music, voicelines and everything else
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language:Python

hcwu1993's Repositories

hcwu1993/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language:Python
hcwu1993/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python
hcwu1993/awesome-knowledge-distillation
Awesome Knowledge Distillation
hcwu1993/ChatTTS
TTS
Language:Jupyter Notebook
hcwu1993/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Language:Python
hcwu1993/DCGAN-LSGAN-WGAN-GP-DRAGAN-Tensorflow-2
DCGAN LSGAN WGAN-GP DRAGAN Tensorflow 2
Language:Python
hcwu1993/deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Language:Python
hcwu1993/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Language:Python
hcwu1993/forced-alignment-tools
A collection of links and notes on forced alignment tools
Language:Python
hcwu1993/GenshinAudio
All audio extracted from Genshin Impact, music, voicelines and everything else
hcwu1993/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python
hcwu1993/hello-world
Beginning of GitHub
hcwu1993/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python
hcwu1993/llama2.c
Inference Llama 2 in one file of pure C
Language:C
hcwu1993/merlin
This is now the official location of the Merlin project.
Language:Python
hcwu1993/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
Language:Python
hcwu1993/pandas-cookbook
Recipes for using Python's pandas library
Language:Jupyter Notebook
hcwu1993/parler-tts
Inference and training library for high-quality TTS models.
hcwu1993/parrot
RNN-based generative models for speech.
Language:Python
hcwu1993/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
hcwu1993/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++
hcwu1993/TensorFlow-Examples
TensorFlow Tutorial and Examples for beginners
Language:Jupyter Notebook
hcwu1993/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
Language:Python
hcwu1993/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python
hcwu1993/video-subtitle-extractor
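Extracts hard-coded subtitles from videos and generates srt files. No third-party API is needed; text recognition runs locally. A deep learning-based video subtitle extraction framework covering subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.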
hcwu1993/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python
hcwu1993/waveglow
A PyTorch implementation of WaveGlow: A Flow-based Generative Network for Speech Synthesis
Language:Python
hcwu1993/wavenet_vocoder
WaveNet vocoder
Language:Python
hcwu1993/wechat_jump_game
A Python helper for the WeChat mini-game 跳一跳 (Jump Jump)
Language:Python