Gabibing
웨이브덱(waveDeck) CEO / Artificial Intelligence Online Competition 1st place, by the Ministry of Science and Technology, South Korea
waveDeck Corp.Seoul, South Korea
Gabibing's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
yeemachine/kalidokit
Blendshape and kinematics calculator for Mediapipe/Tensorflow.js Face, Eyes, Pose, and Finger tracking models.
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
prophesier/diff-svc
Singing Voice Conversion via diffusion model
xianfei/SysMocap
A real-time motion capture system for 3D virtual character animating.
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
pixiv/three-vrm
Use VRM on Three.js
sh-lee-prml/HierSpeechpp
The official implementation of HierSpeech++
yourtablecloth/TableCloth
식탁보 프로젝트
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
huawei-noah/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
kjsman/stable-diffusion-pytorch
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
Beomi/BitNet-Transformers
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture
Kyubyong/g2pK
g2pK: g2p module for Korean
AminRezaei0x443/memory-efficient-attention
Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch
revsic/torch-nansypp
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis
sh-lee-prml/BigVGAN
Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
scarletcho/KoG2P
Korean grapheme-to-phone conversion in Python
jonghwanhyeon/python-mecab-ko
A python binding for mecab-ko
lucidrains/NWT-pytorch
Implementation of NWT, audio-to-video generation, in Pytorch
MWM-io/nansypp
Unofficial implementation of NANSY++ in Pytorch Lightning
imgai-newbey/ds
ds