Pinned Repositories
chinese-hubert-soft
dub_genius
基于GPT-SoVITS的视频剪辑快捷配音工具
DupImageDetection
海量图片去重算法-局部分块Hash算法
gpt-vits
text to speech using decoder-only transformer and VITS
hifigan-yingram-vc
vc
LinearityIQA
Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment, Accepted by ACM MM 2020
McHuo
A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes
natsume
A Japanese text frontend processing toolkit
RAFT-Softsplat-VFI
Video Frame Interpolation (RAFT + Softsplat)
splinter21's Repositories
splinter21/gpt-vits
text to speech using decoder-only transformer and VITS
splinter21/McHuo
A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes
splinter21/ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
splinter21/anime-character-extract
one-shot Character Extraction From Anime Video With MultiModal Method
splinter21/APNet2
Source code of APNet2, a vocoder
splinter21/Asaritsu4Diffsinger
A mult-languages(CN/JP/EN) singing database for Diffsinger(OpenVPI).
splinter21/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
splinter21/descript-audio-vae
VAE GAN modified from Descript Audio Codec, which replaces the RVQ with VAE
splinter21/docs
docs
splinter21/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
splinter21/fish-speech
splinter21/FreeTalker
splinter21/GlotNet
splinter21/kanbun-dataset
Classical Chinese-Classical Japanese Parallel Corpus
splinter21/LLVC
splinter21/NMT-p2g
splinter21/pesto
Self-supervised learning for fast pitch estimation
splinter21/pesto-full
Full models and training code for PESTO
splinter21/PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
splinter21/ScaleCrafter
Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
splinter21/SourceFilterNeuralFormants
splinter21/Speech2Lip
splinter21/StarRail_Voice_Downloader
星穹铁道语音下载
splinter21/svd-temporal-controlnet
splinter21/ttts
splinter21/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
splinter21/xtts-finetune-webui
Slightly improved official version for finetune xtts
splinter21/xtts-webui
Webui for using XTTS and for finetuning it
splinter21/YOLOv8-anime-hands
splinter21/zvc
A lightweight vector-search based AI voice conversion system