splinter21

Pinned Repositories

chinese-hubert-soft
Language:Python1 0 00
dub_genius
基于GPT-SoVITS的视频剪辑快捷配音工具
Language:Python1 0 00
DupImageDetection
海量图片去重算法-局部分块Hash算法
Language:Python2 1 00
gpt-vits
text to speech using decoder-only transformer and VITS
Language:Python1 1 00
hifigan-yingram-vc
vc
Language:Jupyter Notebook2 0 03
LinearityIQA
Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment, Accepted by ACM MM 2020
Language:Python3 1 00
McHuo
A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes
1 0 01
natsume
A Japanese text frontend processing toolkit
Language:C++2 0 02
RAFT-Softsplat-VFI
Video Frame Interpolation (RAFT + Softsplat)
Language:Python4 1 00

splinter21's Repositories

splinter21/gpt-vits
text to speech using decoder-only transformer and VITS
Language:Python1 1 00
splinter21/McHuo
A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes
1 0 01
splinter21/ai-audio-datasets-list
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
0 0
splinter21/anime-character-extract
one-shot Character Extraction From Anime Video With MultiModal Method
Language:Python0 0
splinter21/APNet2
Source code of APNet2, a vocoder
Language:Python0 0
splinter21/Asaritsu4Diffsinger
A mult-languages(CN/JP/EN) singing database for Diffsinger(OpenVPI).
0 0
splinter21/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Language:Python0 0
splinter21/descript-audio-vae
VAE GAN modified from Descript Audio Codec, which replaces the RVQ with VAE
Language:Python0 0
splinter21/docs
docs
Language:HTML0 0
splinter21/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language:Python0 0
splinter21/fish-speech
Language:Python0 0
splinter21/FreeTalker
Language:Python0 0
splinter21/GlotNet
Language:Python0 0
splinter21/kanbun-dataset
Classical Chinese-Classical Japanese Parallel Corpus
0 0
splinter21/LLVC
Language:Python0 0
splinter21/NMT-p2g
Language:Python0 0
splinter21/pesto
Self-supervised learning for fast pitch estimation
Language:Python0 0
splinter21/pesto-full
Full models and training code for PESTO
Language:Python0 0
splinter21/PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
Language:Python0 0
splinter21/ScaleCrafter
Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
Language:Python0 0
splinter21/SourceFilterNeuralFormants
Language:Python0 0
splinter21/Speech2Lip
Language:Python0 0
splinter21/StarRail_Voice_Downloader
星穹铁道语音下载
Language:Python0 0
splinter21/svd-temporal-controlnet
Language:Python0 0
splinter21/ttts
Language:Python0 0
splinter21/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
Language:Jupyter Notebook0 0
splinter21/xtts-finetune-webui
Slightly improved official version for finetune xtts
splinter21/xtts-webui
Webui for using XTTS and for finetuning it
splinter21/YOLOv8-anime-hands
0 0
splinter21/zvc
A lightweight vector-search based AI voice conversion system
Language:Python0 0