Pinned Repositories
chinese-hubert-soft
dub_genius
A quick dubbing tool for video editing, based on GPT-SoVITS
DupImageDetection
Large-scale image deduplication algorithm: local block hashing
gpt-vits
Text-to-speech using a decoder-only transformer and VITS
hifigan-yingram-vc
vc
LinearityIQA
Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment, Accepted by ACM MM 2020
McHuo
A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes
natsume
A Japanese text frontend processing toolkit
RAFT-Softsplat-VFI
Video Frame Interpolation (RAFT + Softsplat)
splinter21's Repositories
splinter21/StarRail_Datasets
StarRail Datasets For SVC/SVS/TTS
splinter21/whisper-phoneme-asr
splinter21/ACE_phonemes
A guide to grapheme-to-phoneme conversion and the phoneme list for the ACE singing voice synthesis engine
splinter21/bert-vits
VITS with BERT
splinter21/CharsiuG2P
Multilingual G2P in 100 languages
splinter21/Chat-Haruhi-Suzumiya
Chat-Haruhi-Suzumiya, a chatbot that imitates anime-style character dialogue, developed by Li Lulu, Leng Ziang, and other students.
splinter21/ChatGLM-Efficient-Tuning
Efficient fine-tuning of ChatGLM-6B with PEFT
splinter21/ChatNVL-Towards-Visual-Novel-ChatBot
splinter21/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Forked and maintained by the OpenVPI community
splinter21/JK-VITS
Bilingual-TTS (Japanese and Korean)
splinter21/latent-voice-conversion
splinter21/libf0
A Python Library for Fundamental Frequency Estimation in Music Recordings
splinter21/LLaMA-Efficient-Tuning
Fine-tuning LLaMA with PEFT (PT+SFT+RLHF with QLoRA)
splinter21/MakeDiffSinger
Pipelines and tools to build your own DiffSinger dataset.
splinter21/Mangio-RVC-Fork
Mangio-RVC-Fork
splinter21/MoeMusicTranscription
An automatic music transcription application
splinter21/NSF-BigVGAN
BigVGAN with Neural Source-Filter
splinter21/reducenet
splinter21/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
splinter21/RMVPE
splinter21/so-vits-svc
SoftVC VITS Singing Voice Conversion
splinter21/SoundStorm
The reproduced code for Google's SoundStorm
splinter21/StableDiffusionWebUIScan
Simple Stable Diffusion WebUI Scanner (for seeta cloud)
splinter21/storm
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
splinter21/sukasuka-vocal-dataset-builder
SukaSuka anime vocal dataset, the 1st anime vocal dataset. Extract audio (vocal) files from video based on .ass subtitle files, then manually label the vocal files by character. Will be used for PITS/VITS/Diffusion text-to-speech/SVC.
splinter21/uvq
splinter21/VocalForge
Your one-stop solution for voice dataset creation
splinter21/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
splinter21/XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
splinter21/yinglish
[yinglish] A lewd-speech translator!