Diff-SVC Model and information for "Tiger"

Apache License 2.0



🐯☄️ Tiger AI for Diff-SVC 🐯☄️

Sample

v2 - Bloody Mary (Sped-Up)

v1 - Future Love

About

This is all the information about Tiger's Diff-SVC model in one spot. I'll be updating the information as I update the model.

For the sake of convenience, every model from v2 onward will be included in a Colab notebook, along with any other singers I create.

| Version | Sample Rate | Data | Steps Trained | Pretrained Used | Inference |
| --- | --- | --- | --- | --- | --- |
| v1 | 22050 Hz | 30 minutes | 55,000 | Yes | Download |
| v2 | 44100 Hz | 120 minutes | 190,000 | No | Notebook |

v1

There's about 30 minutes of data in English and Japanese. This data wasn't intended for Diff-SVC, but it works perfectly fine for it. There are plans to add more languages, and to document more precisely how much data there is for each language.

v2

Tiger v2 has 2 hours of data covering a much wider range. Along with being trained at 44.1 kHz and using a much higher quality vocoder, his pitch range, phonetic coverage, and overall quality have gone way up. Below are some statistics on the amount of data and the distribution of languages in his current dataset.

| Language | Minutes | # of Songs |
| --- | --- | --- |
| English | 58 | 34 |
| Japanese | 41 | 15 |
| Chinese | 8 | 3 |
| Korean | 5 | 2 |
| PT-BR | 5 | 2 |
| Spanish | 1 | 1 |
| French | 2 | 1 |
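
For a quick sense of how the v2 data is spread across languages, here's a small Python sketch that totals the table above and works out each language's share of the minutes. The numbers are copied straight from the table; the names `DATASET` and `language_shares` are just made up for this example and aren't part of Diff-SVC or the notebook.

```python
# Minutes and song counts per language, copied from the v2 table above.
# The dictionary layout is only an illustration, not a Diff-SVC format.
DATASET = {
    "English": (58, 34),
    "Japanese": (41, 15),
    "Chinese": (8, 3),
    "Korean": (5, 2),
    "PT-BR": (5, 2),
    "Spanish": (1, 1),
    "French": (2, 1),
}


def language_shares(dataset):
    """Return each language's fraction of the total minutes."""
    total = sum(minutes for minutes, _songs in dataset.values())
    return {lang: minutes / total for lang, (minutes, _songs) in dataset.items()}


total_minutes = sum(minutes for minutes, _songs in DATASET.values())
total_songs = sum(songs for _minutes, songs in DATASET.values())
shares = language_shares(DATASET)

if __name__ == "__main__":
    print(f"Total: {total_minutes} minutes across {total_songs} songs")
    for lang, share in sorted(shares.items(), key=lambda item: -item[1]):
        print(f"{lang:<9} {share:6.1%}")
```

The per-language minutes add up to the 120 minutes listed for v2, with English and Japanese together making up most of the dataset.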

Misc.

All artwork on this page is by Aquiboni