vlomme/Multi-Tacotron-Voice-Cloning

Training encoder

dtanalytic opened this issue · 2 comments

Thanks for work! Help me to train an encoder. How is it possible to add new custom voices to train datasets, or only fixed (like LibriSpeech: train-other-500, VoxCeleb1...) are available through the interface of commands:
python encoder_preprocess.py <datasets_root>
and
python encoder_train.py my_run <datasets_root>/SV2TTS/encoder

If possible, than how i should keep files, in root data directory or subfolders, in what formats? I tried to add my voice to subfolder but got an error like:

"Python encoder_preprocess.py data
Arguments:
datasets_root: data
out_dir: data/SV2TTS/encoder
datasets: ['preprocess_voxforge']
skip_existing: False

Preprocessing preprocess_voxforge
Couldn't find data/book, skipping this dataset"

I looked at the source and found that there are fixed funcs that preprocess different formats of train data (like preprocess22,preprocess44...) What do they mean? Maybe i should use one of them?
Thank you.

each dataset structure needs its own code. I have written functions for three options