biggytruck/SpeechSplit2

About F0 and VTLP

SeptemberN opened this issue · 1 comments

I noticed that during training, when extracting the content, the data is processed for f0 and then the tones are processed using VTLP, but in demo.ipynb only f0 is processed and no VTLP is used, why is this? Won't this affect the conversion?