GuangChen2016 opened this issue 3 years ago · 0 comments
Some of the synthesized results (about 3% of utterances) have artifacts (noise). In detail, the mel-spectrum is discontinuous in the corresponding areas, as shown below:
Any suggestions to improve this?