Questions in reasoning

Question

Questions in reasoning

Closed this issue a year ago · 1 comments

2018161062Jiseong commented a year ago

When I proceed with inference using the model I have trained, it seems that there are a lot of voices of the original sound source left. Can I increase the ratio of the voices I have trained? (Same function as add_noise_step of diff-svc model)

Answer 1 · 2023-05-16T12:43:11.000Z

You don't describe too much details, but generally contentvec768l12 encoder will have less timbre leakage.