List of repositories relevant to VITS

The author does not understand anything about machine learning and this text may contain many errors. If the code is publicly available, the Github link shall be attached. I am sure there are many more great repositories not listed here. Sorry I didn't have time.

Original

jaywalnut310/vits: VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Modified

SoftVC

Replacing VITS' TextEncoder with HuBERT's ContentEncoder eliminates the need for inputting phoneme sequences (i.e., eliminate language dependence). HuBERT is part of SoftVC.

iSTFT (inverse short-time Fourier transform)

Performance is improved by improving the decoder, which was the bottleneck, with multiband generation and inverse short-time Fourier transform.

Other Improvements

Other Languages

Refactored

Because refactoring takes time, the latest technologies are not always adopted in theses repositories. However, these should be made easier to use.

34j/awesome-vits

List of repositories relevant to VITS

Original

Modified

SoftVC

iSTFT (inverse short-time Fourier transform)

Other Improvements

Other Languages

Refactored

Others

GUIs and pre-trained models

Integration with LLM

Articles, Awesome Lists, News