DanRuta/xVA-Synth

Add audio streaming support

Pendrokar opened this issue · 2 comments

@DanRuta had mentioned that v0.12 of TorchAudio has streaming capabilities. If TorchAudio was xVASynth's was updated, perhaps it would easily allow implementing audio streaming?
https://pytorch.org/blog/pytorch-1.12-new-library-releases/#beta-streaming-api

[Edit] Misinterpreted, while xVASynth does install TorchAudio, it is only used for mel spectrogram representation

What would be the use case of this?

What would be the use case of this?

This would allow real-time generation of audio for Text-To-Speech and Speech-To-Speech (STS). STS also known as Voice Conversion. Playback of incomplete files depends on the use audio player or game audio engine. So while xVASynth is currently most used by Skyrim's Mantella mod, Skyrim itself is not able to playback incomplete files.