VoiceAssistant-400K

Question

VoiceAssistant-400K

Closed this issue 2 months ago · 3 comments

Thanks for your great work! Any plan to open the data production process of VoiceAssistant-400K?

Answer 1 · 2024-10-22T11:33:17.000Z

Hi, for the data processing, we use cosyvoice for TTS (input audio) and SNAC for output audio encoding, you may refer to these projects for more info.

Answer 2 · 2024-10-22T11:40:00.000Z

thanks for your reply! I don't mean the production of the audio, but the production of these single or multi-turn conversations?

Answer 3 · 2024-10-24T14:12:28.000Z

we use open-source datasets, you may refer to tech report the dataset sources.