jishengpeng/Languagecodec

about reconstruction

Opened this issue · 1 comments

How can I use the first VQ to reconstructe wav by languagecodec

The dimension of first 3 layers's code embedding is 128 // 3, which means that each of them cannot be used to reconstruct the audio soly. You can use all of first 3 layers' codes to reconstruct the audio.