Differences with encodec?
HeJinLing opened this issue · 1 comment
HeJinLing commented
May I ask what the differences are between the time-domain FunCodec and Encodec?
ZhihaoDU commented
Question: What is the difference between FunCodec and Encodec for time-domain models?
Response: Thanks for your question. The differences are as follows:
- We add a power-spectrum loss to improve quality in the mid-frequency range, as shown in Table 3 of the paper https://arxiv.org/abs/2309.07405.
- We provide low and normal frame-rate models trained on both LibriTTS and internal industrial datasets. As a result, our low frame-rate model achieves SOTA ViSQOL scores on commonly used datasets, including AiShell, Librispeech, Wenet, GigaSpeech and so on.
- FunCodec, as a toolkit, provides full training and inference capability, while Encodec only releases a model and an inference recipe.
- FunCodec, as a toolkit, provides a superset of the features of other open-source toolkits, including distributed training, batched inference, more discriminators and frequency-domain models.
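For readers unfamiliar with the idea behind the first point, here is a minimal, generic sketch of a power-spectrum loss. It is not FunCodec's implementation: the function names, the frame and hop sizes, the Hann window, the log compression and the L1 distance are all assumptions chosen for illustration, and the exact formulation should be taken from the paper. It uses a naive DFT from the standard library so that it runs without dependencies; a real training setup would use a differentiable STFT in a deep-learning framework.

```python
import cmath
import math


def power_spectrum(frame):
    """Return |DFT(frame)|^2 for the non-negative frequency bins (naive O(n^2) DFT)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        spectrum.append(abs(coeff) ** 2)
    return spectrum


def power_spectrum_loss(reference, reconstructed, frame_size=64, hop=32, eps=1e-8):
    """Mean L1 distance between log power spectra over Hann-windowed frames.

    Assumption: log compression and L1 distance are illustrative choices,
    not taken from the FunCodec paper or code.
    """
    if len(reference) != len(reconstructed):
        raise ValueError("signals must have the same length")
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_size) for i in range(frame_size)]
    total, count = 0.0, 0
    for start in range(0, len(reference) - frame_size + 1, hop):
        ref = power_spectrum([w * x for w, x in zip(window, reference[start:start + frame_size])])
        rec = power_spectrum([w * x for w, x in zip(window, reconstructed[start:start + frame_size])])
        # Compare the two frames bin by bin in the log-power domain.
        total += sum(abs(math.log(a + eps) - math.log(b + eps)) for a, b in zip(ref, rec))
        count += len(ref)
    if count == 0:
        raise ValueError("signals are shorter than one frame")
    return total / count


# Usage: a 100 Hz sine sampled at 1 kHz, and a copy attenuated by half.
sample_rate = 1000
clean = [math.sin(2 * math.pi * 100 * t / sample_rate) for t in range(256)]
damped = [0.5 * x for x in clean]
print(power_spectrum_loss(clean, clean))   # identical signals give 0.0
print(power_spectrum_loss(clean, damped))  # attenuation gives a positive loss
```

A loss of this kind penalizes mismatches in per-frequency energy directly, which is the general motivation for adding it alongside time-domain reconstruction losses.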
BTW, codec-based text-to-speech and music models will be released soon.