Any plans to push the source code? Really looking forward to it.
RahulBhalley opened this issue · 7 comments
RahulBhalley commented
Any plans to push the source code? Really looking forward to it.
rosinality commented
I am currently implementing this. 😅 I will take some time for implementation and experiments.
RahulBhalley commented
Sure, no issues. :) It would really help me also implement it on my end if you could please share some references repositories that you're probably looking at right now to implement NANSY.
Regards
Rahul Bhalley
rosinality commented
Hmm, I'm currently trying to implement it from the paper. librosa/torchaudio/parselmouth was helpful.
RahulBhalley commented
Ok, thanks!
RahulBhalley commented
Hi @rosinality any updates on this?
rosinality commented
I have almost implemented the model, but I found aligning features which have different sampling ratios/window sizes (wav2vec features vs mel, yingram features) is not straightforward. Currently I am stopped at that point.
RahulBhalley commented
Ok, thanks for updating me. :)