seungwonpark/melgan

Wrong implementation of Generator

seungwonpark opened this issue · 3 comments

The last layer should be:

nn.utils.weight_norm(nn.Conv1d(32, 1, kernel_size=7, stride=1, padding=3)),

not:

nn.utils.weight_norm(nn.ConvTranspose1d(32, 1, kernel_size=7, stride=1, padding=3)),

omg...

Working on this at fix/22 branch.

I'll be training the model from scratch and upload audio samples again. I wonder why the previous model was working well.

As far as why it worked so well, this article trains a CNN image classifier with Conv2d and ConvTranspose2d using the same hyperparameters for both and gets almost identical results.

The new version looks working well. I will merge it to master branch after the pre-trained model is ready. Also, will update the audio samples.