Replicating the bits / dim results in the original VQ-VAE paper
pclucas14 opened this issue · 7 comments
Hi,
thanks for your clean implementation ! I was wondering, have you ever tried to calculate the bits / dimension metric (as in the original paper) ? I've tried to do so using the provided code, and I'm still quite far from the results in the paper. I was hoping maybe you would have some insight to share as to why that is the case.
Thanks!
Lucas
@pclucas14 I haven't tried to do that but plan to... give me a few days to figure it out... I am trying to compile a table of bits/dim for MNIST/ Fashion-MNIST so I might as well start with this
great! let me know what you end up getting :)
Thanks! Here is where I am collecting the results: zalandoresearch/fashion-mnist#144
great! it's a very complete list, pretty cool :D
The best I got for VQ-VAE on CIFAR-10 was 5.00 BPD (using a unigram prior). This is still far from the results in the VQ-VAE paper. Hopefully you can manage to close the gap even more :)
Wonder if there is code for proper bpd computation? If so, can you please share?