A question about codebook diversity
Dear authors, I'm looking into the proposed paper, and I have a question about the loss term that encourages codebook diversity. I notice that the green curve in Figure 2b remains above zero the whole time. I assume this loss term pushes the codes in a codebook to be as orthogonal to each other as possible, and should therefore be minimized toward 0 (please correct me if I'm wrong).
My question is: do you apply any operation that constrains the codes in the codebook to be vectors containing only non-negative values?
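For concreteness, this is the kind of diversity term I have in mind, written as a minimal PyTorch sketch; the function name and the exact normalization are my own assumptions, not necessarily what the paper implements:

```python
import torch
import torch.nn.functional as F

def diversity_loss(codebook: torch.Tensor) -> torch.Tensor:
    """Mean pairwise cosine similarity between codes (hypothetical form).

    codebook: (K, D) tensor of K code vectors of dimension D.
    Minimizing this pushes the codes toward mutual orthogonality; the value
    can only drop below zero if some pairs of codes have negative inner products.
    """
    codes = F.normalize(codebook, dim=-1)              # unit-norm codes
    sim = codes @ codes.t()                            # (K, K) cosine similarities
    K = codebook.size(0)
    off_diag = sim - torch.eye(K, device=sim.device)   # remove self-similarity on the diagonal
    return off_diag.sum() / (K * (K - 1))              # average over ordered pairs
```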
Appreciate your answer in advance:)
Hi @xy9485 ,
Thank you for your question! We did not explore operations to produce only non-negative vectors.
Similar to the commonly used L2 regularization, the diversity regularization loss empirically stays above 0, balancing against the other learning objectives (e.g., contrastive learning) to reach an overall optimum.
Hi @gimpong
Thank you for your answer. But the commonly used L2 norm, when used as a regularization term, is guaranteed to be non-negative, whereas the diversity regularization isn't, correct?
Or do you mean that the codes in the codebook tend to remain non-negative empirically during training? That seems possible if the input features for VQ come from a previous layer with a ReLU activation.
Yes, I think you are right. Maybe the ReLU leads to non-negative codes in the codebook.
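As a quick sanity check under the pairwise-cosine formulation sketched above (again, just an illustrative form, not necessarily our exact implementation): if every code entry is non-negative, all pairwise similarities are non-negative, so their mean cannot drop below zero.

```python
import torch
import torch.nn.functional as F

# Illustrative check: non-negative (ReLU-like) codes give non-negative
# pairwise cosine similarities, so a mean-similarity diversity loss stays >= 0.
torch.manual_seed(0)
codes = torch.relu(torch.randn(64, 128))          # non-negative "codebook"
unit = F.normalize(codes, dim=-1)
sim = unit @ unit.t()
off_diag = sim[~torch.eye(64, dtype=torch.bool)]  # off-diagonal entries only
print(off_diag.min().item() >= 0)                 # prints True
```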
I also notice that you used the entropy of the quantization as an additional regularization term, combined with the codebook diversity term. How much does it help empirically, if at all?
In my experiments, the entropy regularization term made little difference to the performance. It is OK to remove this term.
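For reference, the entropy term being discussed can be sketched as the entropy of the batch-averaged code-assignment distribution; the soft assignment via softmax and the exact aggregation below are assumptions for illustration, not necessarily the paper's implementation:

```python
import torch
import torch.nn.functional as F

def quantization_entropy(logits: torch.Tensor) -> torch.Tensor:
    """Entropy of the batch-averaged code-assignment distribution.

    logits: (N, K) similarity scores between N inputs and K codes.
    A higher value means the codes are used more uniformly; as a regularizer,
    its negative is typically added to the total loss to encourage uniform usage.
    """
    probs = F.softmax(logits, dim=-1).mean(dim=0)    # average code usage over the batch
    return -(probs * torch.log(probs + 1e-8)).sum()  # Shannon entropy
```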
That's useful to know. Thanks again for your responses :)
It's my pleasure. :)