Cluster-NMS

Question

Cluster-NMS

buttercutter opened this issue 4 years ago · 7 comments

buttercutter commented 4 years ago

I am trying to understand Cluster-NMS operations.

The mathematical proof seems a bit complicated to follow and comprehend.

Why C1 does not change values ? In other words, why C1 == X ?
How to obtain b1 ?
Why is it Cn = E x X instead of Cn = E x Cn-1 ?

Answer 1 · 2020-12-01T18:49:32.000Z

Matrix C will change at every iteration unless vector b is unchanged.
Vector b is obtained by calculating the column wise maximum on the matrix C and then binarizing. So b=(b1,b2,...,bn) is a 0,1 vector, where 1 denotes preservation and 0 denotes suppression.
Vector b indicates the suppression results of NMS under a certain iteration. So, by left multipling a diagonal matrix E, it is equivalent to do row transformation on the matrix X. This will ignore those current suppressed boxes so that they will not have any effects on the other boxes. (note that X is original IoU matrix.)

Finally, we will get exactly the same results to Original NMS as long as vector b does not change any more.

Answer 2 · 2020-12-10T15:10:38.000Z

This will ignore those current suppressed boxes so that they will not have any effects on the other boxes.

How exactly does left multiplying diagonal matrix E achieve this ?

Answer 3 · 2020-12-11T11:18:53.000Z

For example, let b=[1 0 0 1 0].

In our paper, the matrix

then do E×X.

In practice, we use

then do element-wise multiplication with the upper triangular IoU matrix X.

Answer 4 · 2020-12-12T16:00:29.000Z

Why the extra 1 inside the matrix in practice ?

and how do all those iterations converge to the original NMS result ?

Answer 5 · 2020-12-13T03:29:44.000Z

A diagonal matrix left multiplies another is equivalent to do row transformation (by Higher Algebra). So in practice, I replace it with element-wise multiplication for simplicity. Because it's faster than matrix multiplication. As for why the result of Cluster-NMS is equal to that of Original NMS, a simple case is provided here https://github.com/Zzh-tju/CIoU#description-of-cluster-nms-and-its-usage

For mathematics detail, kindly refer to our paper.

Answer 6 · 2020-12-13T04:31:59.000Z

So in practice, I replace it with element-wise multiplication for simplicity. Because it's faster than matrix multiplication.

I may had missed something, but how is this (matrix in practice) being element-wise multiplication compared to the matrix given in the paper ?

Answer 7 · 2020-12-13T07:07:33.000Z

https://github.com/Zzh-tju/CIoU/blob/master/layers/functions/detection.py#L154-L155