Batch normalization
Hello, Chelsea.
The batch normalization documentation says that these update ops are not attached to the TensorFlow graph by default. So there are two ways to force the updates during training (both sketched below):
- explicitly tell the graph to run the update ops in `tf.GraphKeys.UPDATE_OPS`, or
- set the `updates_collections` parameter of `batch_norm` to `None`.
I don't see either of those in the code. Maybe I'm missing something.
I haven't been able to make the first way work due to the while loop inside the `map_fn` function. But the second modification is easy and seems to work, although I'm not sure I see any difference in performance.
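For concreteness, here is a minimal sketch of both options against the `tf.contrib.layers.batch_norm` API (TF1-style; `inputs`, `labels`, and the small network are hypothetical stand-ins, not code from this repo):

```python
import tensorflow as tf

inputs = tf.placeholder(tf.float32, [None, 128])  # hypothetical input
labels = tf.placeholder(tf.float32, [None, 10])   # hypothetical labels

# Option 1: the update ops land in tf.GraphKeys.UPDATE_OPS by default,
# so make the train op depend on them explicitly.
net = tf.contrib.layers.batch_norm(inputs, is_training=True)
logits = tf.contrib.layers.fully_connected(net, 10, activation_fn=None)
loss = tf.losses.softmax_cross_entropy(labels, logits)
update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
with tf.control_dependencies(update_ops):
    train_op = tf.train.AdamOptimizer(1e-3).minimize(loss)

# Option 2: updates_collections=None forces the statistics to be
# updated in place as part of the forward pass (no dependency needed).
net2 = tf.contrib.layers.batch_norm(inputs, is_training=True,
                                    updates_collections=None)
```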
I compute the test-time statistics using the test batch of data, instead of computing the average training statistics. This doesn't require keeping track of batch norm training statistics. [Note that train is always set to True when calling the batch_norm function, which means that TensorFlow will compute the statistics using the current batch.]
It's possible that it would work better by using training batch statistics, but I haven't tried it.
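A sketch of what the approach above amounts to, assuming the repo's `train` flag is forwarded to `is_training` in `tf.contrib.layers.batch_norm` (`inputs` is a hypothetical stand-in):

```python
# With is_training=True at both train and test time, batch_norm
# normalizes with the current batch's mean and variance, so the moving
# averages are never read and never need to be updated.
normed = tf.contrib.layers.batch_norm(inputs, is_training=True)
```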
It's not the issue I'm talking about. See the Note on the `tf.contrib.layers.batch_norm` page:
> Note: when training, the moving_mean and moving_variance need to be updated. By default the update ops are placed in tf.GraphKeys.UPDATE_OPS, so they need to be added as a dependency to the train_op.
In other words, without the steps I wrote in the issue, moving_mean and moving_variance don't update at all (even during training). Again, maybe I'm missing some other way you're updating them.
You only need to update moving_mean and moving_variance if you use them. In this case, the batch norm statistics are being computed using the batch data rather than a moving average of the statistics (so they don't need to be updated).
OK. Indeed, they're needed only during testing.
Thanks.
@cbfinn, as you mentioned before,

> I compute the test-time statistics using the test batch of data, instead of computing the average training statistics
This seems like a bit of cheating, especially at test time. In general, we should assume only one sample is evaluated at a time at test time, and then there is no way to get proper statistics for batch_norm. It also means the test-set performance will partially depend on the batch size.
Hi, I see your approach.
If I use a moving average of the statistics by adding the update ops to the train op, do I then need to set train=False when calling the batch_norm function at test time?
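For reference, the standard pattern this question describes looks roughly like this (a sketch; `is_training` is the `tf.contrib.layers.batch_norm` argument corresponding to the repo's `train` flag, and `inputs` and the loss are hypothetical stand-ins):

```python
import tensorflow as tf

inputs = tf.placeholder(tf.float32, [None, 128])  # hypothetical input
is_training = tf.placeholder(tf.bool)             # False at test time

# batch_norm uses (and records updates for) batch statistics when
# is_training is True, and reads the moving averages when it is False.
normed = tf.contrib.layers.batch_norm(inputs, is_training=is_training)
loss = tf.reduce_mean(tf.square(normed))          # hypothetical loss

update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
with tf.control_dependencies(update_ops):
    train_op = tf.train.AdamOptimizer(1e-3).minimize(loss)

# Train: sess.run(train_op, feed_dict={inputs: x, is_training: True})
# Test:  sess.run(normed,   feed_dict={inputs: x, is_training: False})
```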