how to acquire the real whole batch sequenece training loss(reduction_mode=mean) ?

in the train.py, the loss return from main process is the loss of one sequence block, not the whole sequence loss.

Line 150 in 01a9360

gathered_loss = accelerator.reduce(loss.clone().detach(), "mean")

It is the whole sequence loss?

3Q very much