kxgong opened this issue 7 months ago · 0 comments
Hi, I used 2x 8xA100 machine to train this code on video datasets. I use accelerate as ddp launcher.
After 8 ~ 9 hours of running, I only ran about 3800 steps.
Is this normal?