Questions about memory consumption.
sayez opened this issue · 5 comments
Thank you for your work.
I have a few questions about your paper (and also about DiffI2I) concerning the memory consumption of Stage 1 of your solution(s).
- What kind of GPUs, and how many, did you use in your experiments?
- How many GB of GPU memory do you use at each step (per GPU, if you distribute batches across several)?
- In the paper, you mention that the input of DIRformer for super-resolution is 64x64 (and thus as many tokens, thanks to OverlapPatchEmbed, which seems reasonable), whereas the input is 256x256 for inpainting. Doesn't the memory consumption "explode" with that many transformer tokens? (A rough token count is sketched below.)
Thanks in advance.
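For context, a minimal back-of-the-envelope sketch of the token counts in question, assuming OverlapPatchEmbed preserves spatial resolution via a stride-1 convolution (as in Restormer); `token_count` is an illustrative helper, not a function from the DiffIR codebase:

```python
# Back-of-the-envelope token count, assuming the patch embedding is a
# stride-1 3x3 convolution (as in Restormer), so every pixel is a token.

def token_count(height: int, width: int) -> int:
    """Spatial tokens after a stride-1 patch embedding: one per pixel."""
    return height * width

for size, task in [(64, "super-resolution"), (256, "inpainting")]:
    print(f"{task}: {size}x{size} input -> {token_count(size, size):,} tokens")

# super-resolution: 64x64 input -> 4,096 tokens
# inpainting: 256x256 input -> 65,536 tokens
#
# With vanilla spatial self-attention, the attention map is O(tokens^2):
# 65,536^2 is roughly 4.3e9 entries at 256x256, which would be prohibitive.
# If DIRformer follows Restormer-style transposed (channel-wise) attention,
# the attention map is channels x channels instead, so memory grows only
# linearly with the number of pixels.
```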
We use 8×V100 (32 GB). You can directly try to train DiffIR on inpainting for testing. DiffI2I has a lighter transformer structure, which consumes even less memory than DiffIR.
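For anyone who wants to verify the per-GPU footprint themselves, a minimal, self-contained sketch using standard PyTorch peak-memory counters; the toy model and dummy batch below are placeholders for the real DiffIR training step:

```python
import torch
import torch.nn as nn

# Placeholder model and batch; substitute the actual DiffIR model,
# data loader, and loss to measure the real per-GPU peak memory.
device = "cuda"
model = nn.Sequential(
    nn.Conv2d(3, 64, 3, padding=1),
    nn.Conv2d(64, 3, 3, padding=1),
).to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# Reset the peak-memory counter, run one training step, then read the peak.
torch.cuda.reset_peak_memory_stats(device)

batch = torch.randn(8, 3, 256, 256, device=device)  # stand-in training batch
loss = model(batch).abs().mean()                    # stand-in loss
loss.backward()
optimizer.step()

peak_gb = torch.cuda.max_memory_allocated(device) / 1024**3
print(f"Peak allocated GPU memory this step: {peak_gb:.2f} GB")
```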
May I ask how long the full training of Stage 1 takes, in hours/days? (One million training steps, according to the papers.)
About a week.
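For a rough sense of what that implies (editor's arithmetic only, assuming one million steps over seven days on the 8×V100 setup above):

```python
# Rough throughput implied by ~1,000,000 steps in ~7 days.
# These are the numbers quoted above, not additional measurements.
steps = 1_000_000
seconds = 7 * 24 * 3600
print(f"~{steps / seconds:.2f} steps/s, i.e. ~{seconds / steps:.2f} s per step")
# -> ~1.65 steps/s, ~0.60 s per step
```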
Thank you!
Thank you for your excellent work. I would like to know where the code of DiffI2I is.