magic-research/PLLaVA

Training time consumption

Closed this issue · 2 comments

Awesome, this work is existing and meaningful!!
Could you please tell me how much time and GPU it takes to train the entire model under the current settings? I like this job very much, but I may not have enough resources to implement it.

#1 (comment)

The largest model costs around 48 A100 GPU days.

The smaller models costs less.
cc @cathyxl

How long does it take to train the 7B model?