Yxxxb/VoCo-LLaMA

How to compare the inference time?

Closed this issue · 4 comments

Hi, authors. I am wondering how you measure and report efficiency in terms of inference time.

Besides, I would like to know how to measure CUDA time. Thanks a lot.

Hi,

llama.cpp, LLaVA-cli, or Python's simple time functions can all be used to measure inference time.
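For reference, here is a minimal sketch (not the authors' script) of the Python-side approach, covering both wall-clock time via `time.perf_counter` and GPU-side CUDA time via `torch.cuda.Event`. The `model`, `inputs`, and `max_new_tokens` value are hypothetical placeholders for a Hugging Face-style model and pre-tokenized inputs already on the GPU:

```python
import time
import torch

def measure_wall_clock(model, inputs, n_runs=10):
    """Wall-clock latency with Python's time module.
    torch.cuda.synchronize() is required because CUDA kernels
    launch asynchronously relative to the host."""
    # Warm-up runs to exclude one-time CUDA/initialization costs.
    for _ in range(3):
        model.generate(**inputs, max_new_tokens=32)
    torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(n_runs):
        model.generate(**inputs, max_new_tokens=32)
    torch.cuda.synchronize()
    return (time.perf_counter() - start) / n_runs  # seconds per run

def measure_cuda_time(model, inputs, n_runs=10):
    """GPU-side time with CUDA events, which time the kernels
    themselves rather than host-side overhead."""
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    torch.cuda.synchronize()
    start.record()
    for _ in range(n_runs):
        model.generate(**inputs, max_new_tokens=32)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / 1000.0 / n_runs  # elapsed_time is in ms
```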

@Yxxxb What batch size do you use?

The overall batch size is 128, the same as in the LLaVA SFT stage. You can check the training hyperparameters in the "Additional Implementation Details" section of our paper's appendix.