baaivision/CapsFusion

When will the code and dataset available?

Opened this issue · 5 comments

Great work!

Thank you for your interest in our work. The CapsFus-LLaMA model and distributed inference code have been released, please check it out.

What about the 10M version of the dataset?

Do you plan to release 100M version?

Do you plan to release 100M version?

@miguelscarv @rahimentezari Hello, we have released the CapsFusion-120M dataset, please check it out!

What about the 10M version of the dataset?

@miguelscarv The 10M version of the dataset might take some more time to be released, as there are some complications involved with releasing images. Not sure whether the current 120M format (url + captions) could satisfy your need?