Questions about LanguageBind Usage
lingjunzhao opened this issue · 0 comments
Hi,
Thanks for releasing the code! I have been reading your paper, but I still have some questions about how LanguageBind is used in Video-LLaVA:
- During Video-LLaVA training, were the image/video encoder weights (initialized from LanguageBind) trainable or frozen?
- Which LanguageBind checkpoint from the model zoo did you initialize the weights from, e.g. LanguageBind_Video_V1.5_FT or LanguageBind_Video_FT?