SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
Primary LanguagePythonOtherNOASSERTION