clownrat6/VideoLLaMA2
VideoLLaMA 2: Improving Video-LLMs with Convolutional Spatial-Temporal Aggregation and Stronger Audio Capability
Python · Apache-2.0