Lzq5's Stars
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
wangxiang1230/Awesome-Online-Action-Detection
Awesome Online Action Detection
Lzq5/Video-Text-Alignment
lucidrains/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in PyTorch
showlab/videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
showlab/Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥 Latest Papers, Codes and Datasets on Vid-LLMs.
yuzhms/Streaming-Video-Model
[CVPR2023] Code for "Streaming Video Model"
yancie-yjr/StreamYOLO
Real-time Object Detection for Streaming Perception, CVPR 2022
ninatu/howtocaption
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
Lipurple/ARIS
A Simple Plugin for Transforming Images to Arbitrary Scales
ju-chen/Efficient-Prompt