shiny-red-apple's Stars
v-iashin/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
v-iashin/BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
lucidrains/MaMMUT-pytorch
Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch
rowanz/merlot_reserve
Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"
zhengzangw/awesome-huge-models
A collection of AWESOME things about HUGE AI models.
dhansmair/flamingo-mini
Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training
lucidrains/flamingo-pytorch
Implementation of š¦© Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
rowanz/merlot
MERLOT: Multimodal Neural Script Knowledge Models
KaiyangZhou/pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
m-bain/frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
e-apostolidis/PGL-SUM
A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE ISM 2021)
ok1zjf/VASNet
PyTorch implementation of the ACCV 2018-AIU2018 paper Video Summarization with Attention
semchan/Uformer
A PyTorch implementation of our paper Video summarization with u-shaped transformer. Published in Applied Intelligence.
e-apostolidis/AC-SUM-GAN
A PyTorch Implementation of AC-SUM-GAN from "AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization" (IEEE TCSVT 2021)
medhini/clip_it
CLIP-It! Language-Guided Video Summarization
hong8e/KoROUGE
Calculating ROUGE score for Korean (Wrapper for ROUGE-1.5.5.pl script)