wegigabyteinsn's Stars
wangkang12/yolo_slowfast-master
Ricardokevins/Bert-In-Relation-Extraction
使用Bert完成实体之间关系抽取
guang-yng/VStates
Video Evnet Extraction via Tracking Visual States of Arguments (AAAI2023)
facebookresearch/grounded-video-description
Video Grounding and Captioning
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
gaomingqi/Awesome-Video-Object-Segmentation
:bookmark: Curated list of video object segmentation (VOS) papers, datasets, and projects.
kahnchana/mvu
Multimodal Video Understanding Framework (MVU)
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Event-AHU/VTF_PAR
[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
zwq456/CLIP-VIS
[IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
YoadTew/zero-shot-video-to-text
X-PLUG/Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks