wegigabyteinsn

wegigabyteinsn's Stars

wangkang12/yolo_slowfast-master
Language:Python143
Ricardokevins/Bert-In-Relation-Extraction
使用Bert完成实体之间关系抽取
Language:Python66776
guang-yng/VStates
Video Evnet Extraction via Tracking Visual States of Arguments (AAAI2023)
Language:Python111
facebookresearch/grounded-video-description
Video Grounding and Captioning
Language:Python32373
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
Language:Python3.4k440
gaomingqi/Awesome-Video-Object-Segmentation
:bookmark: Curated list of video object segmentation (VOS) papers, datasets, and projects.
2297
kahnchana/mvu
Multimodal Video Understanding Framework (MVU)
Language:Python25
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Language:Python90144
Event-AHU/VTF_PAR
[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition
Language:Python232
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.7k6.4k
zwq456/CLIP-VIS
[IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.
Language:Python372
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python2.9k264
YoadTew/zero-shot-video-to-text
Language:Python747
X-PLUG/Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
Language:Python28811