OpenGVLab/InternVideo

Video Foundation Models & Data for Multimodal Understanding

Python · Apache-2.0

Issues

Wrong results in Action Recognition task.
#133 opened 2 days ago by zhengrongz
2
Test evaluation on Something-Something v2
#132 opened 2 days ago by yujiangpu20
2
Docker image
#108 opened a month ago by Hari-Durai-Baskar
1
Why do I get this load-checkpoint as a list of False values?
#123 opened 2 days ago by Hari-Durai-Baskar
1
Training and Evaluation Code for ViClip
#131 opened 4 days ago by fmthoker
1
[Help requested] Inference InternVideo2_clip model.
#129 opened 11 days ago by gracikk-ds
15
Download InternVid Dataset Issues
#117 opened 5 days ago by yh675
1
Hello, are the 6k action words available?
#130 opened 8 days ago by calci
0
NEED HELP: Action Classification low performance
#127 opened 17 days ago by JayMay1994
4
S2 pretrained model of InternVideo2 does not work well for Zero-Shot Video-Text Retrieval
#107 opened a month ago by Wenju-Huang
14
About Video Temporal Grounding
#105 opened 2 months ago by ArlixLin
1
Error when running InternVideo2/single_modality/scripts/finetuning/full_tuning/k400/1B_ft_k710_ft_k400_f16.sh
#128 opened 12 days ago by JayMay1994
0
internvideo2 internvl_clip_vision.py load model
#125 opened 18 days ago by rixejzvdl649
0
Will you release the audio modality pretrained model of InternVideo2?
#126 opened 19 days ago by 1093842024
0
test_num_segment, test_num_crop
#124 opened 19 days ago by rixejzvdl649
0
What's the plan for releasing the audio modality pretrained models of InternVideo2-s2 and s3?
#122 opened 25 days ago by 1093842024
0
AttributeError: 'BF16_Optimizer' object has no attribute 'cur_scale'
#121 opened a month ago by xin0623
2
InternVideo1: Cannot load videos using petrel_client for Video-Text Retrieval task
#118 opened a month ago by ruixuan-ray-zhang
2
The correct way to prompt for 0-shot video classification
#120 opened a month ago by Eugleo
2
InternVideo2 s3-1B for zero-shot video captioning
#119 opened a month ago by cdjkim
0
Confusion about zero-shot setting on Video-Text Retrieval
#89 opened 2 months ago by overwhelmedxh
4
ModuleNotFoundError: No module named 'dropout_layer_norm'
#102 opened 2 months ago by hzlcodus
5
GPU resources required?
#115 opened a month ago by Hari-Durai-Baskar
2
InternVideo2 Weights for InternVideo 6B s2 version + License File
#116 opened a month ago by tamirm2009
0
Zero-shot retrieval reproduction issue
#112 opened a month ago by jqsun98
1
will you release pretrained model of InternVideo2-s2-1B frame=8?
#103 opened a month ago by 1093842024
5
Installation Issues with Demo Notebook
#110 opened a month ago by raviy0807
2
Clip model size is too small
#114 opened a month ago by dwsmart32
3
What is the difference between model internvideo2_s2 and internvideo2_clip?
#113 opened a month ago by lllllllll-3154
3
How to conduct the video-text matching loss mentioned in the paper for ViCLIP during text-video retrieval fine-tuning
#111 opened a month ago by jpWang
1
where is /eval_msrvtt_no_deepspeed.sh for s2 evaluation?
#109 opened a month ago by Hari-Durai-Baskar
2
Performance Reproduction of ViCLIP (on MSRVTT)
#106 opened a month ago by jpWang
1
is there a demo code for video QA and video Captioning?
#104 opened 2 months ago by LanHao0
1
Do you have plans to release all the captions of InternVid?
#95 opened 2 months ago by yellow-binary-tree
2
is there any clear training instruction of internvideo1, like step by step?
#97 opened 2 months ago by lynshwoo2022
1
Questions about the demo of ViCLIP provided in the InternVideo/Data/InternVid
#101 opened 2 months ago by XuecWu
2
Simple question: What are the public datasets included in InternVid-200M?
#100 opened 2 months ago by jong980812
1
When will InternVideo2 be released?
#93 opened 2 months ago by jpWang
1
InternVideo2 Release
#94 opened 2 months ago by plmsmile
1
is there a pretrain model release schedule for InternVideo2?
#98 opened 2 months ago by 1093842024
1
InternVideo-MM-L-14 pretraining datasets
#99 opened 2 months ago by ninatu
0
model interaction part of internvideo1
#96 opened 2 months ago by lynshwoo2022
0
When will the training scripts and related code of ViCLIP be released?
#85 opened 2 months ago by XuecWu
2
Video links for Instruction Videos
#84 opened 2 months ago by hainow
2
What do DIV and FLT stand for?
#91 opened 2 months ago by vedantroy
2
How to set multi-gpu to eval zero shot performance?
#90 opened 2 months ago by 1240446371
0
notebook
#86 opened 2 months ago by betterze
8
zeroshot video-retrieval
#88 opened 2 months ago by 1240446371
0
Notebook for running spatiotemporal detection
#87 opened 2 months ago by kaustavnandy
0
Ckpt release of vit-H model
#83 opened 3 months ago by XiaominLi1997
1