Issues
- 2
Wrong results in Action Recognition task.
#133 opened by zhengrongz - 2
Test evaluation on Something-Something v2
#132 opened by yujiangpu20 - 1
Docker image
#108 opened by Hari-Durai-Baskar - 1
- 1
Training and Evaluation Code for ViClip
#131 opened by fmthoker - 15
- 1
Download InternVid Dataset Issues
#117 opened by yh675 - 0
Hello, are the 6k action words available?
#130 opened by calci - 4
NEED HELP: Action Classification low performance
#127 opened by JayMay1994 - 14
S2 pretrained model of InternVideo2 does not work well for Zero-Shot Video-Text Retrieval
#107 opened by Wenju-Huang - 1
About Video Temporal Grounding
#105 opened by ArlixLin - 0
Error when running InternVideo2/single_modality/scripts/finetuning/full_tuning/k400/1B_ft_k710_ft_k400_f16.sh
#128 opened by JayMay1994 - 0
internvideo2 internvl_clip_vision.py load model
#125 opened by rixejzvdl649 - 0
- 0
test_num_segment, test_num_crop
#124 opened by rixejzvdl649 - 0
What's the plan for releasing the audio modality pretrained models of InternVideo2-s2 and s3?
#122 opened by 1093842024 - 2
- 2
InternVideo1: Cannot load videos using petrel_client for Video-Text Retrieval task
#118 opened by ruixuan-ray-zhang - 2
- 0
InternVideo2 s3-1B for zero-shot video captioning
#119 opened by cdjkim - 4
- 5
- 2
GPU resources required?
#115 opened by Hari-Durai-Baskar - 0
- 1
Zero-shot retrieval reproduction issue
#112 opened by jqsun98 - 5
- 2
Installation Issues with Demo Notebook
#110 opened by raviy0807 - 3
Clip model size is too small
#114 opened by dwsmart32 - 3
What is the difference between model internvideo2_s2 and internvideo2_clip?
#113 opened by lllllllll-3154 - 1
How to conduct the video-text matching loss mentioned in the paper for ViCLIP during text-video retrieval fine-tuning
#111 opened by jpWang - 2
- 1
Performance Reproduction of ViCLIP (on MSRVTT)
#106 opened by jpWang - 1
- 2
- 1
is there any clear training instruction of internvideo1, like step by step?
#97 opened by lynshwoo2022 - 2
- 1
- 1
When will InternVideo2 be released
#93 opened by jpWang - 1
InternVideo2 Release
#94 opened by plmsmile - 1
- 0
InternVideo-MM-L-14 pretraining datasets
#99 opened by ninatu - 0
model interaction part of internvideo1
#96 opened by lynshwoo2022 - 2
- 2
Video links for Instruction Videos
#84 opened by hainow - 2
What do DIV and FLT stand for?
#91 opened by vedantroy - 0
- 8
- 0
zeroshot video-retrieval
#88 opened by 1240446371 - 0
- 1
Ckpt release of vit-H model
#83 opened by XiaominLi1997