TXH-mercury/VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Python, MIT

Issues

download manage and control from youtube
#26 opened 2 months ago by frankymaxyyy
1
Some videos in VALOR-32K are unavailable on YouTube
#8 opened 4 months ago by ttgeng233
4
Code to perform QA task
#25 opened 4 months ago by Dewmi24
1
Inference code
#19 opened 4 months ago by abhimanyu891998
6
Here guessing what to do to start running this on videos
#4 opened 4 months ago by spacewalkingninja
6
A question about the optimizer:
#18 opened 4 months ago by HrealcodeH
5
Providing all versions of pretrained weights
#21 opened 4 months ago by YingtianDt
1
Questions about how to calculate metrics
#22 opened 4 months ago by aTunass
1
Pre-training Data Release
#3 opened 4 months ago by vateye
5
TypeError: __init__() missing 2 required positional arguments: 'stdout' and 'stderr'
#24 opened 4 months ago by cs-wangfeng
3
Errors in loading Bert and attention score calculation
#5 opened 4 months ago by Kirillova-Anastasia
1
AssertionError when calculating BLEU score
#15 opened 4 months ago by thechargedneutron
1
Comparison between SoTA methods
#2 opened 4 months ago by MAGAer13
1
Inference Code
#7 opened a year ago by isjwdu
7
Information on where to find the frames_fps4 and audio_22050hz sections
#23 opened 7 months ago by cs-wangfeng
0
RuntimeError: CUDA error: no kernel image is available for execution on the device
#20 opened 9 months ago by xibian1120
0
Plan to release finetuned models?
#11 opened a year ago by yt2639
8
Different Results on msrvtt-1kA
#17 opened a year ago by YasmineXXX
1
Strange error, but it works normally
#9 opened a year ago by zsw111-zzz
2
"Output file #0 does not contain any stream"
#10 opened a year ago by zsw111-zzz
2
About prerequisite
#6 opened a year ago by isjwdu
1
link to the pretrained_weights is not available
#1 opened a year ago by Kirillova-Anastasia
2