zinengtang/TVLT

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)

Jupyter NotebookMIT

Issues

RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
#17 opened 8 months ago by saransh03sharma
0
error in Demo_Video_Audio_MAE.ipynb opened in colab
#16 opened a year ago by snapfinger
0
VQAv2 finetuned checkpoint
#15 opened a year ago by farisalasmary
0
Finetuning for the custom dataset
#14 opened a year ago by palashmoon
0
about mosei
#13 opened a year ago by LM-MSA
0
The question for cmumosei.
#6 opened 2 years ago by AIXiaoBaiDemon
4
Whether the text of MOSEI's text-based results comes from ASR or raw dataset?
#12 opened a year ago by Yimi81
0
In accurate test results for emotion classification
#5 opened 2 years ago by Changezi001
5
Draw false video from batch
#11 opened 2 years ago by G-JWLee
1
inaccurate VQA score
#10 opened 2 years ago by Park-ing-lot
3
Finetuning on MOSEI but with nan output
#8 opened 2 years ago by BDHU
4
Finetuning for emotion analysis but nan output
#9 opened 2 years ago by Changezi001
3
CUDA memory error
#7 opened 2 years ago by Park-ing-lot
2
Processing cmumosei dataset
#3 opened 2 years ago by BDHU
8
Downstream task Cosine scheduler
#4 opened 2 years ago by G-JWLee
7
CMU-MOSEI valid test
#2 opened 2 years ago by dori2063
1