tttyuntian/ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
PythonApache-2.0
Watchers
No one’s watching this repository yet.
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
PythonApache-2.0
No one’s watching this repository yet.