MILVLG/mt-captioning
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
PythonApache-2.0
Stargazers
- 88899
- AcodeCZhejiang University
- Aidenfaustine
- BetterZH
- gyq716
- haikangdeng
- hcwei13
- JcekBJTU,China
- JunlongFengChangchun University of Science and Technology
- LLipsky
- MIL-VLGHangzhou Dianzi University
- nbgaoMedia Intelligence Laboratory(MIL@HDU)
- noonisy
- NTUYi
- PrabhatLigal
- qq123aa456
- RomanShen
- sun254667307
- syan2018
- tony-hongSaarland University
- uhmwpe
- xumengmeng-97
- youcaiSUN
- Zoroaster97Media Intelligence Laboratory(MIL@HDU)