TheoCoombes/ClipCap
Using pretrained encoder and language models to generate captions from multimedia inputs.
Python
Issues
- 1
Evaluation using pre-trained model
#8 opened by uu95 - 2
minimal usage instruction
#6 opened by rom1504 - 0
train and release models
#7 opened by rom1504 - 0
- 2