/multimodal_scripts

Customized training scripts for multimodal components: CLIP, ViT, and DETR
