[ECCV24] Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
Primary LanguagePythonMIT LicenseMIT