[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
Primary LanguageJupyter NotebookMIT LicenseMIT