This repo contains the implementation of the class project: Controlling DDPMs and VAEs through EBMs by Giuseppe Concialdi
The repository has been developed starting with the fork of the DiffuseVAE repository and the LACE repository.
Controlling VAE and DDP through EBMs leads to a fast, diverse and high-quality generation with a high degree of control for the synthesis.
The main contributions of this work are:
-
A novel mixed framework that combines VAEs, DDPMs and EBMs to provide a controllable model that generates high-quality and diverse images in a reasonable amount of time leveraging the diffusion speed-quality tradeoff.
-
A modular three-stage approach with a generator-refiner network and a controller added for improved controllability and compositionality. Each module is independent and can be replaced by other architectures.
Conditional latent sampling | Diffusion-guided refinement |
---|---|
It is possible to try the conditional latent generation using the Colab code tutorial in the file GCR: Latent conditional sampling.ipynb
All results can be accessed here.
Giuseppe Concialdi (@Gio99c)