Chat in plain English with the app and have it edit your images as you would like.
Instruct Pix2Pix is a model trained to combine the learnings of GPT-3 & Stable Diffusion. How could it go wrong ?
- The authors start from a caption ex: “photograph of a girl riding a horse”
- Then they use a finetuned GPT-3 to generate instructions (“have her ride a dragon”) and edited captions (“photograph of a girl riding a dragon”) to generate prompts and instructions.
- From those 2 captions they use Stable Diffusion to generate 2 images.
- Which gives them their artificially generated training sample made of pairs (instruction-[image from prompt 1, image from prompt 2])
- They then train Pix2Pix to generate image from prompt 2 given instruction and image from prompt 1
More at: https://www.timothybrooks.com/instruct-pix2pix Try the app at: https://chat-diffusion-beta.vercel.app/
Venus de Milo | Turn her into a cyborg |
---|---|
Elon | Turn him into a cyborg |
---|---|
This repo is forked from https://github.com/KAJdev/diffusion-chat and kept much of the front-end code.
This repo uses the HuggingFace diffusers' implementation of Tim Brooks et al. Instruct Pix2pix model - https://www.timothybrooks.com/instruct-pix2pix