/text-to-image-generation-feat-diffusion

:bulb:Text-to-Image Generation (feat. Diffusion): paper/code review and experimental findings related to text-to-image generation and diffusion-based models

Primary LanguageHTMLMIT LicenseMIT

text-to-image-generation-feat-diffusion

This is the repository of Pseudo Lab's Text-to-Image Generation (feat. Diffusion) team.

๐Ÿ’ก Our aim is to review papers and code related to image generation and text-to-image generation models, approach them theoretically, and conduct various experiments by fine-tuning diffusion based models.

About Us - Pseudo Lab

About Us - Text-to-Image Generation (feat. Diffusion) Team

์ฐธ์—ฌ ๋ฐฉ๋ฒ•: ๋งค์ฃผ ์ˆ˜์š”์ผ ์˜คํ›„ 9์‹œ, ๊ฐ€์งœ์—ฐ๊ตฌ์†Œ Discord Room-DH ๋กœ ์ž…์žฅ!

Contributors

  • ์กฐ์ƒ์šฐ [Sangwoo Jo] | Github | Linkedin |
  • ๋ฌธ๊ด‘์ˆ˜ [Kwangsu Mun] | Github | Linkedin |
  • ๊น€์ง€์ˆ˜ [Jisu Kim] | Github | Linkedin |
  • ๋ฐ•๋ฒ”์ˆ˜ [Beomsoo Park] | Github | Linkedin |
  • ์ง€์Šนํ™˜ [Seunghwan Ji] | Github | Linkedin |
  • ๊ณ ๋™๊ทผ [Donggeun Sean Ko] | Github | Linkedin |
  • ์กฐ๋‚จ๊ฒฝ [Namkyeong Cho] | Github | Linkedin |
  • ๊น€์„ ํ›ˆ [SeonHoon Kim] | Github | Linkedin |
  • ์ด์ค€ํ˜• [Junhyoung Lee] | Github | Linkedin |
  • ์กฐํ˜•์„œ [Hyoungseo Cho] | Github | Linkedin |
  • ์œ ์ •ํ™” [Jeonghwa Yoo] | Github | Linkedin |
  • ๋ฐ•์„ธํ™˜ [Sehwan Park] | Github | Linkedin |

Reviewed Papers

idx Date Presenter Paper / Code
1 2023.03.29 Sangwoo Jo Auto-Encoding Variational Bayes (ICLR 2014)
Generative Adversarial Networks (NIPS 2014)
2 2023.04.05 Kwangsu Mun
Jisu Kim
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (ICCV 2017)
A Style-Based Generator Architecture for Generative Adversarial Networks (CVPR 2019)
3 2023.04.12 Beomsoo Park
Seunghwan Ji
Denoising Diffusion Probabilistic Models (NeurIPS 2020)
Denoising Diffusion Implicit Models (ICLR 2021)
4 2023.05.10 Donggeun Sean Ko Diffusion Models Beat GANs in Image Synthesis (NeurIPS 2021)
Zero-Shot Text-to-Image Generation (ICML 2021)
5 2023.05.17 Namkyeong Cho
Sangwoo Jo
High-Resolution Image Synthesis with Latent Diffusion Models (CVPR 2022)
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (CVPR 2023)
6 2023.05.24 Kwangsu Mun
Jisu Kim
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Adding Conditional Control to Text-to-Image Diffusion Models
7 2023.05.31 Beomsoo Park
Seunghwan Ji
LoRA: Low-Rank Adaptation of Large Language Models
Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
8 2023.08.30 Donggeun Sean Ko
Sangwoo Jo
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting (CVPR 2023)
9 2023.09.06 SeonHoon Kim
Seunghwan Ji
Hierarchical Text-Conditional Image Generation with CLIP Latents
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations (ICLR 2022)
10 2023.09.13 Namkyeong Cho
Junhyoung Lee
DeepFloyd IF
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
11 2023.09.20 HyoungSeo Cho
Sangwoo Jo
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
12 2023.09.27 Sehwan Park
Junhyoung Lee
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models (ICML 2022)
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
13 2023.10.11 Jeonghwa Yoo
SeonHoon Kim
Synthetic Data from Diffusion Models Improves ImageNet Classification
Your Diffusion Model is Secretly a Zero-Shot Classifier (ICCV 2023)
14 2023.10.18 Seunghwan Ji A Study on the Evaluation of Generative Models
15 2023.10.25 Sangwoo Jo
HyoungSeo Cho
Progressive Distillation for Fast Sampling of Diffusion Models (ICLR 2022)
ConceptLab: Creative Generation using Diffusion Prior Constraints
16 2023.11.01 SeonHoon Kim
Jeonghwa Yoo
BBDM: Image-to-image Translation with Brownian Bridge Diffusion Models (CVPR 2023)
Make-A-Video: Text-to-Video Generation without Text-Video Data
17 2023.11.15 Sehwan Park
Junhyoung Lee
Diffusion Models already have a Semantic Latent Space (ICLR 2023)
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models (CVPR 2023)

Jupyter Book Update Procedure

  1. Clone the repo on your local computer
git clone https://github.com/Pseudo-Lab/text-to-image-generation.git
  1. Install required packages
pip install jupyter-book==0.15.1
pip install ghp-import==2.1.0
  1. Change the contents in book/docs folder with the following format and update _toc.yml file accordingly

  • 3.1. Add information section on top of the markdown page
- **Title:** {๋…ผ๋ฌธ ์ œ๋ชฉ}, {ํ•™ํšŒ/ํ•™์ˆ ์ง€๋ช…}

- **Reference**
    - Paper:  [{๋…ผ๋ฌธ ๋งํฌ}]({๋…ผ๋ฌธ ๋งํฌ})
    - Code: [{code ๋งํฌ}]({code ๋งํฌ})
    - Review: [{review ๋งํฌ}]({review ๋งํฌ})
    
- **Author:** {๋ฆฌ๋ทฐ ์ž‘์„ฑ์ž ๊ธฐ์ž…}

- **Edited by:** {๋ฆฌ๋ทฐ ํŽธ์ง‘์ž ๊ธฐ์ž…}

- **Last updated on {์ตœ์ข… update ๋‚ ์งœ e.g. Apr. 12, 2023}**
  • 3-2. Use the following template when displaying images
:::{figure-md} 
<img src="{์ฃผ์†Œ}" alt="{tag๋ช…}" class="bg-primary mb-1" width="{800px}">

{์ œ๋ชฉ} \  (source: {์ถœ์ฒ˜})
:::
  • 3-3. Update _toc.yml file accordingly
format: jb-book
root: intro
parts:
- caption: Paper/Code Review
  chapters:
  - file: docs/review/vae
  - file: docs/review/gan
  1. Build the book using Jupyter Book command
jupyter-book build ./book
  1. Sync your local and remote repositories
cd text-to-image-generation
git add .
git commit -m "adding my first book!"
git push
  1. Publish your Jupyter Book with Github Pages
ghp-import -n -p -f book/_build/html -m "initial publishing"