
Vision Transformer GAN Architecture


Vision Transformer-only GAN (generator and discriminator) - PyTorch

1. Related work

Original paper: Jiang, Yifan, Shiyu Chang, and Zhangyang Wang. "TransGAN: Two Transformers Can Make One Strong GAN." arXiv preprint arXiv:2102.07074 (2021).

TransGAN: Two Transformers Can Make One Strong GAN

Code inspired by TransGAN.

2. GAN Architecture

[Figure: overall generator and discriminator architecture]
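
The repository's exact modules are not spelled out in this README, but following the TransGAN paper, a minimal sketch of a transformer-only generator and discriminator might look like the code below. All sizes, module names, and the patch-assembly details are illustrative assumptions, not the repository's actual code.

import torch
import torch.nn as nn

class TransformerGenerator(nn.Module):
    # Illustrative sketch (TransGAN-style), not the repository's actual code:
    # latent vector -> 8x8 token grid -> transformer blocks -> tokens decoded
    # into 4x4 RGB patches, assembled into a 32x32 image.
    def __init__(self, latent_dim=128, embed_dim=256, depth=4, heads=4):
        super().__init__()
        self.embed_dim = embed_dim
        self.to_tokens = nn.Linear(latent_dim, 8 * 8 * embed_dim)
        self.pos = nn.Parameter(torch.zeros(1, 8 * 8, embed_dim))
        layer = nn.TransformerEncoderLayer(embed_dim, heads,
                                           dim_feedforward=4 * embed_dim,
                                           activation="gelu", batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, depth)
        self.to_rgb = nn.Linear(embed_dim, 3 * 4 * 4)  # one 4x4 RGB patch per token

    def forward(self, z):
        b = z.size(0)
        x = self.to_tokens(z).view(b, 8 * 8, self.embed_dim) + self.pos
        x = self.blocks(x)                                  # (B, 64, C)
        x = self.to_rgb(x).view(b, 8, 8, 3, 4, 4)           # patches on an 8x8 grid
        x = x.permute(0, 3, 1, 4, 2, 5).reshape(b, 3, 32, 32)
        return torch.tanh(x)

class TransformerDiscriminator(nn.Module):
    # Illustrative sketch: patchify with a linear embedding -> transformer
    # blocks -> real/fake logit from a CLS token, as in a ViT classifier.
    def __init__(self, embed_dim=256, depth=4, heads=4, patch=4):
        super().__init__()
        self.unfold = nn.Unfold(kernel_size=patch, stride=patch)
        self.embed = nn.Linear(3 * patch * patch, embed_dim)
        n_tokens = (32 // patch) ** 2 + 1                   # patches + CLS
        self.cls = nn.Parameter(torch.zeros(1, 1, embed_dim))
        self.pos = nn.Parameter(torch.zeros(1, n_tokens, embed_dim))
        layer = nn.TransformerEncoderLayer(embed_dim, heads,
                                           dim_feedforward=4 * embed_dim,
                                           activation="gelu", batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, depth)
        self.head = nn.Linear(embed_dim, 1)

    def forward(self, img):
        x = self.embed(self.unfold(img).transpose(1, 2))    # (B, 64, C)
        x = torch.cat([self.cls.expand(x.size(0), -1, -1), x], dim=1) + self.pos
        x = self.blocks(x)
        return self.head(x[:, 0])                           # real/fake logit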

3. Self-attention masking

[Figure: self-attention masking scheme]
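
The details of the masking live in the code and the figure, but a local self-attention mask of this kind can be sketched as follows. The helper name, the square token grid, and the Chebyshev-distance window are assumptions for illustration, not the repository's API.

import torch

def local_attention_mask(grid_size: int, window: int) -> torch.Tensor:
    # True marks token pairs that may NOT attend to each other: tokens whose
    # grid coordinates differ by more than `window` in either direction.
    coords = torch.stack(torch.meshgrid(
        torch.arange(grid_size), torch.arange(grid_size), indexing="ij"
    ), dim=-1).reshape(-1, 2)                       # (N, 2) row/col per token
    diff = (coords[:, None, :] - coords[None, :, :]).abs()  # (N, N, 2)
    return (diff > window).any(dim=-1)              # (N, N), True = masked

# Masked positions get -inf before the softmax, so their attention weight is 0.
mask = local_attention_mask(grid_size=8, window=2)
scores = torch.randn(8 * 8, 8 * 8)                  # raw attention scores
weights = scores.masked_fill(mask, float("-inf")).softmax(dim=-1)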

4. Gradually Increasing Receptive Field

[Figure: attention window widening over the course of training]
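
Reusing local_attention_mask from the sketch above, a schedule that gradually widens the attention window over training could look like this. The epoch thresholds and window sizes are illustrative assumptions, not values taken from this repository.

def window_for_epoch(epoch: int, grid_size: int = 8) -> int:
    # Sketch of a schedule: start with local attention, widen in stages,
    # end global (a window of grid_size leaves no token pair masked).
    if epoch < 20:
        return 2
    if epoch < 40:
        return 4
    return grid_size

# Rebuild the mask whenever the window changes:
# mask = local_attention_mask(grid_size=8, window=window_for_epoch(epoch))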

5. Difference from original paper

  • GELU activation is only approximated, rather than computed exactly, to speed up computation on the GPU (see the sketch after this list).

  • MLP is formed by Fully Connected => GELU => Fully Connected => GELU
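
A sketch of both points, assuming the common tanh approximation of GELU (the exact formula used by the repository is not shown in this README):

import math
import torch
import torch.nn as nn

def gelu_approx(x: torch.Tensor) -> torch.Tensor:
    # Tanh approximation of GELU (assumed here; faster on GPU than the exact erf form):
    # 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    return 0.5 * x * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi)
                                       * (x + 0.044715 * x.pow(3))))

class MLP(nn.Module):
    # Fully Connected => GELU => Fully Connected => GELU, as described above.
    def __init__(self, dim: int, hidden_dim: int):
        super().__init__()
        self.fc1 = nn.Linear(dim, hidden_dim)
        self.fc2 = nn.Linear(hidden_dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return gelu_approx(self.fc2(gelu_approx(self.fc1(x))))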

6. Install requirements

pip install -r requirements.txt

Python version >= 3.6

7. Training

python train.py

8. Jupyter notebook

Use Untitled.ipynb to easily modify training parameters