Implement about DRB-GAN: A DynamicResBlock Generative Adversarial Network for Artistic Style Transfer: https://openaccess.thecvf.com/content/ICCV2021/papers/Xu_DRB-GAN_A_Dynamic_ResBlock_Generative_Adversarial_Network_for_Artistic_Style_ICCV_2021_paper.pdf
- MUNIT: https://github.com/NVlabs/MUNIT/tree/a82e222bc359892bd0f522d7a0f1573f3ec4a485
- MsImageDis, Basic Blocks, AdaptiveInstanceNorm2d
- FUNIT: https://github.com/NVlabs/FUNIT
- ClassModelEncoder, ContentEncoder
- UGATIT: https://github.com/znxlwm/UGATIT-pytorch
- Class Activation Map(setting bias=False), Gamma, Beta block, adaILN
- Dynamic Convolution: https://github.com/TArdelean/DynamicConvolution
- VGG19: https://github.com/gordicaleksa/pytorch-neural-style-transfer/blob/master/models/definitions/vgg_nets.py
- Dynamic Convolution weight from gammma
Line 117 in 642beae
- Classifier weight without gradient calculate with style_mixed_feature
DRB-GAN/model/StyleEncodNet.py
Line 80 in 642beae
- Multi-scale discriminator
DRB-GAN/model/DiscriminativeNet.py
Line 32 in 642beae
- Explore data analysis using v2 model: cluster of style features, style classifier predicted labels difference from true label
- Training with fp16 (not actually save any memory space)
- Observer How to converge
- Rolling Guidance Filter, Guided Filter, Gabor Filter
- Evaluation Metric: LPIPS, ArtFID, SIFID
- AdaWCT, FastDifferentiableMatSqrt
- Weighted averaging strategy for collection style transfer
- Different data augmentation(sharpness) with assigned artist's style
- LPIPS: Reference-guided LPIPS values for each style with content: 0.5225579113960266
- FID between style and stylized: 49.61556248811253
- SIFID between style and stylized: 0.00016049967
- IS for stylized: 6.173467775877308
- Reference-guided LPIPS values for all style: 0.4380792519592103
- FID between style and stylized: 127.29334051784087
- SIFID between style and stylized: 0.08613132
- IS for stylized: 5.70985832650343
- class accuracy: 0.9672864203341963
- error rate about nicholas is highest: 66/858 = 0.0769
- update LOUIS_WAIN 90 -> 190:
- class accuracy: 0.961644464859899
- error rate about LOUIS_WAIN is highest: 29/190 = 0.1526
- The filters of blur image like Rolling Guidance Filter and Guided Filter are generated artifact(rectangle patters and raster effect on contours)
- Add random noise on contours by Gabor Filter is generated artifact(raster effect on contours), if using noise on non-contours same result as add_random_noise
- original architecture not stable when adversarial weight up to 5.0 and generated artifact(rectangle patters and raster effect on contours)