PyTorch-Special-Pre-trained-Models

Special pre-trained VGG-16 network on CIE Lab and Grayscale images converted from ImageNet training set

1 Model Validation Accuracy (on ImageNet Validation 50k)

Compared to the official model provided by PyTorch, the classification ability of our model is only slightly weaker. Basically, these models are targeted for regression task, so we think the small improvement is unnecessary.

The Fully Convolutional models do not include MaxPooling layer & AdaptiveAvgPooling layer, which is replaced by a convolutional layer with stride = 2. Note that, the total amount of convolutional layer is unchanged. Each convolutional layer is spectral normalized (you may find the source code in this project), which is very useful for the training of WGAN.

1.1 Fully Convolutional Gray VGG-16

epoch 30 | top 1: 32.50% | top 5: 59.90%
epoch 60 | top 1: 56.28% | top 5: 80.66%
epoch 90 | top 1: 57.96% | top 5: 81.80%
epoch 120 | top 1: 57.46% | top 5: 81.31%

1.2 CIE Lab VGG-16

epoch 30 | top 1: 37.41% | top 5: 65.55%
epoch 60 | top 1: 60.29% | top 5: 83.83%
epoch 90 | top 1: 60.38% | top 5: 83.11%
epoch 120 | top 1: 60.87% | top 5: 83.17%

1.3 Fully Convolutional CIE Lab VGG-16

epoch 30 | top 1: 35.70% | top 5: 63.22%
epoch 60 | top 1: 59.21% | top 5: 83.16%
epoch 90 | top 1: 65.15% | top 5: 86.72%
epoch 120 | top 1: 65.13% | top 5: 86.80%
We further tested validation accuracy when the learning was further decreasing to 1e-6; however, there is no explicit improvement. For example, when epoch = 125, the top 1 accuracy = 65.16% and top 5 accuracy = 86.81%.

1.4 Fully Convolutional RGB ResNet-50 IN

For epoch 5, the top 1 accuracy is 55.77% and top 5 accuracy is 79.64%. For epoch 15, the top 1 accuracy is 57.16% and top 5 accuracy is 80.79%.

2 Download Link

2.1 Fully Convolutional Gray VGG-16

Now the epoch 120 model of Fully Convolutional Gray VGG-16 is available: Link

2.2 CIE Lab VGG-16

Now the epoch 120 model of CIE Lab VGG-16 is available: Link

2.3 Fully Convolutional CIE Lab VGG-16

Now the epoch 120 model of Fully Convolutional CIE Lab VGG-16 is available: Link

2.4 Gray VGG-16 with Spectral Norm

Now the epoch 120 model of Gray VGG-16 with Spectral Norm is available: Link

2.5 ResNet-50-RGB IN

Now the epoch 60 model of ResNet-50-RGB IN is available: Link

2.6 ResNet-50-Gray BN

Now the epoch 60 model of ResNet-50-Gray BN is available: Link

Other models pending...

3 Convert

Normally, we save the whole model as a .pth file. If you want the weights only, please run convert.py

4 Acknowledgement

If you use Fully Convolutional Gray VGG-16 or Gray VGG-16 with Spectral Norm, please consider cite:

@article{zhao2020scgan,
  title={SCGAN: Saliency Map-guided Colorization with Generative Adversarial Network},
  author={Zhao, Yuzhi and Po, Lai-Man and Cheung, Kwok-Wai and Yu, Wing-Yin and Abbas Ur Rehman, Yasar},
  journal={IEEE Transactions on Circuits and Systems for Video Technology},
  year={2020},
  publisher={IEEE}
}

If you use Fully Convolutional ResNet-50-Gray BN, please consider cite:

@article{zhao2021vcgan,
  title={VCGAN: Video Colorization with Hybrid Generative Adversarial Network},
  author={Zhao, Yuzhi and Po, Lai-Man and Yu, Wing-Yin and Rehman, Yasar Abbas Ur and Liu, Mengyang and Zhang, Yujia and Ou, Weifeng},
  journal={arXiv preprint arXiv:2104.12357},
  year={2021}
}

taylover2016/PyTorch-Special-Pre-trained-Models