Special pre-trained VGG-16 network on CIE Lab and Grayscale images converted from ImageNet training set
Compared to the official model provided by PyTorch, the classification ability of our model is only slightly weaker. Basically, these models are targeted for regression task, so we think the small improvement is unnecessary.
The Fully Convolutional models do not include MaxPooling layer & AdaptiveAvgPooling layer, which is replaced by a convolutional layer with stride = 2. Note that, the total amount of convolutional layer is unchanged. Each convolutional layer is spectral normalized (you may find the source code in this project), which is very useful for the training of WGAN.
-
epoch 30 | top 1: 32.50% | top 5: 59.90%
-
epoch 60 | top 1: 56.28% | top 5: 80.66%
-
epoch 90 | top 1: 57.96% | top 5: 81.80%
-
epoch 120 | top 1: 57.46% | top 5: 81.31%
-
epoch 30 | top 1: 37.41% | top 5: 65.55%
-
epoch 60 | top 1: 60.29% | top 5: 83.83%
-
epoch 90 | top 1: 60.38% | top 5: 83.11%
-
epoch 120 | top 1: 60.87% | top 5: 83.17%
-
epoch 30 | top 1: 35.70% | top 5: 63.22%
-
epoch 60 | top 1: 59.21% | top 5: 83.16%
-
epoch 90 | top 1: 65.15% | top 5: 86.72%
-
epoch 120 | top 1: 65.13% | top 5: 86.80%
-
We further tested validation accuracy when the learning was further decreasing to 1e-6; however, there is no explicit improvement. For example, when epoch = 125, the top 1 accuracy = 65.16% and top 5 accuracy = 86.81%.
- For epoch 5, the top 1 accuracy is 55.77% and top 5 accuracy is 79.64%. For epoch 15, the top 1 accuracy is 57.16% and top 5 accuracy is 80.79%.
Now the epoch 120 model of Fully Convolutional Gray VGG-16
is available: Link
Now the epoch 120 model of CIE Lab VGG-16
is available: Link
Now the epoch 120 model of Fully Convolutional CIE Lab VGG-16
is available: Link
Now the epoch 120 model of Gray VGG-16 with Spectral Norm
is available: Link
Now the epoch 60 model of ResNet-50-RGB IN
is available: Link
Now the epoch 60 model of ResNet-50-Gray BN
is available: Link
Other models pending...
Normally, we save the whole model as a .pth
file. If you want the weights only, please run convert.py
If you use Fully Convolutional Gray VGG-16
or Gray VGG-16 with Spectral Norm
, please consider cite:
@article{zhao2020scgan,
title={SCGAN: Saliency Map-guided Colorization with Generative Adversarial Network},
author={Zhao, Yuzhi and Po, Lai-Man and Cheung, Kwok-Wai and Yu, Wing-Yin and Abbas Ur Rehman, Yasar},
journal={IEEE Transactions on Circuits and Systems for Video Technology},
year={2020},
publisher={IEEE}
}
If you use Fully Convolutional ResNet-50-Gray BN
, please consider cite:
@article{zhao2021vcgan,
title={VCGAN: Video Colorization with Hybrid Generative Adversarial Network},
author={Zhao, Yuzhi and Po, Lai-Man and Yu, Wing-Yin and Rehman, Yasar Abbas Ur and Liu, Mengyang and Zhang, Yujia and Ou, Weifeng},
journal={arXiv preprint arXiv:2104.12357},
year={2021}
}