Feature Generating Networks for Zero-Shot Learning

Figure from Official Paper

Generalized Zero Shot Learning (GZSL)

Zero-shot learning aims to recognize objects whose instances may not have been seen during training. [1]
GZSL leverages semantic information of both seen (source) and unseen (target) classes to bridge the gap between both seen and unseen classes [2]

f-CLSWGAN Architecture Figure from Official Paper

Animal with Attributes 2 [3] dataset is used. This dataset contains 50 classes with 37322 images. 40 classes are divided into seen classes.
Original paper authors has shared the features maps from ResNet101 [4]. These feature maps are used for training.
Seen classes are splitted to train and test sets. trainval_loc indexes ares used for training, test_seen_loc indexes are used for testing.[4]
Unseen classes are not splitted. test_unseen_loc indexes are used for both training and testing. [4]

After training, models will be saved to defined path in config.py.
Training process without any pretrained models.
- python main.py --train
Pretrained models for any part of proposed approach can be used for fine-tuning.
- python main.py --train --g_cls_path <path/to/model>
- python main.py --train --g_cls_path <path/to/model> --wgan_G_path <path/to/model> --wgan_D_path <path/to/model> --projection_path <path/to/model>

Pretrained models can be used for evaluation
- python main.py --g_cls_path <path/to/model> --wgan_G_path <path/to/model> --wgan_D_path <path/to/model> --projection_path <path/to/model>