/COMIC

Pytorch implementation of Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels

Primary LanguagePython

COMIC: A framework for Partial Labeling and Long-Tailed Multi-Label Classification (PTL-MLC)

version python pytorch

πŸ“„ About

This repository contains code for our accepted ICCV 2023 paper:

"Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels".

Abstract

Conventional multi-label classification (MLC) methods assume that all samples are fully labeled and identically distributed. Unfortunately, this assumption is unrealistic in large-scale MLC data that has long-tailed (LT) distribution and partial labels (PL). To address the problem, we introduce a novel task, Partial labeling and Long-Tailed Multi-Label Classification (PLT-MLC), to jointly consider the above two imperfect learning environments. Not surprisingly, we find that most LT-MLC and PL-MLC approaches fail to solve the PLT-MLC, resulting in significant performance degradation on the two proposed PLT-MLC benchmarks. end-to-end learning framework: COrrection $\rightarrow$ ModificatIon $\rightarrow$ balanCe, abbreviated as COMIC. Our bootstrapping philosophy is to simultaneously correct the missing labels (Correction) with convinced prediction confidence over a class-aware threshold and to learn from these recall labels during training. We next propose a novel multi-focal modifier loss that simultaneously addresses head-tail imbalance and positive-negative imbalance to adaptively modify the attention to different samples (Modification) under the LT class distribution. We also develop a balanced training strategy by measuring the model's optimization direction towards the head and tail samples, and thus design the balanced classifier (Balance) conditioned on the head and tail optimization to maintain a stable performance. Our experimental study shows that the proposed COMIC significantly outperforms the general MLC, LT-MLC and ML-MLC methods in terms of effectiveness and robustness on our newly created PLT-MLC datasets.

πŸŽ“ What’s In This Repo

The proposed dataset and framework implementations of our paper are as follows:

Environment

  • Python==3.8
  • Pytorch==1.9.1
  • cuda=10.2

PLT-MLC Datasets

We release all the PLT-MLC datasets on link. The data distribution and statistics of the developed $\texttt{PLT-COCO}$ dataset and $\texttt{PLT-VOC}$ dataset are shown in the figure below. Notably, we also upload the files of class distribtuion and label co-occurrence matrix in their corresponding link.

Pretrained Models

We also released the pretrained models on link (PLT-VOC dataset) and link (PLT-COCO dataset) . You can change the model_path in the testing code to verify the results.

Performance of the proposed COMIC on two proposed PLT-MLC datasets.

Model Many Shot Medium Shot Few Shot Total Shot
COMIC on PLT-COCO 49.21Β±0.22 60.08Β±0.13 55.36Β±0.21 55.08Β±0.14
COMIC on PLT-VOC 73.10Β±0.35 89.18Β±0.45 84.53Β±0.48 81.53Β±0.35

Run the Code

All the training and testing codes of our model are written in the shell.txt. For example, if you want to train our model with backbone resnet50 on PLT-COCO dataset, you can run the following command line.

Training

python plt_mlc_main.py --cfg /home/project/COMIC/config/coco/resnet50_plt.yaml

You require to edit the configurations in the resnet50_plt.yaml file.

 resnet50_plt.yaml
    β”œβ”€β”€ PerformanceLoss
    β”œβ”€β”€ num_components
    β”œβ”€β”€ training_opt
    β”‚   └── batch_size    
    β”‚   └── dataset 
    β”‚   └── distribution_path
    β”‚   └── co_occurrence_matrix    
    β”‚   └── train_annatation_path 
    β”‚   └── val_annatation_path 
    β”‚   └── train_data_path    
    β”‚   └── val_data_path 
    β”‚   └── feature_dim 
    β”‚   └── gpu_ids         

Testing

python test.py --cfg /home/project/COMIC/config/coco/resnet50_plt.yaml

🀝 Referencing and Citing

If you find our work useful in your research and would like to cite our project, please use the following citation: