This repository contains the source code for the paper presented at ICLR 2024
Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection. [Paper]
Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould
Feature shaping refers to a family of methods that exhibit state-of-the-art performance for out-of-distribution (OOD) detection. These approaches manipulate the feature representation, typically from the penultimate layer of a pre-trained deep learning model, so as to better differentiate between in-distribution (ID) and OOD samples. However, existing feature-shaping methods usually employ rules manually designed for specific model architectures and OOD datasets, which consequently limit their generalization ability. To address this gap, we first formulate an abstract optimization framework for studying feature-shaping methods. We then propose a concrete reduction of the framework with a simple piecewise constant shaping function and show that existing feature-shaping methods approximate the optimal solution to the concrete optimization problem. Further, assuming that OOD data is inaccessible, we propose a formulation that yields a closed-form solution for the piecewise constant shaping function, utilizing solely the ID data. Through extensive experiments, we show that the feature-shaping function optimized by our method improves the generalization ability of OOD detection across a large variety of datasets and model architectures.
git clone https://github.com/Qinyu-Allen-Zhao/OptFSOOD.git
- Download ImageNet-1k and place the training data and validation data in
<YOUR_DATA_PATH>/ImageNet_1k_2012/train
and<YOUR_DATA_PATH>/ImageNet_1k_2012/val
, respectively. We note that the structure of validation set was changed. Please use./utils/imagenet_folder_gen.py
to modify the validation set (you need to download the annotations for validation set as well). - Download the datasets we used. We provide a zip file on Google Drive.
- Unzip the file and edit the root directory of all datasets in the
./config/datasets.yaml
file.
- A zip file of model checkpoints is provided on Google Drive. We download most checkpoints from torchvision and Hugging Face, and trained two MLP-Mixer models for CIFAR datasets.
- Please download and unzip the file under this directory.
- If you are using your own model checkpoints, please modify
cfg.resume
accordingly in./config/experiments/
You can use the eval_ood.py
file to evaluate different OOD methods on different benchmarks. We also provide a sample script in ./script/example_imagenet_rn50.sh
for reference.
```
python eval_ood.py -h
usage: Evaluate different OOD detection methods. [-h] [-b BATCH_SIZE] [-d DETECTOR]
[-e {imagenet,cifar10,cifar100}]
[-m {rn50,mb,vit_b,vit_l,swin_s,swin_b,mix_b,mix_l,dense,mix_n}]
[-s] [-p P] [-g] [-r] [-o {vanilla,energy}]
options:
-h, --help show this help message and exit
-b BATCH_SIZE, --batch_size BATCH_SIZE
Batch Size. It should have no influence on results but on the IO speed.
-d DETECTOR, --detector DETECTOR
The OOD detection method.
-e {imagenet,cifar10,cifar100}, --experiment {imagenet,cifar10,cifar100}
The benchmark name.
-m {rn50,mb,vit_b,vit_l,swin_s,swin_b,mix_b,mix_l,dense,mix_n}, --model {rn50,mb,vit_b,vit_l,swin_s,swin_b,mix_b,mix_l,dense,mix_n}
The model for evaluation.
-s, --use_surrogate Use surrogate to estimate parameters in methods. See Appendix.B.
-p P, --p P Hyperparameters in prior methods. See Appendix.A.
-g, --is_gaussian Use Gaussian surrogate to estimate parameters in methods. See
Appendix.B.
-r, --use_real Use read OOD data to estimate parameters in methods. See Appendix.B.
-o {vanilla,energy}, --use_ood_score {vanilla,energy}
The vanilla method denoted “Ours (V)”, based on the maximum logit
score, and there is a variant based on energy scores.
```
Our method achieves state-of-the-art performance averaged on the ImageNet benchmark across a variety of OOD datasets and methods.
If you use our codebase or our results in your research, please cite our work:
@inproceedings{zhao2024towards,
author = {Zhao, Qinyu and Xu, Ming and Gupta, Kartik and Asthana, Akshay and Zheng, Liang and Gould, Stephen},
title = {Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection},
booktitle = {ICLR},
year = {2024}
}
This codebase refers to the OpenOOD v1.0 benchmark. For further details, visit OpenOOD GitHub Repository.
# v1.0 report
@article{yang2022openood,
author = {Yang, Jingkang and Wang, Pengyun and Zou, Dejian and Zhou, Zitang and Ding, Kunyuan and Peng, Wenxuan and Wang, Haoqi and Chen, Guangyao and Li, Bo and Sun, Yiyou and Du, Xuefeng and Zhou, Kaiyang and Zhang, Wayne and Hendrycks, Dan and Li, Yixuan and Liu, Ziwei},
title = {OpenOOD: Benchmarking Generalized Out-of-Distribution Detection},
year = {2022}
}