/DMF

Primary LanguagePythonMIT LicenseMIT

License: MIT Python GitHub stars

DMF

Deep Multimodal Fusion for Generalizable Person Reidentification

This is the official implementation of our paper Deep Multimodal Fusion for Generalizable Person Re-identification. And the pretrained models can be downloaded from data2vec.

News

  • Support Market1501, CUHK03, MSMT17 and RandPerson datasets.

TODO

Write the documents.

Requirements

  • python2
  • torch
  • torchvision
  • timm
  • yacs
  • opencv-python
  • fairseq

How to use it?

This repo. supports training on multiple GPUs and the default setting is single GPU (One RTX 3090 GPU).

  1. Download all necessry datasets (e.g. Market1501, CUHK03 and MSMT17 datasets) and move them to 'data'.
  2. Training
python train.py
  1. Testing
python test.py

Experiment Results on Market-1501, CUHK03 and MSMT17 datasets.

Dataset for fine-tuning Market-1501 CUHK03 MSMT17 Settings
Rank-1 mAP Rank-1 mAP Rank-1 mAP
Market-1501----23.422.650.621.51GPU
MSMT1781.355.126.124.7----1GPU
MSMT17all82.658.834.032.1----1GPU
RandPerson78.752.021.519.352.418.91GPU

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Projects (Grant No. 61977045 and Grant No. 81974276). If you have further questions and suggestions, please feel free to contact us (xiangsuncheng17@sjtu.edu.cn).

If you find this code useful in your research, please consider citing:

@article{xiang2022deep,
  title={Deep Multimodal Fusion for Generalizable Person Re-identification},
  author={Xiang, Suncheng and Chen, Hao and Ran, Wei and Yu, Zefang and Liu, Ting and Qian, Dahong and Fu, Yuzhuo},
  journal={arXiv preprint arXiv:2211.00933},
  year={2022},
}