DukeMTMC-VideoReID [1] is a subset of the DukeMTMC tracking dataset [2] for video-based person re-identification.
The dataset consists of 702 identities for training, 702 identities for testing, and 408 identities as distractors. In total there are 2,196 videos for training and 2,636 videos for testing. Each video contains person images sampled every 12 frames. During testing, a video for each ID is used as the query and the remaining videos are placed in the gallery.
Data Offline. Since the privacy implications of the data set are being considered, we have to offline the download link for DukeMTMC-VideoReID. We are very sorry about it. There is currently no clear indication as to when, if ever, the data set will be allowed to be restored.
You may try the following download link to check whether the data set is restored officially. http://vision.cs.duke.edu/DukeMTMC/data/misc/DukeMTMC-VideoReID.zip
Directory | Description |
---|---|
./train | The training video tracklets. It contains 702 identities. |
./query | The query video tracklets. Each of them is from different identities in different cameras. |
./gallery | The gallery video tracklets. It contains 702 gallery identities and 408 distractors. |
Followings are the directory structure for DukeMTMC-VideoReID.
Splits
Person ids
Video tracklet ids
Frame bounding box images
For example, for one frame image train/0001/0003/0001_C6_F0099_X30823.jpg
, train
, 0001
, 0003
, and 0001_C6_F0099_X30823.jpg
are the split, person id, video tracklet id, and image frame name, respectively.
Naming Rules for image file.
For most frame bounding box images, e.g. 0001_C6_F0099_X30823.jpg
, "0001" is the identity. "C6" indicate Camera 6. "F0099" means it is the 99th frame within the tracklet. "X" indicates it is a normal image (otherwise, "D" for distractors) and " 30823" is the 30823th frame in the whole video of Camera 6.
The baseline model is an end-to-end ResNet-50 model with temporal average pooling (ETAP-Net).
More details about the ETAP-Net can be found in our CVPR2018 paper Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning.
- Python 3.6
- PyTorch (version >= 0.2.0)
- h5py, scikit-learn, metric-learn, tqdm
Move the downloaded dataset file DukeMTMC-VideoReID.zip
to ./data/
and unzip here.
python3 run.py --dataset DukeMTMC-VideoReID --logs_dir logs/DukeMTMC-VideoReID_baseline/ --max_frames 900 --batch_size 16
Method | Rank-1 | Rank-5 | Rank-20 | mAP |
ETAP-Net | 83.62 | 94.59 | 97.58 | 78.34 |
-
[1] Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning. Wu et al., CVPR 2018
-
[2] Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking. Ristani et al., ECCVWS 2016
Please cite the following two papers if this dataset helps your research.
@inproceedings{wu2018cvpr_oneshot,
title = {Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning},
author = {Wu, Yu and Lin, Yutian and Dong, Xuanyi and Yan, Yan and Ouyang, Wanli and Yang, Yi},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}
}
@inproceedings{ristani2016MTMC,
title = {Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking},
author = {Ristani, Ergys and Solera, Francesco and Zou, Roger and Cucchiara, Rita and Tomasi, Carlo},
booktitle = {European Conference on Computer Vision workshop on Benchmarking Multi-Target Tracking},
year = {2016}
}
Please refer to the license file for DukeMTMC-VideoReID and DukeMTMC.
If you have any questions about this dataset, please do not hesitate to contact me.