py-MDNet

by Hyeonseob Nam and Bohyung Han at POSTECH

Update (April, 2019)

Migration to Python 3.6 & PyTorch 1.0
Efficiency improvement (~5 fps)
ImageNet-VID pretraining
Code refactoring

Introduction

PyTorch implementation of MDNet, which runs at ~5 fps on a single CPU core and a single GPU (GTX 1080 Ti).

[Project] [Paper] [Matlab code]

If you're using this code for your research, please cite:

@InProceedings{nam2016mdnet,
author = {Nam, Hyeonseob and Han, Bohyung},
title = {Learning Multi-Domain Convolutional Neural Networks for Visual Tracking},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2016}
}

Results on OTB

Raw results of MDNet pretrained on VOT-OTB (VOT13, 14, 15, excluding OTB): Google drive link
Raw results of MDNet pretrained on ImageNet-VID: Google drive link

Prerequisites

Python 3.6+
OpenCV 3.0+
PyTorch 1.0+ and its dependencies
For GPU support: a GPU with ~3 GB of memory
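
One way to check an environment against the minimum versions listed above is sketched below. The helper names `parse_version`, `meets_minimum`, and `check_environment` are illustrative and not part of py-MDNet; the sketch assumes only that `cv2` and `torch` expose the usual `__version__` string.

```python
import sys


def parse_version(text):
    """Return the leading integer components of a version string.

    '1.0.1.post2' -> (1, 0, 1); '3.4.2+cu101' -> (3, 4, 2).
    """
    parts = []
    for piece in text.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(found, minimum):
    """Compare two version strings numerically, padding the shorter with zeros."""
    a, b = parse_version(found), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_environment():
    """Report, per prerequisite, whether it is importable and recent enough."""
    report = {"python": sys.version_info[:2] >= (3, 6)}
    for module_name, minimum in (("cv2", "3.0"), ("torch", "1.0")):
        try:
            module = __import__(module_name)
            report[module_name] = meets_minimum(module.__version__, minimum)
        except ImportError:
            report[module_name] = False
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(name, "OK" if ok else "missing or too old")
```

The comparison is numeric rather than string-based, so a version such as 1.10 is correctly treated as newer than 1.9.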

Usage

Tracking

 python tracking/run_tracker.py -s DragonBaby [-d (display fig)] [-f (save fig)]

You can provide a sequence configuration in two ways (see tracking/gen_config.py):
- python tracking/run_tracker.py -s [seq name]
- python tracking/run_tracker.py -j [json path]
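
For the JSON route, a configuration file could be generated along the lines of the sketch below. This is only an illustration: the key names (`seq_name`, `img_list`, `init_bbox`, `result_path`), the `[x, y, width, height]` box layout, and both helper functions are assumptions, so check them against what tracking/gen_config.py actually reads before relying on them.

```python
import json
import os


def build_sequence_config(seq_name, img_dir, init_bbox, result_path):
    """Collect one sequence's frames into a JSON-serializable dict.

    Every key name here is an assumption; align them with tracking/gen_config.py.
    """
    # Sort by file name so frames are listed in temporal order
    # (assumes zero-padded frame names such as 0001.jpg).
    frames = sorted(
        name for name in os.listdir(img_dir)
        if name.lower().endswith((".jpg", ".jpeg", ".png"))
    )
    return {
        "seq_name": seq_name,
        "img_list": [os.path.join(img_dir, name) for name in frames],
        "init_bbox": [float(v) for v in init_bbox],  # assumed [x, y, width, height]
        "result_path": result_path,
    }


def write_sequence_config(config, json_path):
    """Write the config dict to disk as indented JSON."""
    with open(json_path, "w") as handle:
        json.dump(config, handle, indent=2)
```

The resulting file would then be passed with `-j [json path]`, provided its keys match the ones the tracker expects.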

Pretraining

Download VGG-M (MatConvNet model) and save it as "models/imagenet-vgg-m.mat"

Pretraining on VOT-OTB

Download the VOT datasets into "datasets/VOT/vot201x"

 python pretrain/prepro_vot.py
 python pretrain/train_mdnet.py -d vot

Pretraining on ImageNet-VID

Download the ImageNet-VID dataset into "datasets/ILSVRC"

 python pretrain/prepro_imagenet.py
 python pretrain/train_mdnet.py -d imagenet
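
Both pretraining routes above depend on files being in place before the scripts run. A minimal pre-flight check might look like the following sketch. The `EXPECTED` table and the `missing_inputs` helper are hypothetical and not part of the repository; the paths are the ones named in this README, with the VOT check stopping at "datasets/VOT" because the per-year folder names are not spelled out here.

```python
import os

# Paths taken from the instructions above; the table itself is illustrative.
EXPECTED = {
    "vot": ["models/imagenet-vgg-m.mat", "datasets/VOT"],
    "imagenet": ["models/imagenet-vgg-m.mat", "datasets/ILSVRC"],
}


def missing_inputs(dataset, root="."):
    """Return the expected paths for `dataset` that do not exist under `root`."""
    return [
        path for path in EXPECTED[dataset]
        if not os.path.exists(os.path.join(root, path))
    ]


if __name__ == "__main__":
    for dataset in EXPECTED:
        missing = missing_inputs(dataset)
        status = "ready" if not missing else "missing: " + ", ".join(missing)
        print(dataset, status)
```

Run from the repository root, this lists what still needs to be downloaded for each `-d` option.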
