/PyMDNet

A tensorflow implementation of [Hyeonseob Nam and Bohyung Han, Learning Multi-Domain Convolutional Neural Networks for Visual Tracking, CVPR2016]

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

PyMDNet

by HungWei-Andy @ NTU DISPLab

Introduction

Python (tensorflow) implementation of Hyeonseob Nam and Bohyung Han, Learning Multi-Domain Convolutional Neural Networks, CVPR2016.

Requirements

Python 2.7
numpy>=1.12.1
tensorflow-gpu==1.0.0
matplotlib>=2.0.1
skimage>=0.13.0
Pillow>=2.2.1

Data Directory

Please download and put the video files of OTB and VOT in the directory 'data' with structure as follows:

.
├── data
|     ├── otb
|     |    ├── Basketball
|     |    └── ...
|     | 
|     └── vot
|          ├── vot2013
|          |      ├── bicycle
|          |      └── ...
|          | 
|          ├── vot2014
|          |      ├── ball
|          |      └── ...
|          |
|          └── vot2015
|                 ├── bag
|                 └── ...
|
├── models
└── README.md

Initial VGG-M Model

The initial model is converted from caffe VGG-M model into .npy file using caffe-tensorflow library provided by ethereon.

To download the initial model, run download.sh directly.

  bash download.sh

Pretrained Models

We have pretrained the mdoel on vot dataset, otb dataset, and both. To download the pretrained model, run download_trained.sh directly.

  bash download_trained.sh

Usage

Tracking

  python tracking.py --dataset dataset --seq sequence --load_path path [--no_display]

Pretraining

  bash pretrain.sh