PyTracking

A general python framework for visual object tracking and video object segmentation, based on PyTorch.

One tracking paper accepted at CVPR 2022!

Transforming Model Prediction for Tracking | Code available!

Two tracking/VOS papers accepted at ICCV 2021!

Learning Target Candidate Association to Keep Track of What Not to Track | Code available!
Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos | Code coming here soon...

Highlights

KeepTrack and DiMP Trackers

Official implementation of the KeepTrack (ICCV 2021), DiMP (ICCV 2019), including complete training code and trained models.

Tracking Libraries

Libraries for implementing and evaluating visual trackers. It includes

All common tracking and video object segmentation datasets.
Scripts to analyse tracker performance and obtain standard performance scores.
General building blocks, including deep networks, optimization, feature extraction and utilities for correlation filter tracking.

Training Framework: LTR

LTR (Learning Tracking Representations) is a general framework for training your visual tracking networks. It is equipped with

All common training datasets for visual object tracking and segmentation.
Functions for data sampling, processing etc.
Network modules for visual tracking.
And much more...

Model Zoo

The tracker models trained using PyTracking, along with their results on standard tracking benchmarks are provided in the model zoo.

Trackers

The toolkit contains the implementation of the following trackers.

KeepTrack (ICCV 2021)

[Paper] [Raw results] [Models] [Training Code] [Tracker Code]

Official implementation of KeepTrack. KeepTrack actively handles distractor objects to continue tracking the target. It employs a learned target candidate association network, that allows to propagate the identities of all target candidates from frame-to-frame. To tackle the problem of lacking groundtruth correspondences between distractor objects in visual tracking, it uses a training strategy that combines partial annotations with self-supervision.

DiMP (ICCV 2019)

[Paper] [Raw results] [Models] [Training Code] [Tracker Code]

Official implementation of the DiMP tracker. DiMP is an end-to-end tracking architecture, capable of fully exploiting both target and background appearance information for target model prediction. It is based on a target model prediction network, which is derived from a discriminative learning loss by applying an iterative optimization procedure. The model prediction network employs a steepest descent based methodology that computes an optimal step length in each iteration to provide fast convergence. The model predictor also includes an initializer network that efficiently provides an initial estimate of the model weights.

Installation

Clone the GIT repository.

git clone https://github.com/visionml/pytracking.git

Clone the submodules.

In the repository directory, run the commands:

git submodule update --init

Install dependencies

Run the installation script to install all the dependencies. You need to provide the conda install path (e.g. ~/anaconda3) and the name for the created conda environment (here pytracking).

bash install.sh conda_install_path pytracking

This script will also download the default networks and set-up the environment.

Note: The install script has been tested on an Ubuntu 18.04 system. In case of issues, check the detailed installation instructions.

Windows: (NOT Recommended!) Check these installation instructions.

Let's test it!

Activate the conda environment and run the script pytracking/run_webcam.py to run ATOM using the webcam input.

conda activate pytracking
cd pytracking
python run_webcam.py keep_track default

behnamh217rn21/KeepTrack