Deep Learning for Visual Tracking: A Comprehensive Survey (Extended version of the paper on arXiv)

This repository provides comprehensive comparisons of recent Deep Learning (DL)-based visual tracking methods on the OTB-2013, OTB-2015, VOT-2018, and LaSOT datasets (Raw Results on OTB Dataset, Raw Results on VOT Dataset, Raw Results on LaSOT Dataset).

Performance comparisons will be updated soon with more datasets and methods.

===============================================================================

License: Copyright 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

===============================================================================

Performance Comparison of Visual Trackers in terms of Precision and Success Plots on OTB-2013 Dataset (Ranking based on Area Under Curve (AUC)) [OTB-2013 Dataset] [OTB-2013 Paper]:

- Average Performance Comparisons of Visual Tracking Methods (a sketch of the success-plot AUC computation follows this list):

- Attribute-based Performance Comparisons (eleven attributes: Illumination Variation (IV), Scale Variation (SV), Occlusion (OCC), Deformation (DEF), Motion Blur (MB), Fast Motion (FM), In-Plane Rotation (IPR), Out-of-Plane Rotation (OPR), Out-of-View (OV), Background Clutter (BC), and Low Resolution (LR)):
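The success plot used for the AUC ranking above measures, for every overlap threshold, the fraction of frames whose predicted box overlaps the ground truth by more than that threshold. Below is a minimal sketch of this computation, assuming (x, y, w, h) boxes; the helper names `iou` and `success_curve` are ours, not part of the OTB toolkit.

```python
import numpy as np

def iou(box_a, box_b):
    """Intersection-over-union of two boxes given as (x, y, w, h)."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2 = min(box_a[0] + box_a[2], box_b[0] + box_b[2])
    y2 = min(box_a[1] + box_a[3], box_b[1] + box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0

def success_curve(pred_boxes, gt_boxes, thresholds=np.linspace(0, 1, 21)):
    """Success plot: fraction of frames with overlap above each threshold."""
    overlaps = np.array([iou(p, g) for p, g in zip(pred_boxes, gt_boxes)])
    return np.array([(overlaps > t).mean() for t in thresholds])

# The AUC score used for ranking is the mean of the curve over the
# sampled thresholds:
# auc = success_curve(pred_boxes, gt_boxes).mean()
```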

Performance Comparison of Visual Trackers in terms of Precision and Success Plots on OTB-2015 Dataset (Ranking based on Area Under Curve (AUC)) [OTB-2015 Dataset][OTB-2015 Paper]:

- Average Performance Comparisons of Visual Tracking Methods (a sketch of the precision-plot computation follows this list):

- Attribute-based Performance Comparisons (eleven attributes: Illumination Variation (IV), Scale Variation (SV), Occlusion (OCC), Deformation (DEF), Motion Blur (MB), Fast Motion (FM), In-Plane Rotation (IPR), Out-of-Plane Rotation (OPR), Out-of-View (OV), Background Clutter (BC), and Low Resolution (LR)):
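Precision plots are built the same way from the center location error instead of the overlap. A minimal sketch under the same (x, y, w, h) box assumption; by OTB convention, the single reported precision score is the curve value at the 20-pixel threshold.

```python
import numpy as np

def center_error(box_a, box_b):
    """Euclidean distance between the centers of two (x, y, w, h) boxes."""
    ax, ay = box_a[0] + box_a[2] / 2.0, box_a[1] + box_a[3] / 2.0
    bx, by = box_b[0] + box_b[2] / 2.0, box_b[1] + box_b[3] / 2.0
    return np.hypot(ax - bx, ay - by)

def precision_curve(pred_boxes, gt_boxes, thresholds=np.arange(0, 51)):
    """Precision plot: fraction of frames whose center error falls
    within each pixel threshold."""
    errors = np.array([center_error(p, g) for p, g in zip(pred_boxes, gt_boxes)])
    return np.array([(errors <= t).mean() for t in thresholds])

# Reported precision score (20-pixel convention):
# precision = precision_curve(pred_boxes, gt_boxes)[20]
```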

Performance Comparison of Visual Trackers on VOT-2018 Dataset [VOT-2018 Dataset][VOT-2018 Paper]:

Experiment Baseline (Expected Overlap Analysis):

- Expected overlap curves:

- Expected overlap scores:

- Overview: Expected Overlap Analysis:
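To make the plots above easier to read: for each sequence length Ns, the expected overlap curve gives the overlap a tracker can be expected to achieve, on average, over the first Ns frames of a run, and the scalar expected average overlap (EAO) score averages this curve over an interval of typical sequence lengths. The sketch below is a simplified reconstruction of that idea, not the VOT toolkit's exact procedure (the toolkit's failure handling and interval bounds are more involved).

```python
import numpy as np

def expected_overlap_curve(per_run_overlaps, max_len):
    """Expected average overlap as a function of sequence length Ns.

    per_run_overlaps: one per-frame IoU array per tracking run; frames
    after a failure are treated as zero overlap (here via zero-padding).
    """
    curve = np.zeros(max_len)
    for ns in range(1, max_len + 1):
        # Mean overlap over the first ns frames of every run, padding
        # runs that failed before frame ns with zeros.
        means = [np.pad(o, (0, max(0, ns - len(o))))[:ns].mean()
                 for o in per_run_overlaps]
        curve[ns - 1] = np.mean(means)
    return curve

# The scalar EAO score averages the curve over an interval [n_lo, n_hi]
# of typical sequence lengths chosen by the toolkit:
# eao = expected_overlap_curve(runs, n_hi)[n_lo - 1:n_hi].mean()
```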

Experiment Baseline (Accuracy-Robustness (AR) Ranking):

- AR plot (mean):

- AR plot (weighted_mean):

- AR plot (pooled):

- Table: Accuracy:

- Table: Robustness:
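The AR plots above place each tracker in a two-dimensional space of accuracy (average overlap while tracking succeeds) and robustness (derived from the number of failures). Below is a simplified per-sequence sketch under the reset-based supervised protocol; it ignores the burn-in frames the VOT toolkit discards after each re-initialization.

```python
import numpy as np

def accuracy_robustness(segment_overlaps):
    """Per-sequence accuracy and failure count under a reset-based protocol.

    segment_overlaps: list of per-frame IoU arrays, one per tracking
    segment between re-initializations (each re-init marks one failure).
    """
    accuracy = np.concatenate(segment_overlaps).mean()  # mean IoU while tracking
    failures = len(segment_overlaps) - 1                # extra segments = failures
    return accuracy, failures
```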

Experiment Baseline (Attribute-based Ranking: Camera Motion, Illumination Change, Motion Change, Occlusion, Size Change, No Degradation):

- Orderings for overall overlap:

- Orderings for failures:

- AR plot for camera motion:

- AR plot for illumination change:

- AR plot for motion change:

- AR plot for occlusion:

- AR plot for size change:

- AR plot for no degradation:

Experiment Baseline (Speed Report) [the first- to third-ranked methods are shown in yellow, blue, and green, respectively]:

- Raw Frames per Second (FPS):

- Normalized (Equivalent Filter Operations (EFO): "tracker speed in terms of a predefined filtering operation that the VOT toolkit automatically carries out prior to running the experiments"):
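As the quoted definition indicates, EFO normalizes speed by timing a fixed filtering operation on the benchmark machine, so results are comparable across hardware. The sketch below illustrates the idea only; the max filter and the 600x600 / 30x30 sizes are our assumptions, not the toolkit's exact constants, and a real measurement would average several timing runs.

```python
import time
import numpy as np
from scipy.ndimage import maximum_filter

def baseline_op_seconds():
    """Time one predefined filtering operation used as a hardware probe
    (illustrative sizes, not the VOT toolkit's exact constants)."""
    img = np.random.rand(600, 600)
    t0 = time.perf_counter()
    maximum_filter(img, size=30)
    return time.perf_counter() - t0

def frame_time_in_efo_units(seconds_per_frame):
    """Per-frame tracking time expressed as multiples of the baseline
    operation; the normalized speed reported above is its inverse."""
    return seconds_per_frame / baseline_op_seconds()
```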

Experiment Unsupervised (Overall Comparisons):

- Average overlap:

- Table: Overlap Overview:

- Orderings for overall overlap:
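In the unsupervised experiment the tracker runs through each sequence once, without re-initialization after failures, so the average-overlap score above is simply the mean per-frame IoU (frames where the target is lost contribute zero). A one-function sketch:

```python
import numpy as np

def unsupervised_average_overlap(overlaps):
    """No-reset score: mean per-frame IoU over the whole sequence,
    with lost-target frames counted as zero overlap."""
    return float(np.mean(overlaps))
```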

Experiment Unsupervised (Attribute-based Comparisons):

- Overlap plot for camera motion:

- Overlap plot for illumination change:

- Overlap plot for motion change:

- Overlap plot for occlusion:

- Overlap plot for size change:

- Overlap plot for no degradation:

Experiment Unsupervised (Speed Report) [the first- to third-ranked methods are shown in yellow, blue, and green, respectively]:

- Raw Frames per Second (FPS):

- Normalized (Equivalent Filter Operations (EFO): "tracker speed in terms of a predefined filtering operation that the VOT toolkit automatically carries out prior to running the experiments"):

Overall Comparison:

- Report Overview:

Qualitative Comparisons of State-of-the-Art Visual Tracking Methods on the VOT-2018 Dataset (Under the TraX Protocol):

BMX_Video, Crabs1_Video, Gymnastics3_Video, Motocross2_Video, Singer3_Video, Godfather_Video, Bag_Video, Dinosaur_Video, Matrix_Video, Hand_Video, Glove_Video, Ball2_Video, Blanket_Video, Gymnastics1_Video, Butterfly_Video, Motocross1_Video, Pedestrian_Video, Singer2_Video, Shaking_Video, Racing_Video, Handball1_Video, Sheep_Video, Bolt1_Video, Fernando_Video, Bolt2_Video, Book_Video, Leaves_Video, Fish1_Video, Fish2_Video, Tiger_Video, Wiper_Video, Traffic_Video, Crossing_Video, Fish3_Video, Ball1_Video, Graduate_Video, Iceskater_Video, Soldier_Video, DroneAcross_Video, Soccer2_Video, DroneFlip_Video, Ants1_Video, Handball2_Video, Nature_Video, Ants3_Video, Road_Video, Helicopter_Video, Girl_Video, Gymnastics2_Video, Conduction_Video, Zebrafish1_Video, Basketball_Video, Frisbee_Video, Car1_Video, Birds1_Video, Drone1_Video, Flamingo_Video.

Performance Comparison of Visual Trackers in terms of Precision and Success Plots on LaSOT Dataset (Ranking based on Area Under Curve (AUC)) [LaSOT Dataset][LaSOT Paper]:

- Average Performance Comparisons of Visual Tracking Methods:

- Attribute-based Performance Comparisons (fourteen attributes: Illumination Variation (IV), Scale Variation (SV), Deformation (DEF), Motion Blur (MB), Fast Motion (FM), Out-of-View (OV), Background Clutter (BC), Low Resolution (LR), Aspect Ratio Change (ARC), Camera Motion (CM), Full Occlusion (FOC), Partial Occlusion (POC), Viewpoint Change (VC), and Rotation (ROT)):

References (Experimentally Evaluated Visual Tracking Methods):

[1] C. Ma, J. B. Huang, X. Yang, and M. H. Yang, “Hierarchical convolutional features for visual tracking,” in Proc. IEEE ICCV, 2015, pp. 3074–3082. [HCFT]

[2] M. Danelljan, G. Hager, F. S. Khan, and M. Felsberg, “Convolutional features for correlation filter based visual tracking,” in Proc. IEEE ICCVW, 2016, pp. 621–629. [DeepSRDCF]

[3] M. Danelljan, A. Robinson, F. S. Khan, and M. Felsberg, “Beyond correlation filters: Learning continuous convolution operators for visual tracking,” in Proc. ECCV, vol. 9909 LNCS, 2016, pp. 472–488. [CCOT]

[4] L. Bertinetto, J. Valmadre, J. F. Henriques, A. Vedaldi, and P. H. Torr, “Fully-convolutional Siamese networks for object tracking,” in Proc. ECCV, 2016, pp. 850–865. [SiamFC]

[5] R. Tao, E. Gavves, and A. W. Smeulders, “Siamese instance search for tracking,” in Proc. IEEE CVPR, 2016, pp. 1420–1429. [SINT]

[6] H. Nam and B. Han, “Learning multi-domain convolutional neural networks for visual tracking,” in Proc. IEEE CVPR, 2016, pp. 4293–4302. [MDNet]

[7] Y. Qi, S. Zhang, L. Qin, H. Yao, Q. Huang, J. Lim, and M. H. Yang, “Hedged deep tracking,” in Proc. IEEE CVPR, 2016, pp. 4303–4311. [HDT]

[8] H. Fan and H. Ling, “Parallel tracking and verifying: A framework for real-time and high accuracy visual tracking,” in Proc. IEEE ICCV, 2017, pp. 5487–5495. [PTAV]

[9] H. Fan and H. Ling, “Parallel tracking and verifying,” IEEE Trans. Image Process., vol. 28, no. 8, pp. 4130–4144, 2019. [PTAV]

[10] Z. Zhu, G. Huang, W. Zou, D. Du, and C. Huang, “UCT: Learning unified convolutional networks for real-time visual tracking,” in Proc. IEEE ICCVW, 2018, pp. 1973–1982. [UCT]

[11] Q. Guo, W. Feng, C. Zhou, R. Huang, L. Wan, and S. Wang, “Learning dynamic Siamese network for visual object tracking,” in Proc. IEEE ICCV, 2017, pp. 1781–1789. [DSiam]

[12] J. Valmadre, L. Bertinetto, J. Henriques, A. Vedaldi, and P. H. Torr, “End-to-end representation learning for correlation filter based tracking,” in Proc. IEEE CVPR, 2017, pp. 5000–5008. [CFNet]

[13] M. Danelljan, G. Bhat, F. Shahbaz Khan, and M. Felsberg, “ECO: Efficient convolution operators for tracking,” in Proc. IEEE CVPR, 2017, pp. 6931–6939. [ECO]

[14] A. Lukežič, T. Vojíř, L. Čehovin Zajc, J. Matas, and M. Kristan, “Discriminative correlation filter tracker with channel and spatial reliability,” IJCV, vol. 126, no. 7, pp. 671–688, 2018. [DeepCSRDCF]

[15] T. Zhang, C. Xu, and M. H. Yang, “Multi-task correlation particle filter for robust object tracking,” in Proc. IEEE CVPR, 2017, pp. 4819–4827. [MCPF]

[16] J. Choi, H. J. Chang, S. Yun, T. Fischer, Y. Demiris, and J. Y. Choi, “Attentional correlation filter network for adaptive visual tracking,” in Proc. IEEE CVPR, 2017, pp. 4828–4837. [ACFN]

[17] Q. Wang, J. Gao, J. Xing, M. Zhang, and W. Hu, “DCFNet: Discriminant correlation filters network for visual tracking,” 2017. [Online]. [DCFNet][DCFNet2]

[18] X. Dong and J. Shen, “Triplet loss in Siamese network for object tracking,” in Proc. ECCV, vol. 11217 LNCS, 2018, pp. 472–488. [TripletLoss-CFNet][TripletLoss-SiamFC][TripletLoss-CFNet2]

[19] G. Bhat, J. Johnander, M. Danelljan, F. S. Khan, and M. Felsberg, “Unveiling the power of deep tracking,” in Proc. ECCV, 2018, pp. 493–509. [UPDT]

[20] Z. Zhu, Q. Wang, B. Li, W. Wu, J. Yan, and W. Hu, “Distractor-aware Siamese networks for visual object tracking,” in Proc. ECCV, vol. 11213 LNCS, 2018, pp. 103–119. [DaSiamRPN]

[21] Y. Zhang, L. Wang, J. Qi, D. Wang, M. Feng, and H. Lu, “Structured Siamese network for real-time visual tracking,” in Proc. ECCV, 2018, pp. 355–370. [StructSiam]

[22] H. Morimitsu, “Multiple context features in Siamese networks for visual object tracking,” in Proc. ECCVW, 2019, pp. 116–131. [Siam-MCF]

[23] J. Choi, H. J. Chang, T. Fischer, S. Yun, K. Lee, J. Jeong, Y. Demiris, and J. Y. Choi, “Context-aware deep feature compression for high-speed visual tracking,” in Proc. IEEE CVPR, 2018, pp. 479–488. [TRACA]

[24] Y. Song, C. Ma, X. Wu, L. Gong, L. Bao, W. Zuo, C. Shen, R. W. Lau, and M. H. Yang, “VITAL: Visual tracking via adversarial learning,” in Proc. IEEE CVPR, 2018, pp. 8990–8999. [VITAL]

[25] F. Li, C. Tian, W. Zuo, L. Zhang, and M. H. Yang, “Learning spatial-temporal regularized correlation filters for visual tracking,” in Proc. IEEE CVPR, 2018, pp. 4904–4913. [DeepSTRCF]

[26] B. Li, J. Yan, W. Wu, Z. Zhu, and X. Hu, “High performance visual tracking with Siamese region proposal network,” in Proc. IEEE CVPR, 2018, pp. 8971–8980. [SiamRPN]

[27] A. He, C. Luo, X. Tian, and W. Zeng, “A twofold Siamese network for real-time object tracking,” in Proc. IEEE CVPR, 2018, pp. 4834–4843. [SA-Siam]

[28] C. Sun, D. Wang, H. Lu, and M. H. Yang, “Learning spatial-aware regressions for visual tracking,” in Proc. IEEE CVPR, 2018, pp. 8962–8970. [LSART]

[29] C. Sun, D. Wang, H. Lu, and M. H. Yang, “Correlation tracking via joint discrimination and reliability learning,” in Proc. IEEE CVPR, 2018, pp. 489–497. [DRT]

[30] S. Pu, Y. Song, C. Ma, H. Zhang, and M. H. Yang, “Deep attentive tracking via reciprocative learning,” in Proc. NIPS, 2018, pp. 1931–1941. [DAT]

[31] C. Ma, J. B. Huang, X. Yang, and M. H. Yang, “Robust visual tracking via hierarchical convolutional features,” IEEE Trans. Pattern Anal. Mach. Intell., 2018. [HCFTs]

[32] C. Ma, J. B. Huang, X. Yang, and M. H. Yang, “Adaptive correlation filters with long-term and short-term memory for object tracking,” IJCV, vol. 126, no. 8, pp. 771–796, 2018. [LCTdeep]

[33] E. Gundogdu and A. A. Alatan, “Good features to correlate for visual tracking,” IEEE Trans. Image Process., vol. 27, no. 5, pp. 2526–2540, 2018. [CFCF]

[34] H. Fan and H. Ling, “Siamese cascaded region proposal networks for real-time visual tracking,” in Proc. IEEE CVPR, 2019. [C-RPN]

[35] J. Gao, T. Zhang, and C. Xu, “Graph convolutional tracking,” in Proc. IEEE CVPR, 2019, pp. 4649–4659. [GCT]

[36] Q. Wang, L. Zhang, L. Bertinetto, W. Hu, and P. H. S. Torr, “Fast online object tracking and segmentation: A unifying approach,” in Proc. IEEE CVPR, 2019. [SiamMask]

[37] B. Li, W. Wu, Q. Wang, F. Zhang, J. Xing, and J. Yan, “SiamRPN++: Evolution of Siamese visual tracking with very deep networks,” in Proc. IEEE CVPR, 2019. [SiamRPN++]

[38] X. Li, C. Ma, B. Wu, Z. He, and M.-H. Yang, “Target-aware deep tracking,” in Proc. IEEE CVPR, 2019. [TADT]

[39] K. Dai, D. Wang, H. Lu, C. Sun, and J. Li, “Visual tracking via adaptive spatially-regularized correlation filters,” in Proc. IEEE CVPR, 2019, pp. 4670–4679. [ASRCF]

[40] Z. Zhang and H. Peng, “Deeper and wider Siamese networks for real-time visual tracking,” in Proc. IEEE CVPR, 2019. [SiamDW-SiamRPN][SiamDW-SiamFC]

[41] Y. Song, C. Ma, L. Gong, J. Zhang, R. W. Lau, and M. H. Yang, “CREST: Convolutional residual learning for visual tracking,” in Proc. IEEE ICCV, 2017, pp. 2574–2583. [CREST][Meta-CREST]