Share Email Print

Proceedings Paper

Benchmarking deep learning trackers on aerial videos
Author(s): Breton Minnehan; Anthony Salmin; Karl Salva; Andreas Savakis
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

In this paper, we benchmark five state-of-the-art trackers on aerial platform videos: Multi-domain Convolutional Neural Network (MDNET) tracker, which was the winner of the VOT2015 tracking challenge, the Fully Convolutional Neural network Tracker (FCNT), the Spatially Regularized Correlation Filter (SRDCF) tracker, the Continuous Convolution Operator Tracker (CCOT) tracker, which was the winner of the VOT2016 challenge, and the Tree structure Convolutional Neural Network (TCNN) tracker. We assess performance in terms of both tracking accuracy and processing speed based on two sets of videos: a subset of the OTB dataset where the cameras are located at a high vantage point and a new dataset of aerial videos captured by a moving platform. Our results indicate that these trackers performed as expected for the videos in the OTB subset, however, tracker performance degraded significantly in aerial videos due to target size, camera motion and target occlusions. The CCOT tracker yielded the best overall performance in terms of accuracy, while the SRDCF tracker was the fastest.

Paper Details

Date Published: 30 April 2018
PDF: 7 pages
Proc. SPIE 10649, Pattern Recognition and Tracking XXIX, 1064915 (30 April 2018); doi: 10.1117/12.2323866
Show Author Affiliations
Breton Minnehan, Rochester Institute of Technology (United States)
Anthony Salmin, Rochester Institute of Technology (United States)
Karl Salva, U.S. Air Force Research Lab. (United States)
Andreas Savakis, Rochester Institute of Technology (United States)

Published in SPIE Proceedings Vol. 10649:
Pattern Recognition and Tracking XXIX
Mohammad S. Alam, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?