Share Email Print

Proceedings Paper

Systematic evaluation of deep learning based detection frameworks for aerial imagery
Author(s): Lars Sommer; Lucas Steinmann; Arne Schumann; Jürgen Beyerer
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Object detection in aerial imagery is crucial for many applications in the civil and military domain. In recent years, deep learning based object detection frameworks significantly outperformed conventional approaches based on hand-crafted features on several datasets. However, these detection frameworks are generally designed and optimized for common benchmark datasets, which considerably differ from aerial imagery especially in object sizes. As already demonstrated for Faster R-CNN, several adaptations are necessary to account for these differences. In this work, we adapt several state-of-the-art detection frameworks including Faster R-CNN, R-FCN, and Single Shot MultiBox Detector (SSD) to aerial imagery. We discuss adaptations that mainly improve the detection accuracy of all frameworks in detail. As the output of deeper convolutional layers comprise more semantic information, these layers are generally used in detection frameworks as feature map to locate and classify objects. However, the resolution of these feature maps is insufficient for handling small object instances, which results in an inaccurate localization or incorrect classification of small objects. Furthermore, state-of-the-art detection frameworks perform bounding box regression to predict the exact object location. Therefore, so called anchor or default boxes are used as reference. We demonstrate how an appropriate choice of anchor box sizes can considerably improve detection performance. Furthermore, we evaluate the impact of the performed adaptations on two publicly available datasets to account for various ground sampling distances or differing backgrounds. The presented adaptations can be used as guideline for further datasets or detection frameworks.

Paper Details

Date Published: 30 April 2018
PDF: 13 pages
Proc. SPIE 10648, Automatic Target Recognition XXVIII, 1064803 (30 April 2018); doi: 10.1117/12.2304768
Show Author Affiliations
Lars Sommer, Karlsruher Institut für Technologie (Germany)
Fraunhofer-Institut für Optronik, Systemtechnik und Bildauswertung (Germany)
Lucas Steinmann, Fraunhofer-Institut für Optronik, Systemtechnik und Bildauswertung (Germany)
Arne Schumann, Fraunhofer-Institut für Optronik, Systemtechnik und Bildauswertung (Germany)
Jürgen Beyerer, Fraunhofer-Institut für Optronik, Systemtechnik und Bildauswertung (Germany)
Karlsruher Institut für Technologie (Germany)

Published in SPIE Proceedings Vol. 10648:
Automatic Target Recognition XXVIII
Firooz A. Sadjadi; Abhijit Mahalanobis, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?