Share Email Print

Proceedings Paper

Training lightweight network from scratch for efficient object detection in aerial images
Author(s): Ang Su; Pengyu Guo; Banglei Guan
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Object detection in aerial images plays an important role for a wide range of applications. Although many efforts have been done in the last decade, it is still an active and challenging problem because of the highly complex backgrounds and the large variations in the visual appearance of objects caused by viewpoint variation, occlusion, illumination, etc. Recently, many object detectors based on deep learning demonstrate the great advantages for significantly improving the detection performance in aerial images. However, the most accuracy neural networks usually have hundreds of layers and thousands of channels, thus requiring huge computation and memory consumption. Besides, the state-of-the-art object detectors are usually fined-tuned from the models pretrained on classification dataset ImageNet, which limits the modification of network architecture and also leads to learning bias because of the different domains. In this paper we trained a lightweight convolutional neural network from scratch to perform object detection in aerial images. When designing the lightweight network, Concatenated Rectified Linear Units (CReLU) and depthwise separable convolution operation were employed to reduce the computation cost and model size. When training the lightweight network from scratch, we employ Group Normalization (GN) in each convolution layer, which makes smoother optimization landscape and has more stable gradients. A serial of ablation experiments is conducted on the recently published large-scale Dataset for Object detection in Aerial images (DOTA), and the results show that the proposed object detection methods with lightweight network trained from scratch achieves competitive performance but has smaller model size and lower computation cost.

Paper Details

Date Published: 7 October 2019
PDF: 8 pages
Proc. SPIE 11155, Image and Signal Processing for Remote Sensing XXV, 111551P (7 October 2019); doi: 10.1117/12.2535479
Show Author Affiliations
Ang Su, National Univ. of Defense Technology (China)
Pengyu Guo, Academy of Military Science (China)
Banglei Guan, National Univ. of Defense Technology (China)

Published in SPIE Proceedings Vol. 11155:
Image and Signal Processing for Remote Sensing XXV
Lorenzo Bruzzone; Francesca Bovolo, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?