
Proceedings Paper

Deep learning concepts and datasets for image recognition: overview 2019
Author(s): Karel Horak; Robert Sablatnig

Paper Abstract

In the first part of this paper, we present the basics of deep learning and an overview of well-known deep learning architectures: general Convolutional Neural Networks, the R-CNN family, the Single Shot MultiBox Detector, the You Only Look Once architecture, and RetinaNet. All of these architectures are described so that they can be quickly compared with one another regarding their suitability for a given general task. The subsequent chapters describe in more detail several datasets often used in deep learning competitions. The best known of the listed datasets in practical use are COCO, KITTI, PascalVOC and Cityscapes. The overview serves as a comparison of state-of-the-art deep learning methods.

Paper Details

Date Published: 14 August 2019
PDF: 8 pages
Proc. SPIE 11179, Eleventh International Conference on Digital Image Processing (ICDIP 2019), 111791S (14 August 2019); doi: 10.1117/12.2539806
Karel Horak, Brno Univ. of Technology (Czech Republic)
Robert Sablatnig, Technische Univ. Wien (Austria)

Published in SPIE Proceedings Vol. 11179:
Eleventh International Conference on Digital Image Processing (ICDIP 2019)
Jenq-Neng Hwang; Xudong Jiang, Editor(s)
