Share Email Print

Proceedings Paper

Convolutional neural networks for recognition and segmentation of aluminum profiles
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Learning to automatically recognize objects in the real world is a very important and stimulating challenge. This work deals with the problem of detecting aluminium profiles within images, using hierarchical representations such as those based on deep learning methods. The use of regional CNN, a conceptually simple, flexible, and general framework for object instance segmentation, allows to exceed the previous state-of-the-art results. This approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. Neural network training uses ResNet networks of depth 50 or 101 layers. In particular, the training dataset consists of synthetic data generated by CAD files. The Dataset creation process is fundamental: experimental results show that trivial datasets lead to poor detection performance. A rich dataset, instead, including more complex images, allows the network to learn more and better guaranteeing excellent results. How to get more data, if you do not have more data? To get more data, we just need to make minor alterations such as flips, scale or rotations to existing dataset. This process is known as Data augmentation. The performance of the proposed system strongly depends on the dataset used for training and on the backbone architecture used. This why, adopting a strategy that generates a priori a very large dataset the time required to create the annotation file grows almost exponentially. The validation and test dataset, on the other hand, consists of real images captured by cameras. The distinctiveness of this work consists to train a deep neural network with synthetic data (aluminium CAD files) and verify if it on real data. Experiments show that the implementation of architecture, as described above, leads to good performance in automatic detection and classification. Future work will be addressed to improve the network training process together with the architecture, the algorithms and the dataset creation process. The latter is proved to be fundamental for the balance and optimization of the whole process. The way is to develop not much augmented datasets focusing on online data augmentation during network training.

Paper Details

Date Published: 21 June 2019
PDF: 11 pages
Proc. SPIE 11059, Multimodal Sensing: Technologies and Applications, 110590S (21 June 2019); doi: 10.1117/12.2525687
Show Author Affiliations
Pier Luigi Mazzeo, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
Arturo Argentieri, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
Federico De Luca, Univ. del Salento (Italy)
Paolo Spagnolo, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
Cosimo Distante, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
Marco Leo, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)
Pierluigi Carcagnì, Istituto di Scienze Applicate e Sistemi Intelligenti "Eduardo Caianiello" (Italy)

Published in SPIE Proceedings Vol. 11059:
Multimodal Sensing: Technologies and Applications
Ettore Stella, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?