Share Email Print
cover

Proceedings Paper • new

Power law scaling of test error versus number of training images for deep convolutional neural networks
Author(s): Vittorio Sala
Format Member Price Non-Member Price
PDF $17.00 $21.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

The highest accuracy in image classification is of utmost importance for the industrial application of algorithms based on convolutional neural networks. Empirically, it is sometimes possible to improve accuracy by increasing the size of the training set. In this work, the scaling of the test accuracy versus the size of the training set was studied for different networks. First, a network with a depth of few layers was initialized with random parameters and trained on subsets of images of variable size sampled from the MNIST dataset of handwritten digits and MNIST fashion dataset of clothes and accessories. The scaling of the accuracy versus the size of the training set may be described as the sum of two components: a power law and an offset independent on the size of the training set. Exponent of the power law appears to be the same in both dataset and independent on seeds, initial weights and number of convolutional filters. Then, the scaling of the accuracy versus the size of training set has been evaluated on a dataset of pictures of paintings, sacred icons and sculptures with the goal to correctly classify unknown pictures. The networks chosen are the ones implemented in the machine vision library Halcon 18.11, including two convolutional neural networks with unknown topology and Resnet50, pretrained on industrial images. The scaling of the accuracy versus the size of the training set seems to be compatible with the power law scaling observed on the few layers network trained on MNIST.

Paper Details

Date Published: 21 June 2019
PDF: 5 pages
Proc. SPIE 11059, Multimodal Sensing: Technologies and Applications, 1105914 (21 June 2019); doi: 10.1117/12.2525811
Show Author Affiliations
Vittorio Sala, iMAGE S S.p.A. (Italy)


Published in SPIE Proceedings Vol. 11059:
Multimodal Sensing: Technologies and Applications
Ettore Stella, Editor(s)

© SPIE. Terms of Use
Back to Top