Share Email Print
cover

Proceedings Paper

Subset selection of training data for machine learning: a situational awareness system case study
Author(s): M. McKenzie; S. C. Wong
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

Recent advances in machine learning with big data sets has allowed for significant advances in the optimisation of classification and recognition systems. However, for applications such as situational awareness systems, the entirety of the available data dwarfs the amount permissible for a training set with tractable machine learning optimization times. Furthermore, the performance of any optimized system is highly dependent of the training set correctly and completely representing the entire data space of scenarios. In this paper we present a technique to characterize the entire data space to ascertain the key factors for representation and subsequently select a subset that statistically represents the correct mix of scenarios. We demonstrate the effectiveness of these characterization and subset selection techniques by using a genetic algorithm to optimize the performance of a gunfire recognition system.

Paper Details

Date Published: 11 May 2015
PDF: 8 pages
Proc. SPIE 9494, Next-Generation Robotics II; and Machine Intelligence and Bio-inspired Computation: Theory and Applications IX, 94940U (11 May 2015); doi: 10.1117/12.2176536
Show Author Affiliations
M. McKenzie, Defence Science and Technology Organisation (Australia)
S. C. Wong, Defence Science and Technology Organisation (Australia)


Published in SPIE Proceedings Vol. 9494:
Next-Generation Robotics II; and Machine Intelligence and Bio-inspired Computation: Theory and Applications IX
Misty Blowers; Dan Popa; Muthu B. J. Wijesundara, Editor(s)

© SPIE. Terms of Use
Back to Top