Share Email Print

Proceedings Paper

Multisensor fusion techniques for tactical speaker recognition
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Multi-sensor fusion techniques have been widely used for target and object recognition, but are relatively unheard of in the speech processing community. Multi-sensor fusion deals with the combination of complementary, and sometimes contradictory, sensor data into a reliable estimate of the environment to achieve a sum which is better than the parts. Rome Laboratory developed a tactical speaker recognition algorithm which incorporates both feature and classifier fusion. The strategy is to exploit the fact that different classifies err in different ways and multiple features, like multiple sensors, can improve recognition performance over the performance of any one feature set (or even a composite feature set). The feature sets used are LPC cepstra, Hamming liftered cepstra, RASTA liftered cepstra, and delta cepstra. Each feature set is used to train separate classifiers: K Nearest Neighbor, Hypersphere, Multilayer Perceptron and Vector Quantization classifiers. This paper details experiments whereby the results of each feature and classifier pair are fused using two different methods: a simple voting scheme (or majority strategy) and a weighted voting scheme. Results are quoted on a simulated data set and the Rome Laboratory Greenflag tactical database.

Paper Details

Date Published: 6 April 1995
PDF: 12 pages
Proc. SPIE 2492, Applications and Science of Artificial Neural Networks, (6 April 1995); doi: 10.1117/12.205185
Show Author Affiliations
Laurie H. Fenstermacher, Rome Lab. (United States)

Published in SPIE Proceedings Vol. 2492:
Applications and Science of Artificial Neural Networks
Steven K. Rogers; Dennis W. Ruck, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?