Share Email Print
cover

Proceedings Paper

Speaker identification for the improvement of the security communication between law enforcement units
Author(s): Jaromir Tovarek; Pavol Partila
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

This article discusses the speaker identification for the improvement of the security communication between law enforcement units. The main task of this research was to develop the text-independent speaker identification system which can be used for real-time recognition. This system is designed for identification in the open set. It means that the unknown speaker can be anyone. Communication itself is secured, but we have to check the authorization of the communication parties. We have to decide if the unknown speaker is the authorized for the given action. The calls are recorded by IP telephony server and then these recordings are evaluate using classification If the system evaluates that the speaker is not authorized, it sends a warning message to the administrator. This message can detect, for example a stolen phone or other unusual situation. The administrator then performs the appropriate actions. Our novel proposal system uses multilayer neural network for classification and it consists of three layers (input layer, hidden layer, and output layer). A number of neurons in input layer corresponds with the length of speech features. Output layer then represents classified speakers. Artificial Neural Network classifies speech signal frame by frame, but the final decision is done over the complete record. This rule substantially increases accuracy of the classification. Input data for the neural network are a thirteen Mel-frequency cepstral coefficients, which describe the behavior of the vocal tract. These parameters are the most used for speaker recognition. Parameters for training, testing and validation were extracted from recordings of authorized users. Recording conditions for training data correspond with the real traffic of the system (sampling frequency, bit rate). The main benefit of the research is the system developed for text-independent speaker identification which is applied to secure communication between law enforcement units.

Paper Details

Date Published: 2 May 2017
PDF: 8 pages
Proc. SPIE 10200, Signal Processing, Sensor/Information Fusion, and Target Recognition XXVI, 102001C (2 May 2017); doi: 10.1117/12.2261796
Show Author Affiliations
Jaromir Tovarek, VŠB-Technical Univ. of Ostrava (Czech Republic)
Pavol Partila, VŠB-Technical Univ. of Ostrava (Czech Republic)


Published in SPIE Proceedings Vol. 10200:
Signal Processing, Sensor/Information Fusion, and Target Recognition XXVI
Ivan Kadar, Editor(s)

© SPIE. Terms of Use
Back to Top