Proceedings Paper

Wake-up-word speech recognition application for first responder communication enhancement
Author(s): Veton Këpuska; Jason Breitfeller

Paper Abstract

Speech Recognition (SR) systems have historically proven cumbersome and insufficiently accurate for a range of applications. The ultimate goal of our proposed technology is to fundamentally change the way current SR systems interact with humans and to develop an application that is extremely hardware efficient. Accurate SR with reasonable hardware requirements will afford the average first responder, e.g., a police officer, a true breakthrough technology that will change the way an officer performs his or her duties. The presented technology provides a cutting-edge solution for human-machine interaction through a properly solved Wake-Up-Word (WUW) SR problem. This paradigm shift provides the basis for developing SR systems with truly "Voice Activated" capabilities, impacting all SR-based technologies and the way in which humans interact with computers. It is a radical departure from the "push-to-talk" paradigm currently applied to all speech-to-text or speech-recognition applications. Achieving this goal requires a significantly more accurate pattern classification and scoring technique, which in turn gives SR systems enhanced performance for both correct recognition (i.e., minimization of false rejection) and correct rejection (i.e., minimization of false acceptance). A revolutionary and innovative classification and scoring technique is used that significantly enhances an earlier method presented in reference [1]. The solution in reference [1] has been demonstrated to meet the stringent requirements of the WUW-SR task. The advanced solution of [1] is a novel technique that is model and algorithm independent; it could therefore be used to significantly improve the performance of existing recognition algorithms and systems. Error-rate reductions of over 40% are commonly observed for both false rejections and false acceptances.
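
As an illustration of the two error types named above, the following minimal sketch computes a false rejection rate, a false acceptance rate, and the relative reduction between a baseline and an improved system. The abstract does not define how the reported reduction is computed, so the relative-reduction formula is an assumption; the class, the function names, and all counts are hypothetical and are not results from the paper.

```python
from dataclasses import dataclass


@dataclass
class TrialCounts:
    """Hypothetical tallies from a wake-up-word evaluation run."""
    wuw_trials: int         # utterances that contained the wake-up word
    false_rejections: int   # wake-up-word utterances the system missed
    non_wuw_trials: int     # utterances without the wake-up word
    false_acceptances: int  # non-wake-up-word utterances wrongly accepted


def false_rejection_rate(c: TrialCounts) -> float:
    """Fraction of wake-up-word utterances that were rejected."""
    return c.false_rejections / c.wuw_trials


def false_acceptance_rate(c: TrialCounts) -> float:
    """Fraction of non-wake-up-word utterances that were accepted."""
    return c.false_acceptances / c.non_wuw_trials


def relative_reduction(baseline: float, improved: float) -> float:
    """Relative error reduction, e.g. 0.40 for a 40% reduction (assumed definition)."""
    return (baseline - improved) / baseline


# Illustrative numbers only; they are NOT taken from the paper.
baseline = TrialCounts(wuw_trials=1000, false_rejections=50,
                       non_wuw_trials=5000, false_acceptances=100)
improved = TrialCounts(wuw_trials=1000, false_rejections=30,
                       non_wuw_trials=5000, false_acceptances=55)

frr_cut = relative_reduction(false_rejection_rate(baseline),
                             false_rejection_rate(improved))
far_cut = relative_reduction(false_acceptance_rate(baseline),
                             false_acceptance_rate(improved))
print(f"FRR reduction: {frr_cut:.0%}, FAR reduction: {far_cut:.0%}")
```
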
This paper presents the architecture of a WUW-SR based system that serves as an interface to current SR applications. In this system, WUW-SR acts as a gateway for truly Voice Activated applications that use the current solution without the "push-to-talk" paradigm. The technique has been developed with hardware optimization in mind and can therefore run as a "background" application on a standard Windows-based PC platform.
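
The gateway idea can be sketched in a few lines, under strong simplifying assumptions. The abstract gives no implementation details, so this is not the authors' architecture: text strings stand in for audio segments, the wake-up word "operator" is invented, and `toy_wuw_detector`, `toy_recognizer`, and `voice_activated_gateway` are hypothetical names. The sketch only shows the control flow in which a downstream recognizer receives input after the wake-up word is detected, with no button press.

```python
from typing import Callable, Iterable, List

WAKE_WORD = "operator"  # hypothetical wake-up word, chosen for illustration


def toy_wuw_detector(segment: str) -> bool:
    """Stand-in for a WUW recognizer: True when the segment is the wake-up word."""
    return segment.strip().lower() == WAKE_WORD


def toy_recognizer(segment: str) -> str:
    """Stand-in for an existing SR application that handles a command."""
    return f"command: {segment}"


def voice_activated_gateway(
    segments: Iterable[str],
    detect_wuw: Callable[[str], bool] = toy_wuw_detector,
    recognize: Callable[[str], str] = toy_recognizer,
) -> List[str]:
    """Forward only the segment that follows a detected wake-up word."""
    results: List[str] = []
    armed = False  # becomes True once the wake-up word has been heard
    for segment in segments:
        if armed:
            # The gate is open: pass this segment to the SR application.
            results.append(recognize(segment))
            armed = False  # close the gate until the next wake-up word
        elif detect_wuw(segment):
            armed = True
        # Otherwise the segment is ignored (background speech).
    return results


stream = ["background chatter", "operator", "request backup", "more chatter"]
print(voice_activated_gateway(stream))
```

In this toy stream only "request backup" reaches the recognizer; the surrounding chatter is dropped because the gate is closed when it arrives.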

Paper Details

Date Published: 10 May 2006
PDF: 8 pages
Proc. SPIE 6201, Sensors, and Command, Control, Communications, and Intelligence (C3I) Technologies for Homeland Security and Homeland Defense V, 62011E (10 May 2006); doi: 10.1117/12.666025
Veton Këpuska, Florida Institute of Technology (United States)
Jason Breitfeller, BreitIdeas Inc. (United States)

Published in SPIE Proceedings Vol. 6201:
Sensors, and Command, Control, Communications, and Intelligence (C3I) Technologies for Homeland Security and Homeland Defense V
Edward M. Carapezza, Editor(s)

© SPIE