Share Email Print

Proceedings Paper

Audio-visual speech recognition for a vowel discrimination task
Author(s): Peter L. Silsbee; Alan Conrad Bovik
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Among the various methods which have been proposed to improve the robustness and accuracy of automatic speech recognition (ASR) systems, lipreading has received very little attention. In this paper, we provide motivation for the use of lipreading. A novel speaker dependent lipreading system is developed, which uses hidden Markov modeling, a well known and highly successful technique for audio-based ASR. It is used in conjunction with an audio ASR system in order to improve the accuracy of the latter, especially under degraded acoustical conditions. Reductions in error of 30 to over 60% result.

Paper Details

Date Published: 22 October 1993
PDF: 12 pages
Proc. SPIE 2094, Visual Communications and Image Processing '93, (22 October 1993); doi: 10.1117/12.157855
Show Author Affiliations
Peter L. Silsbee, Univ. of Texas/Austin (United States)
Alan Conrad Bovik, Univ. of Texas/Austin (United States)

Published in SPIE Proceedings Vol. 2094:
Visual Communications and Image Processing '93
Barry G. Haskell; Hsueh-Ming Hang, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?