Proceedings Paper

Some observations on computer lip-reading: moving from the dream to the reality
Author(s): Helen L. Bear; Gari Owen; Richard Harvey; Barry-John Theobald

Paper Abstract

In the quest for greater computer lip-reading performance, a number of tacit assumptions are made, either in the datasets (high resolution, for example) or in the methods (recognition of spoken visual units called "visemes", for example). Here we review these and other assumptions and show the surprising result that computer lip-reading is not heavily constrained by video resolution, pose, lighting and other practical factors. However, the working assumption that visemes, the visual equivalent of phonemes, are the best unit for recognition does need further examination. We conclude that visemes, which were defined over a century ago, are unlikely to be optimal for a modern computer lip-reading system.

Paper Details

Date Published: 7 October 2014
PDF: 10 pages
Proc. SPIE 9253, Optics and Photonics for Counterterrorism, Crime Fighting, and Defence X; and Optical Materials and Biomaterials in Security and Defence Systems Technology XI, 92530G (7 October 2014); doi: 10.1117/12.2067464
Author Affiliations
Helen L. Bear, Univ. of East Anglia (United Kingdom)
Gari Owen, Annwvyn Solutions (United Kingdom)
Richard Harvey, Univ. of East Anglia (United Kingdom)
Barry-John Theobald, Univ. of East Anglia (United Kingdom)


Published in SPIE Proceedings Vol. 9253:
Optics and Photonics for Counterterrorism, Crime Fighting, and Defence X; and Optical Materials and Biomaterials in Security and Defence Systems Technology XI
Roberto Zamboni; Douglas Burgess; Gari Owen; François Kajzar; Attila A. Szep; Harbinder Rana, Editor(s)
