
Proceedings Paper

"Can you see me now?" An objective metric for predicting intelligibility of compressed American Sign Language video

Paper Abstract

For members of the Deaf Community in the United States, current communication tools include TTY/TDD services, video relay services, and text-based communication. With the growth of cellular technology, mobile sign language conversations are becoming a possibility. Proper coding techniques must be employed to compress American Sign Language (ASL) video for low-rate transmission while maintaining the quality of the conversation. In order to evaluate these techniques, an appropriate quality metric is needed. This paper demonstrates that traditional video quality metrics, such as PSNR, fail to predict subjective intelligibility scores. By considering the unique structure of ASL video, an appropriate objective metric is developed. Face and hand segmentation is performed using skin-color detection techniques. The distortions in the face and hand regions are optimally weighted and pooled across all frames to create an objective intelligibility score for a distorted sequence. The objective intelligibility metric performs significantly better than PSNR in terms of correlation with subjective responses.
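
To illustrate why a whole-frame metric such as PSNR can miss what matters for intelligibility, the following is a minimal sketch, not the authors' implementation. It assumes grayscale frames stored as nested lists, face and hand masks that are already available (the skin-color segmentation step is not reproduced here), mean squared error as the per-region distortion, and a simple average as the pooling across frames. The function names and the weights `w_face` and `w_hand` are hypothetical placeholders; the abstract states that the paper's weights are chosen optimally but does not give the procedure or the values.

```python
import math


def mse(ref, dist, mask=None):
    """Mean squared error over pixels where mask is True (all pixels if mask is None)."""
    total, count = 0.0, 0
    for y, row in enumerate(ref):
        for x, r in enumerate(row):
            if mask is None or mask[y][x]:
                total += (r - dist[y][x]) ** 2
                count += 1
    return total / count if count else 0.0


def psnr(ref, dist, peak=255.0):
    """Whole-frame PSNR in dB; infinite when the frames are identical."""
    err = mse(ref, dist)
    return math.inf if err == 0 else 10.0 * math.log10(peak ** 2 / err)


def region_weighted_distortion(ref_frames, dist_frames, face_masks, hand_masks,
                               w_face=0.5, w_hand=0.5):
    """Weighted face/hand distortion, averaged over frames.

    w_face and w_hand are placeholder values (an assumption), not the
    weights reported in the paper.
    """
    per_frame = [
        w_face * mse(ref, dist, face) + w_hand * mse(ref, dist, hand)
        for ref, dist, face, hand in zip(ref_frames, dist_frames,
                                         face_masks, hand_masks)
    ]
    if not per_frame:
        raise ValueError("at least one frame is required")
    return sum(per_frame) / len(per_frame)


# Toy 2x2 frame: one face pixel, one hand pixel, two background pixels.
ref = [[10, 10], [10, 10]]
face = [[True, False], [False, False]]
hand = [[False, True], [False, False]]

err_in_face = [[14, 10], [10, 10]]        # same error magnitude, on the face
err_in_background = [[10, 10], [10, 14]]  # same error magnitude, on the background

same_psnr = psnr(ref, err_in_face) == psnr(ref, err_in_background)
score_face = region_weighted_distortion([ref], [err_in_face], [face], [hand])
score_background = region_weighted_distortion([ref], [err_in_background], [face], [hand])
```

In this toy case the two distorted frames receive the same PSNR, while the region-weighted score penalizes only the frame whose error falls on the face. Lower values of this sketch's score mean less distortion in the signing regions; mapping such a score to subjective intelligibility would require fitting against subjective data as the paper describes.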

Paper Details

Date Published: 14 March 2007
PDF: 9 pages
Proc. SPIE 6492, Human Vision and Electronic Imaging XII, 64920M (14 March 2007); doi: 10.1117/12.707448
Francis M. Ciaramello, Cornell Univ. (United States)
Sheila S. Hemami, Cornell Univ. (United States)

Published in SPIE Proceedings Vol. 6492:
Human Vision and Electronic Imaging XII
Bernice E. Rogowitz; Thrasyvoulos N. Pappas; Scott J. Daly, Editor(s)
