Share Email Print

Proceedings Paper

Designing caption production rules based on face, text, and motion detection
Author(s): C. Chapdelaine; M. Beaulieu; L. Gagnon
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

Producing off-line captions for the deaf and hearing impaired people is a labor-intensive task that can require up to 18 hours of production per hour of film. Captions are placed manually close to the region of interest but it must avoid masking human faces, texts or any moving objects that might be relevant to the story flow. Our goal is to use image processing techniques to reduce the off-line caption production process by automatically placing the captions on the proper consecutive frames. We implemented a computer-assisted captioning software tool which integrates detection of faces, texts and visual motion regions. The near frontal faces are detected using a cascade of weak classifier and tracked through a particle filter. Then, frames are scanned to perform text spotting and build a region map suitable for text recognition. Finally, motion mapping is based on the Lukas-Kanade optical flow algorithm and provides MPEG-7 motion descriptors. The combined detected items are then fed to a rule-based algorithm to determine the best captions localization for the related sequences of frames. This paper focuses on the defined rules to assist the human captioners and the results of a user evaluation for this approach.

Paper Details

Date Published: 4 March 2008
PDF: 8 pages
Proc. SPIE 6806, Human Vision and Electronic Imaging XIII, 68061K (4 March 2008); doi: 10.1117/12.766841
Show Author Affiliations
C. Chapdelaine, Computer Research Institute of Montreal (Canada)
M. Beaulieu, Computer Research Institute of Montreal (Canada)
L. Gagnon, Computer Research Institute of Montreal (Canada)

Published in SPIE Proceedings Vol. 6806:
Human Vision and Electronic Imaging XIII
Bernice E. Rogowitz; Thrasyvoulos N. Pappas, Editor(s)

© SPIE. Terms of Use
Back to Top