Share Email Print

Proceedings Paper

Multistage predictive coding approach to unsupervised text detection in video images
Author(s): Eliza Yingzi Du; Paul D. Thouin; Chein-I Chang
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Text detection in video images is a challenging research problem because of the poor spatial resolution and the complex backgrounds, which may contain a variety of colors. This paper presents a multistage predictive coding scheme, referred to as Multistage Pulse Code Modulation (MPCM), which can be used to effectively detect text in color video frames. It converts a video image to a coded image with each pixel encoded by a priority code ranging from 7 down to 0. A priority code 7 retains the most significant information while a priority code 0 represents the least significant information which can be dropped without loss of much information. Using the global mean of the coded image as a threshold value, a set of potential text regions can be detected from each video frame. A series of spatial filters is then implemented in order to eliminate regions that are unlikely to contain text. As a final step, we eliminate those potential text regions where Optical character Recognition (OCR) produces no results. An extensive set of experiments demonstrates that our proposed MPCM-based text detection technique is effective in detecting text in a wide variety of video images.

Paper Details

Date Published: 18 December 2001
PDF: 8 pages
Proc. SPIE 4670, Document Recognition and Retrieval IX, (18 December 2001); doi: 10.1117/12.450727
Show Author Affiliations
Eliza Yingzi Du, Univ. of Maryland/Baltimore County (United States)
Paul D. Thouin, U.S. Dept. of Defense (United States)
Chein-I Chang, Univ. of Maryland/Baltimore County (United States)

Published in SPIE Proceedings Vol. 4670:
Document Recognition and Retrieval IX
Paul B. Kantor; Tapas Kanungo; Jiangying Zhou, Editor(s)

© SPIE. Terms of Use
Back to Top