Share Email Print

Proceedings Paper

Automatic video caption detection and extraction in the DCT compressed domain
Author(s): Chin-Fu Tsao; Yu-Hao Chen; Jin-Hau Kuo; Chia-wei Lin; Ja-Ling Wu
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

The text in a video frame can help us to understand the semantics of video content directly. Although there are many approaches that can automatically detect and localize text a video, most of them use the original pixels of an image to find the text regions. In this paper, we present an approach to automatically localize captions in MPEG compressed videos. Caption regions are segmented from background by using their distinguishing texture characteristics. Unlike previously published ones which fully decompress the video sequence before extracting the caption regions or only extract text regions in Intra-(I-) frames, our approach detect and localize caption regions directly in the DCT compressed domain. Therefore, only very small amounts of decoding processes are required. Experiments show that a good caption detection rate can be obtained, and the average recalls of Intra- and Inter-frame detections are 97.77% and 97.84%, respectively.

Paper Details

Date Published: 31 July 2006
PDF: 13 pages
Proc. SPIE 5960, Visual Communications and Image Processing 2005, 59602N (31 July 2006); doi: 10.1117/12.631588
Show Author Affiliations
Chin-Fu Tsao, National Taiwan Univ. (Taiwan)
Yu-Hao Chen, National Taiwan Univ. (Taiwan)
Jin-Hau Kuo, National Taiwan Univ. (Taiwan)
Chia-wei Lin, National Taiwan Univ. (Taiwan)
Ja-Ling Wu, National Taiwan Univ. (Taiwan)

Published in SPIE Proceedings Vol. 5960:
Visual Communications and Image Processing 2005
Shipeng Li; Fernando Pereira; Heung-Yeung Shum; Andrew G. Tescher, Editor(s)

© SPIE. Terms of Use
Back to Top