Share Email Print

Proceedings Paper

Video text localization using wavelet and shearlet transforms
Author(s): Purnendu Banerjee; B. B. Chaudhuri
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Text in video is useful and important in indexing and retrieving the video documents efficiently and accurately. In this paper, we present a new method of text detection using a combined dictionary consisting of wavelets and a recently introduced transform called shearlets. Wavelets provide optimally sparse expansion for point-like structures and shearlets provide optimally sparse expansions for curve-like structures. By combining these two features we have computed a high frequency sub-band to brighten the text part. Then K-means clustering is used for obtaining text pixels from the Standard Deviation (SD) of combined coefficient of wavelets and shearlets as well as the union of wavelets and shearlets features. Text parts are obtained by grouping neighboring regions based on geometric properties of the classified output frame of unsupervised K-means classification. The proposed method tested on a standard as well as newly collected database shows to be superior to some of the existing methods.

Paper Details

Date Published: 24 March 2014
PDF: 10 pages
Proc. SPIE 9021, Document Recognition and Retrieval XXI, 90210B (24 March 2014); doi: 10.1117/12.2036077
Show Author Affiliations
Purnendu Banerjee, Society for Natural Language Technology Research (India)
B. B. Chaudhuri, Indian Statistical Institute (India)

Published in SPIE Proceedings Vol. 9021:
Document Recognition and Retrieval XXI
Bertrand Coüasnon; Eric K. Ringger, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?