Share Email Print

Proceedings Paper

Real-time text extraction based on the page layout analysis system
Author(s): M. Soua; A. Benchekroun; R. Kachouri; M. Akil
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Several approaches were proposed in order to extract text from scanned documents. However, text extraction in heterogeneous documents stills a real challenge. Indeed, text extraction in this context is a difficult task because of the variation of the text due to the differences of sizes, styles and orientations, as well as to the complexity of the document region background. Recently, we have proposed the improved hybrid binarization based on Kmeans method (I-HBK)5 to extract suitably the text from heterogeneous documents. In this method, the Page Layout Analysis (PLA), part of the Tesseract OCR engine, is used to identify text and image regions. Afterwards our hybrid binarization is applied separately on each kind of regions. In one side, gamma correction is employed before to process image regions. In the other side, binarization is performed directly on text regions. Then, a foreground and background color study is performed to correct inverted region colors. Finally, characters are located from the binarized regions based on the PLA algorithm. In this work, we extend the integration of the PLA algorithm within the I-HBK method. In addition, to speed up the separation of text and image step, we employ an efficient GPU acceleration. Through the performed experiments, we demonstrate the high F-measure accuracy of the PLA algorithm reaching 95% on the LRDE dataset. In addition, we illustrate the sequential and the parallel compared PLA versions. The obtained results give a speedup of 3.7x when comparing the parallel PLA implementation on GPU GTX 660 to the CPU version.

Paper Details

Date Published: 1 May 2017
PDF: 8 pages
Proc. SPIE 10223, Real-Time Image and Video Processing 2017, 1022305 (1 May 2017); doi: 10.1117/12.2262364
Show Author Affiliations
M. Soua, ESIEE Paris, IGM, A3SI (France)
A. Benchekroun, ESIEE Paris, IGM, A3SI (France)
R. Kachouri, ESIEE Paris, IGM, A3SI (France)
M. Akil, ESIEE Paris, IGM, A3SI (France)

Published in SPIE Proceedings Vol. 10223:
Real-Time Image and Video Processing 2017
Nasser Kehtarnavaz; Matthias F. Carlsohn, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?