Share Email Print

Proceedings Paper

General text line extraction approach based on locally orientation estimation
Author(s): Nazih Ouwayed; Abdel Belaïd; François Auger
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

This paper presents a novel approach for the multi-oriented text line extraction from historical handwritten Arabic documents. Because of the multi-orientation of lines and their dispersion in the page, we use an image paving algorithm that can progressively and locally determine the lines. The paving algorithm is initialized with a small window and then its size is corrected by extension until enough lines and connected components were found. We use the Snake for line extraction. Once the paving is established, the orientation is determined using the Wigner-Ville distribution on the histogram projection profile. This local orientation is then enlarged to limit the orientation in the neighborhood. Afterwards, the text lines are extracted locally in each zone basing on the follow-up of the baselines and the proximity of connected components. Finally, the connected components that overlap and touch in adjacent lines are separated. The morphology analysis of the terminal letters of Arabic words is here considered. The proposed approach has been experimented on 100 documents reaching an separation accuracy of about 98.6%.

Paper Details

Date Published: 18 January 2010
PDF: 8 pages
Proc. SPIE 7534, Document Recognition and Retrieval XVII, 75340B (18 January 2010); doi: 10.1117/12.839518
Show Author Affiliations
Nazih Ouwayed, LORIA, Univ. of Nancy 2 (France)
Abdel Belaïd, LORIA, Univ. of Nancy 2 (France)
François Auger, Univ. of Nantes, IREENA (France)

Published in SPIE Proceedings Vol. 7534:
Document Recognition and Retrieval XVII
Laurence Likforman-Sulem; Gady Agam, Editor(s)

© SPIE. Terms of Use
Back to Top