Share Email Print

Proceedings Paper

Arabic handwritten: pre-processing and segmentation
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

This paper is concerned with pre-processing and segmentation tasks that influence the performance of Optical Character Recognition (OCR) systems and handwritten/printed text recognition. In Arabic, these tasks are adversely effected by the fact that many words are made up of sub-words, with many sub-words there associated one or more diacritics that are not connected to the sub-word's body; there could be multiple instances of sub-words overlap. To overcome these problems we investigate and develop segmentation techniques that first segment a document into sub-words, link the diacritics with their sub-words, and removes possible overlapping between words and sub-words. We shall also investigate two approaches for pre-processing tasks to estimate sub-words baseline, and to determine parameters that yield appropriate slope correction, slant removal. We shall investigate the use of linear regression on sub-words pixels to determine their central x and y coordinates, as well as their high density part. We also develop a new incremental rotation procedure to be performed on sub-words that determines the best rotation angle needed to realign baselines. We shall demonstrate the benefits of these proposals by conducting extensive experiments on publicly available databases and in-house created databases. These algorithms help improve character segmentation accuracy by transforming handwritten Arabic text into a form that could benefit from analysis of printed text.

Paper Details

Date Published: 8 May 2012
PDF: 8 pages
Proc. SPIE 8406, Mobile Multimedia/Image Processing, Security, and Applications 2012, 84060D (8 May 2012); doi: 10.1117/12.917555
Show Author Affiliations
Makki Maliki, The Univ. of Buckingham (United Kingdom)
Sabah Jassim, The Univ. of Buckingham (United Kingdom)
Naseer Al-Jawad, The Univ. of Buckingham (United Kingdom)
Harin Sellahewa, The Univ. of Buckingham (United Kingdom)

Published in SPIE Proceedings Vol. 8406:
Mobile Multimedia/Image Processing, Security, and Applications 2012
Sos S. Agaian; Sabah A. Jassim; Eliza Yingzi Du, Editor(s)

© SPIE. Terms of Use
Back to Top