Share Email Print

Proceedings Paper

Boost OCR accuracy using iVector based system combination approach
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

Optical character recognition (OCR) is a challenging task because most existing preprocessing approaches are sensitive to writing style, writing material, noises and image resolution. Thus, a single recognition system cannot address all factors of real document images. In this paper, we describe an approach to combine diverse recognition systems by using iVector based features, which is a newly developed method in the field of speaker verification. Prior to system combination, document images are preprocessed and text line images are extracted with different approaches for each system, where iVector is transformed from a high-dimensional supervector of each text line and is used to predict the accuracy of OCR. We merge hypotheses from multiple recognition systems according to the overlap ratio and the predicted OCR score of text line images. We present evaluation results on an Arabic document database where the proposed method is compared against the single best OCR system using word error rate (WER) metric.

Paper Details

Date Published: 8 February 2015
PDF: 8 pages
Proc. SPIE 9402, Document Recognition and Retrieval XXII, 94020E (8 February 2015); doi: 10.1117/12.2076241
Show Author Affiliations
Xujun Peng, Raytheon BBN Technologies (United States)
Huaigu Cao, Raytheon BBN Technologies (United States)
Prem Natarajan, The Univ. of Southern California, Marina del Rey (United States)

Published in SPIE Proceedings Vol. 9402:
Document Recognition and Retrieval XXII
Eric K. Ringger; Bart Lamiroy, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?