Share Email Print

Proceedings Paper

Script determination in document images
Author(s): Larry Spitz
Format Member Price Non-Member Price
PDF $14.40 $18.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

We have developed techniques for distinguishing which language is represented in an image of text. This work is restricted to an important subset of the world's languages, using techniques that should be applicable across even more comprehensive samples. The method first classifies the script into two broad classes: European and Asian. This classification is based on the spatial relationships of fiducial points related to the upward concavities in character structures. Script identification within the Asian class, (Japanese, Chinese, Korean) is performed by analysis of the optical density distribution of the text images.

Paper Details

Date Published: 30 March 1995
PDF: 9 pages
Proc. SPIE 2422, Document Recognition II, (30 March 1995); doi: 10.1117/12.205831
Show Author Affiliations
Larry Spitz, Fuji Xerox Palo Alto Lab. (United States)

Published in SPIE Proceedings Vol. 2422:
Document Recognition II
Luc M. Vincent; Henry S. Baird, Editor(s)

© SPIE. Terms of Use
Back to Top