Share Email Print

Proceedings Paper

Turkish handwritten text recognition: a case of agglutinative languages
Author(s): Berrin A. Yanikoglu; Alisher Kholmatov
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

We describe a system for recognizing unconstrained Turkish handwritten text. Turkish has agglutinative morphology and theoretically an infinite number of words that can be generated by adding more suffixes to the word. This makes lexicon-based recognition approaches, where the most likely word is selected among all the alternatives in a lexicon, unsuitable for Turkish. We describe our approach to the problem using a Turkish prefix recognizer. First results of the system demonstrates the promise of this approach, with top-10 word recognition rate of about 40% for a small test data of mixed handprint and cursive writing. The lexicon-based approach with a 17,000 word-lexicon (with test words added) achieves 56% top-10 word recognition rate.

Paper Details

Date Published: 13 January 2003
PDF: 7 pages
Proc. SPIE 5010, Document Recognition and Retrieval X, (13 January 2003); doi: 10.1117/12.476045
Show Author Affiliations
Berrin A. Yanikoglu, Sabanci Univ. (Turkey)
Alisher Kholmatov, Sabanci Univ. (Turkey)

Published in SPIE Proceedings Vol. 5010:
Document Recognition and Retrieval X
Tapas Kanungo; Elisa H. Barney Smith; Jianying Hu; Paul B. Kantor, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?