
Proceedings Paper

Combining macro and micro features for writer identification
Author(s): Sangjik Lee; Sung-Hyuk Cha; Sargur N. Srihari

Paper Abstract

In our previous work on writer identification, a database of handwriting samples (written in English) from over one thousand individuals was created, and two types of computer-generated features were extracted from the sample handwriting: macro and micro features. Using these features, writer identification experiments were performed: given that a document was written by one of n writers, the task is to determine the writer. With n = 2, we correctly determined the writer with 99% accuracy using only 10-character micro features in the writing; with n = 1000, the accuracy dropped to 80%. To obtain higher performance, we propose a combination of macro and micro level features. First, macro level features are used in a filtering model: the computer program is presented with multiple handwriting samples from a large number (1000) of writers, and the question posed is: which of the samples are consistent with a test sample? The filtering model yields a reduced set of documents (100), which is presented to the final identification model that uses the micro level features. With the proposed filtering-combination technique, we improved the accuracy of our writer identification system from 80% to 87.5% when n = 1000.
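The two-stage idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the classifiers or distance measures used, so everything below is an assumption for illustration only: each writer is represented by one macro and one micro feature vector, Euclidean distance stands in for the actual similarity measure, and the names `filter_candidates` and `identify_writer` are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def filter_candidates(test_macro, macro_by_writer, keep):
    """Stage 1 (filtering): rank writers by macro-feature distance to the
    test sample and keep the `keep` closest ones.
    Assumption: nearest-neighbour ranking stands in for the paper's
    filtering model, which the abstract does not describe in detail."""
    ranked = sorted(
        macro_by_writer,
        key=lambda w: euclidean(test_macro, macro_by_writer[w]),
    )
    return ranked[:keep]


def identify_writer(test_macro, test_micro, macro_by_writer,
                    micro_by_writer, keep=100):
    """Stage 2 (identification): among the filtered candidates, return the
    writer whose micro features are closest to the test sample."""
    candidates = filter_candidates(test_macro, macro_by_writer, keep)
    return min(
        candidates,
        key=lambda w: euclidean(test_micro, micro_by_writer[w]),
    )


# Toy data (invented for illustration, not taken from the paper).
macro = {"A": (0.0, 0.0), "B": (0.1, 0.0), "C": (5.0, 5.0), "D": (9.0, 9.0)}
micro = {"A": (1.0, 1.0), "B": (0.0, 0.0), "C": (0.0, 0.01), "D": (0.0, 0.0)}

shortlist = filter_candidates((0.0, 0.0), macro, keep=2)
winner = identify_writer((0.0, 0.0), (0.0, 0.0), macro, micro, keep=2)
```

In this toy setup, the macro stage narrows four writers to two, and the micro stage then decides between them. In the setting the abstract describes, `keep=100` would correspond to reducing 1000 writers to a set of 100 before the micro-level comparison.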

Paper Details

Date Published: 18 December 2001
PDF: 12 pages
Proc. SPIE 4670, Document Recognition and Retrieval IX, (18 December 2001); doi: 10.1117/12.450724
Sangjik Lee, SUNY/Buffalo (United States)
Sung-Hyuk Cha, Pace Univ. (United States)
Sargur N. Srihari, SUNY/Buffalo (United States)

Published in SPIE Proceedings Vol. 4670:
Document Recognition and Retrieval IX
Paul B. Kantor; Tapas Kanungo; Jiangying Zhou, Editor(s)
