Share Email Print

Proceedings Paper

Combining discriminative SVM models for the improved recognition of investigator names in medical articles
Author(s): Xiaoli Zhang; Jie Zou; Daniel X. Le; George R. Thoma
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

Investigators are people who are listed as members of corporate organizations but not entered as authors in an article. Beginning with journals published in 2008, investigator names are required to be included in a new bibliographic field in MEDLINE citations. Automatic extraction of investigator names is necessary due to the increase in collaborative biomedical research and consequently the large number of such names. We implemented two discriminative SVM models, i.e., SVM and structural SVM, to identify named entities such as the first and last names of investigators from online medical journal articles. Both approaches achieve good performance at the word and name chunk levels. We further conducted an error analysis and found that SVM and structural SVM can offer complementary information about the patterns to be classified. Hence, we combined the two independently trained classifiers where the SVM is chosen as a base learner with its outputs enhanced by the predictions from the structural SVM. The overall performance especially the recall rate of investigator name retrieval exceeds that of the standalone SVM model.

Paper Details

Date Published: 4 February 2013
PDF: 11 pages
Proc. SPIE 8658, Document Recognition and Retrieval XX, 865815 (4 February 2013); doi: 10.1117/12.2007336
Show Author Affiliations
Xiaoli Zhang, National Library of Medicine (United States)
Jie Zou, National Library of Medicine (United States)
Daniel X. Le, National Library of Medicine (United States)
George R. Thoma, National Library of Medicine (United States)

Published in SPIE Proceedings Vol. 8658:
Document Recognition and Retrieval XX
Richard Zanibbi; Bertrand Coüasnon, Editor(s)

© SPIE. Terms of Use
Back to Top