Proceedings Paper

A semi-supervised learning method to classify grant support zone in web-based medical articles
Author(s): Xiaoli Zhang; Jie Zou; Daniel X. Le; George Thoma

Paper Abstract

Traditional classifiers are trained on labeled data only. Labeled samples are often expensive to obtain, while unlabeled data are abundant. Semi-supervised learning can therefore be of great value because it uses both labeled and unlabeled data for training. We introduce a semi-supervised learning method, decision-directed approximation combined with Support Vector Machines (SVM), to detect zones containing information on grant support (a type of bibliographic data) in online medical journal articles. We analyzed the performance of our model using different numbers of unlabeled samples, and demonstrated that our proposed rules are effective in boosting classification accuracy. The experimental results show that the decision-directed approximation method with SVM improves classification accuracy when a small amount of labeled data is used in conjunction with unlabeled data to train the SVM.
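
The abstract does not spell out the training procedure or the proposed rules, so the following is only a minimal sketch of the general decision-directed idea: fit a classifier on the labeled samples, let its own decisions label the unlabeled samples, and retrain on both. Everything specific here is an assumption made for illustration and not the authors' implementation: the tiny pure-Python linear SVM (fitted by subgradient descent on the hinge loss), the `min_score` confidence threshold standing in for the paper's selection rules, the function names, and the 2-D toy points standing in for zone features.

```python
import random


def decision_value(w, b, x):
    """Signed score w.x + b of a linear classifier."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def predict(w, b, x):
    """Class label in {-1, +1} from the sign of the score."""
    return 1 if decision_value(w, b, x) >= 0 else -1


def train_linear_svm(X, y, epochs=200, eta=0.01, lam=0.01, seed=0):
    """Linear SVM fitted by subgradient descent on the L2-regularised hinge loss."""
    rng = random.Random(seed)
    w, b = [0.0] * len(X[0]), 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            violated = y[i] * decision_value(w, b, X[i]) < 1
            w = [wj * (1 - eta * lam) for wj in w]  # regularisation shrinkage
            if violated:  # sample is inside the margin: hinge subgradient step
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
                b += eta * y[i]
    return w, b


def decision_directed_svm(X_lab, y_lab, X_unl, rounds=3, min_score=0.5):
    """Retrain on labeled data plus unlabeled data labeled by the current model.

    min_score is a hypothetical confidence rule: only unlabeled samples whose
    absolute score reaches it receive a pseudo-label in a given round.
    """
    w, b = train_linear_svm(X_lab, y_lab)
    for _ in range(rounds):
        X_pseudo, y_pseudo = [], []
        for x in X_unl:
            if abs(decision_value(w, b, x)) >= min_score:
                X_pseudo.append(x)
                y_pseudo.append(predict(w, b, x))
        w, b = train_linear_svm(X_lab + X_pseudo, y_lab + y_pseudo)
    return w, b


# Made-up toy data: +1 could stand for "grant support zone", -1 for "other zone".
X_lab = [(2.0, 2.0), (2.5, 1.5), (-2.0, -2.0), (-1.5, -2.5)]
y_lab = [1, 1, -1, -1]
X_unl = [(1.5, 2.5), (3.0, 2.0), (2.0, 3.0), (1.0, 2.0),
         (-3.0, -2.0), (-2.0, -3.0), (-1.0, -2.0), (-2.5, -1.5)]

w, b = decision_directed_svm(X_lab, y_lab, X_unl)
print(predict(w, b, (3.0, 3.0)), predict(w, b, (-3.0, -3.0)))
```

In a sketch like this, the threshold trades coverage against the risk of reinforcing wrong pseudo-labels: a stricter value admits fewer unlabeled samples per round, while a looser one admits more but may propagate early mistakes.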

Paper Details

Date Published: 19 January 2009
PDF: 8 pages
Proc. SPIE 7247, Document Recognition and Retrieval XVI, 72470W (19 January 2009); doi: 10.1117/12.806076
Author Affiliations
Xiaoli Zhang, National Library of Medicine (United States)
Jie Zou, National Library of Medicine (United States)
Daniel X. Le, National Library of Medicine (United States)
George Thoma, National Library of Medicine (United States)

Published in SPIE Proceedings Vol. 7247:
Document Recognition and Retrieval XVI
Kathrin Berkner; Laurence Likforman-Sulem, Editor(s)

© SPIE