Share Email Print
cover

Proceedings Paper

Image features for machine-learning-based approach for web image classification
Author(s): Soosun Cho; Chi-Jung Hwang
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

The ubiquity of the Internet has brought about an increasing amount of multi-formatted Web documents. Although image occupies a large part of importance on these increasing Web documents, there have not been many researches for analyzing and understanding it. Many Web images are used for carrying important information but others are not used for it. If images in a Web document can be classified by which have particular information or not, then it would be very useful for analysis and multi-formatting of Web documents. In this paper we introduce the machine learning based methods of classifying Web images as either eliminable or non-eliminable. For this research, we have detected 16 special and rich features for Web images and experimented by using the Bayesian and decision tree methods. As the results, F-measures of 87.09%, 82.72% were achieved for each method and particularly, from the experiments to compare the effects of feature groups, it has proved that the selected features on this study are very useful for Web image classification.

Paper Details

Date Published: 10 January 2003
PDF: 8 pages
Proc. SPIE 5018, Internet Imaging IV, (10 January 2003); doi: 10.1117/12.479719
Show Author Affiliations
Soosun Cho, Electronics and Telecommunications Research Institute (South Korea)
Chi-Jung Hwang, Electronics and Telecommunications Research Institute (South Korea)


Published in SPIE Proceedings Vol. 5018:
Internet Imaging IV
Simone Santini; Raimondo Schettini, Editor(s)

© SPIE. Terms of Use
Back to Top