Share Email Print

Proceedings Paper

Distance functions in dynamic integration of data mining techniques
Author(s): Seppo Jumani Puuronen; Alexey Tsymbal; Vagan Terziyan
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

One of the most important directions in the improvement of data mining and knowledge discovery is the integration of multiple data mining techniques. An integration method needs to be able either to evaluate and select the most appropriate data mining technique or to combine two or more techniques efficiently. A recent integration method for the dynamic integration of multiple data mining techniques is based on the assumption that each of the data mining techniques is the best one inside a certain subarea of the whole domain area. This method uses an instance-based learning approach to collect information about the competence areas of the mining techniques and applies a distance function to determine how close a new instance is to each instance of the training set. The nearest instance or instances are used to predict the performance of the data mining techniques. Because the quality of the integration depends heavily on the suitability of the used distance function, our goal is to analyze the characteristics of different distance functions. In this paper we investigate several distance functions as the very commonly used Euclidean distance function, the Heterogeneous Euclidean- Overlap Metric (HEOM), and the Heterogeneous Value Difference Metric (HVDM), among others. We analyze the effects of the use of different distance functions to the accuracy achieved by dynamic integration when the parameters describing datasets vary. We include also results of our experiments with different datasets which include both nominal and continuous attributes.

Paper Details

Date Published: 6 April 2000
PDF: 11 pages
Proc. SPIE 4057, Data Mining and Knowledge Discovery: Theory, Tools, and Technology II, (6 April 2000); doi: 10.1117/12.381747
Show Author Affiliations
Seppo Jumani Puuronen, Univ. of Jyvaskyla (Finland)
Alexey Tsymbal, Univ. of Jyvaskyla (Finland)
Vagan Terziyan, State Technical Univ. of Radioelectronics (Ukraine)

Published in SPIE Proceedings Vol. 4057:
Data Mining and Knowledge Discovery: Theory, Tools, and Technology II
Belur V. Dasarathy, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?