Share Email Print

Journal of Electronic Imaging

Genetic algorithm for clustering mixed-type data
Author(s): Shiueng-Bien Yang; Yung-Gi Wu
Format Member Price Non-Member Price
PDF $20.00 $25.00
cover GOOD NEWS! Your organization subscribes to the SPIE Digital Library. You may be able to download this paper for free. Check Access

Paper Abstract

The k-modes algorithm was recently proposed to cluster mixed-type data. However, in solving clustering problems, the k-modes algorithm and its variants usually ask the user to provide the number of clusters in the data sets. Unfortunately, the number of clusters is generally unknown to the user. Therefore, clustering becomes a tedious task of trial-and-error and the clustering result is often poor, especially when the number of clusters is large and not easy to guess. Also, it is hard for a user to select the weight between categorical and numeric attributes in the k-modes algorithm. In this paper, a genetic algorithm for clustering large data sets with mixed-type data is proposed, and this algorithm can automatically search the number of clusters in the data set. Also, a weight can be automatically selected by the genetic algorithm to prevent favoring either type of attribute. Experimental results illustrate the effectiveness of the genetic algorithm.

Paper Details

Date Published: 1 January 2011
PDF: 6 pages
J. Electron. Imag. 20(1) 013003 doi: 10.1117/1.3537836
Published in: Journal of Electronic Imaging Volume 20, Issue 1
Show Author Affiliations
Shiueng-Bien Yang, Wenzao Ursuline College of Languages (Taiwan)
Yung-Gi Wu, Chang Jung Christian Univ. (Taiwan)

© SPIE. Terms of Use
Back to Top