Authors
YeongSeog Kim, W Nick Street, Filippo Menczer
Publication date
2000/8/1
Book
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Pages
365-369
Description
Feature subset selection is an important problem in knowledge discovery, not only for the insight gained from determining relevant modeling variables but also for the improved understandability, scalabilit y, and possibly, accuracy of the resulting models. In this paper w econsider the problem of feature selection for unsupervised learning. A number of heuristic criteria can be used to estimate the quality of clusters built from a given featuresubset. Rather than combining such criteria, we use ELSA, an evolutionary local selection algorithm that maintains a diverse population of solutions that approximate the Pareto front in a multidimensional objectiv espace. Eac hevolved solution represents a feature subset and a number of clusters; a standard K-means algorithm is applied to form the given n umber of clusters based on the selected features. Preliminary results on both real and synthetic data show promise in finding P …
Total citations
199920002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022202320242141419122019192828221627292618231710171614683
Scholar articles
YS Kim, WN Street, F Menczer - Proceedings of the sixth ACM SIGKDD international …, 2000