6533b835fe1ef96bd129fd81
RESEARCH PRODUCT
Robust refinement of initial prototypes for partitioning-based clustering algorithms
Tommi KärkkäinenSami ÄYrämöKirsi Majavasubject
Set (abstract data type)Computer scienceCorrelation clusteringOutlierInitializationSensitivity (control systems)Density estimationNoise (video)Data miningCluster analysiscomputer.software_genrecomputerdescription
Non-uniqueness of solutions and sensitivity to erroneous data are common problems to large-scale data clustering tasks. In order to avoid poor quality of solutions with partitioning-based clustering methods, robust estimates (that are highly insensitive to erroneous data values) are needed and initial cluster prototypes should be determined properly. In this paper, a robust density estimation initialization method that exploits the spatial median estimate to the prototype update is presented. Besides being insensitive to noise and outliers, the new method is also computationally comparable with other traditional methods. The methods are compared by numerical experiments on a set of synthetic and real-world data sets. Conclusions and discussion on the results are given.
year | journal | country | edition | language |
---|---|---|---|---|
2007-11-01 | Recent Advances in Stochastic Modeling and Data Analysis |