6533b86dfe1ef96bd12c95f5
RESEARCH PRODUCT
CLUSTERING INCOMPLETE SPECTRAL DATA WITH ROBUST METHODS
Ilkka PölönenMatti A. EskelinenSami ÄYrämösubject
lcsh:Applied optics. PhotonicsMultivariate statisticsComputer scienceGaussianCorrelation clusteringRobust statisticsspectral datacomputer.software_genrelcsh:Technologysymbols.namesakeCURE data clustering algorithmImputation (statistics)interpolointiCluster analysisK-meansnan-K-spatmedlcsh:Tk-means clusteringlcsh:TA1501-1820robust statistical methodsMissing dataData setlcsh:TA1-2040OutliersymbolsData mininglcsh:Engineering (General). Civil engineering (General)computerclusteringdescription
Abstract. Missing value imputation is a common approach for preprocessing incomplete data sets. In case of data clustering, imputation methods may cause unexpected bias because they may change the underlying structure of the data. In order to avoid prior imputation of missing values the computational operations must be projected on the available data values. In this paper, we apply a robust nan-K-spatmed algorithm to the clustering problem on hyperspectral image data. Robust statistics, such as multivariate medians, are more insensitive to outliers than classical statistics relying on the Gaussian assumptions. They are, however, computationally more intractable due to the lack of closed-form solutions. We will compare robust clustering methods on the bands incomplete data cubes to standard K-means with full data cubes.
year | journal | country | edition | language |
---|---|---|---|---|
2018-01-15 |