6533b86dfe1ef96bd12c95f5

RESEARCH PRODUCT

CLUSTERING INCOMPLETE SPECTRAL DATA WITH ROBUST METHODS

Ilkka PölönenMatti A. EskelinenSami ÄYrämö

subject

lcsh:Applied optics. PhotonicsMultivariate statisticsComputer scienceGaussianCorrelation clusteringRobust statisticsspectral datacomputer.software_genrelcsh:Technologysymbols.namesakeCURE data clustering algorithmImputation (statistics)interpolointiCluster analysisK-meansnan-K-spatmedlcsh:Tk-means clusteringlcsh:TA1501-1820robust statistical methodsMissing dataData setlcsh:TA1-2040OutliersymbolsData mininglcsh:Engineering (General). Civil engineering (General)computerclustering

description

Abstract. Missing value imputation is a common approach for preprocessing incomplete data sets. In case of data clustering, imputation methods may cause unexpected bias because they may change the underlying structure of the data. In order to avoid prior imputation of missing values the computational operations must be projected on the available data values. In this paper, we apply a robust nan-K-spatmed algorithm to the clustering problem on hyperspectral image data. Robust statistics, such as multivariate medians, are more insensitive to outliers than classical statistics relying on the Gaussian assumptions. They are, however, computationally more intractable due to the lack of closed-form solutions. We will compare robust clustering methods on the bands incomplete data cubes to standard K-means with full data cubes.

10.5194/isprs-archives-xlii-3-w3-13-2017https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLII-3-W3/13/2017/