6533b86efe1ef96bd12cb314
RESEARCH PRODUCT
A Feature Set Decomposition Method for the Construction of Multi-classifier Systems Trained with High-Dimensional Data
Yoisel CamposRoberto EstradaFrancesc J. FerriCarlos Morellsubject
Clustering high-dimensional databusiness.industryComputer sciencePattern recognitionInformation theorycomputer.software_genreUncorrelatedDecomposition method (queueing theory)Data miningArtificial intelligencebusinessFeature setcomputerClassifier (UML)Curse of dimensionalitydescription
Data mining for the discovery of novel, useful patterns, encounters obstacles when dealing with high-dimensional datasets, which have been documented as the "curse" of dimensionality. A strategy to deal with this issue is the decomposition of the input feature set to build a multi-classifier system. Standalone decomposition methods are rare and generally based on random selection. We propose a decomposition method which uses information theory tools to arrange input features into uncorrelated and relevant subsets. Experimental results show how this approach significantly outperforms three baseline decomposition methods, in terms of classification accuracy.
year | journal | country | edition | language |
---|---|---|---|---|
2013-01-01 |