6533b81ffe1ef96bd1276f2b

RESEARCH PRODUCT

Correlation-Based and Contextual Merit-Based Ensemble Feature Selection

Iryna SkrypnykSeppo PuuronenAlexey Tsymbal

subject

business.industryComputer scienceFeature selectionMachine learningcomputer.software_genreBase (topology)CorrelationComputingMethodologies_PATTERNRECOGNITIONArtificial intelligenceHeuristicsbusinessFocus (optics)Simple correlationcomputer

description

Recent research has proved the benefits of using an ensemble of diverse and accurate base classifiers for classification problems. In this paper the focus is on producing diverse ensembles with the aid of three feature selection heuristics based on two approaches: correlation and contextual merit -based ones. We have developed an algorithm and experimented with it to evaluate and compare the three feature selection heuristics on ten data sets from UCI Repository. On average, simple correlation-based ensemble has the superiority in accuracy. The contextual merit -based heuristics seem to include too many features in the initial ensembles and iterations were most successful with it.

https://doi.org/10.1007/3-540-44816-0_14