6533b873fe1ef96bd12d4a69

RESEARCH PRODUCT

Ensemble Feature Selection Based on the Contextual Merit

Iryna SkrypnykSeppo PuuronenAlexey Tsymbal

subject

Training setComputer sciencebusiness.industryHeuristicPattern recognitionFeature selectionContext (language use)Machine learningcomputer.software_genreEvaluation functionComputingMethodologies_PATTERNRECOGNITIONEnsembles of classifiersFeature (computer vision)Artificial intelligenceHeuristicsbusinesscomputer

description

Recent research has proved the benefits of using ensembles of classifiers for classification problems. Ensembles constructed by machine learning methods manipulating the training set are used to create diverse sets of accurate classifiers. Different feature selection techniques based on applying different heuristics for generating base classifiers can be adjusted to specific domain characteristics. In this paper we consider and experiment with the contextual feature merit measure as a feature selection heuristic. We use the diversity of an ensemble as evaluation function in our new algorithm with a refinement cycle. We have evaluated our algorithm on seven data sets from UCI. The experimental results show that for all these data sets ensemble feature selection based on the contextual merit and suitable starting amount of features produces an ensemble which with weighted voting never produces smaller accuracy than C4.5 alone with all the features.

https://doi.org/10.1007/3-540-44801-2_12