
RESEARCH PRODUCT

Local Feature Selection with Dynamic Integration of Classifiers

Seppo Puuronen, Alexey Tsymbal

subject

Computer science; Dimensionality reduction; Feature vector; Decision tree; Feature selection; Pattern recognition; Evaluation function; Machine learning; Feature model; k-nearest neighbors algorithm; Minimum redundancy feature selection; Artificial intelligence

description

Multidimensional data are often heterogeneous in feature space: individual features have unequal importance in different subareas of that space. This motivates the search for a technique that splits the instance space strategically and identifies the best feature subset for each instance to be classified. Our technique follows the wrapper approach, in which a classification algorithm serves as the evaluation function that discriminates between feature subsets. To make the feature selection local, we apply a recent technique for dynamic integration of classifiers, which determines which classifier and which feature subset should be used for each new instance. Decision trees help restrict the number of feature combinations analyzed: for each new instance, we consider only those feature combinations that include the features present on the path taken by that instance in the decision tree built on the whole feature set. We evaluate our technique on data sets from the UCI machine learning repository. In our experiments, we use the C4.5 algorithm as the learning algorithm both for the base classifiers and for the decision trees that guide the local feature selection. The experiments show some advantages of local feature selection with dynamic integration of classifiers over selecting a single feature subset for the whole space.
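The selection loop described above can be illustrated with a minimal sketch. It is not the authors' implementation and makes several simplifying assumptions: a 1-nearest-neighbour rule stands in for C4.5 as the base classifier, the features on the decision-tree path are passed in by hand as the hypothetical argument `path_features` instead of being read from a trained tree, and the local quality of each feature subset is estimated as the leave-one-out error on the `k` training instances nearest to the new instance. All function names are illustrative.

```python
from itertools import combinations
import math


def candidate_subsets(n_features, path_features):
    """Non-empty feature subsets that contain every feature on the tree path."""
    required = frozenset(path_features)
    optional = [f for f in range(n_features) if f not in required]
    subsets = []
    for r in range(len(optional) + 1):
        for extra in combinations(optional, r):
            subset = tuple(sorted(required | set(extra)))
            if subset:  # skip the empty subset when no path features are given
                subsets.append(subset)
    return subsets


def distance(a, b, features):
    """Euclidean distance restricted to the given feature indices."""
    return math.sqrt(sum((a[f] - b[f]) ** 2 for f in features))


def nn_predict(X, y, x, features, skip=None):
    """1-NN prediction on `features` (assumed stand-in for a C4.5 base classifier)."""
    nearest = min((i for i in range(len(X)) if i != skip),
                  key=lambda i: distance(X[i], x, features))
    return y[nearest]


def loo_errors(X, y, features):
    """Leave-one-out 0/1 error of the base classifier on each training instance."""
    return [int(nn_predict(X, y, X[i], features, skip=i) != y[i])
            for i in range(len(X))]


def classify_locally(X, y, x, path_features, k=3):
    """Pick the feature subset with the lowest local error around x, then classify x."""
    all_features = range(len(x))
    # Neighbourhood of x, measured in the full feature space.
    neighbours = sorted(range(len(X)),
                        key=lambda i: distance(X[i], x, all_features))[:k]
    best_subset, best_error = None, None
    for subset in candidate_subsets(len(x), path_features):
        errors = loo_errors(X, y, subset)  # wrapper-style evaluation
        local_error = sum(errors[i] for i in neighbours) / len(neighbours)
        if best_error is None or local_error < best_error:
            best_subset, best_error = subset, local_error
    return nn_predict(X, y, x, best_subset), best_subset


# Toy data: feature 0 separates the classes, feature 1 is large-scale noise.
X = [[0.0, 5.0], [0.1, -5.0], [0.2, 4.0], [1.0, -4.0], [1.1, 5.0], [1.2, -5.0]]
y = [0, 0, 0, 1, 1, 1]
label, subset = classify_locally(X, y, [0.05, -4.5], path_features=[0])
```

Requiring the path features in every candidate shrinks the search from all subsets of the feature set to the subsets of the remaining features only, which is the restriction the description attributes to the decision tree. Recomputing the leave-one-out errors for each query keeps the sketch short; a practical version would cache them per subset.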

https://doi.org/10.1007/3-540-39963-1_44