6533b85cfe1ef96bd12bc5ee

RESEARCH PRODUCT

Weighted Clustering of Sparse Educational Data

Mirka SaarelaT. Kärkkäinen

subject

sparse educational dataPISAclustering

description

Clustering as an unsupervised technique is predominantly used in unweighted settings. In this paper, we present an efficient version of a robust clustering algorithm for sparse educational data that takes the weights, aligning a sample with the corresponding population, into account. The algorithm is utilized to divide the Finnish student population of PISA 2012 (the latest data from the Programme for International Student Assessment) into groups, according to their attitudes and perceptions towards mathematics, for which one third of the data is missing. Furthermore, necessary modifications of three cluster indices to reveal an appropriate number of groups are proposed and demonstrated. peerReviewed

http://urn.fi/URN:NBN:fi:jyu-201508212733