6533b7dcfe1ef96bd1272821

RESEARCH PRODUCT

Minimum message length clustering: an explication and some applications to vegetation data

L. SalminaL. MucinaMichael Bodley Dale

subject

Fuzzy clusteringEcologyComputer scienceVegetationcomputer.software_genreClass (biology)Minimum message lengthExplicationSection (archaeology)Animal ecologyData miningCluster analysiscomputerEcology Evolution Behavior and Systematics

description

In this paper, we examine the application of a particular approach to induction, the minimum message length principle and illustrate some of the problems that can be addressed through its use. The MML principle seeks to identify an optimal model within some specified parameterised class of models and for this paper we have chosen to concentrate on a single model class, that of mixture separation or fuzzy clustering. The first section presents, in outline, an MML methodology for fuzzy clustering. We then present some applications, including the nature of the within-cluster model, examination of the univocality of results for different groups of species and the effectiveness of presence data compared to purely quantitative data. Finally, we examine some possibilities of extending MML methodology to include within-class correlation of species, the existence of dependence between observed samples and the comparison of different classes of models.

https://doi.org/10.1556/comec.2.2001.2.11