Estimating finite mixtures of semi-Markov chains: an application to the segmentation of temporal sensory data

RESEARCH PRODUCT

Hervé Cardot, Pascal Schlich, Guillaume Lecuelle, Michel Visalli

subject

future; Statistics and Probability; FOS: Computer and information sciences; Gamma distribution; mice; Computer science; Population; dominance; Statistics - Applications; 01 natural sciences; Methodology (stat.ME); models; Expectation–maximization algorithm; Model-based clustering; 010104 statistics & probability; 0404 agricultural biotechnology; [MATH.MATH-ST] Mathematics [math]/Statistics [math.ST]; Bayesian information criterion; Perception; Applications (stat.AP); Temporal dominance of sensations (TDS); [MATH] Mathematics [math]; 0101 mathematics; Statistics - Methodology; 2. Zero hunger; Markov chain; Markov renewal process; Statistical model; 04 agricultural and veterinary sciences; Identifiability; Mixture model; 040401 food science; [MATH.MATH-PR] Mathematics [math]/Probability [math.PR]; Penalized likelihood; Data mining; Statistics, Probability and Uncertainty; Categorical time series; sensations

description

Summary: In food science, information about how food products are perceived over time is of great interest for creating new products, modifying existing ones or, more generally, understanding the mechanisms of perception. Temporal dominance of sensations (TDS) is a technique for measuring temporal perception in which attributes describing a food product are chosen sequentially during tasting. This work introduces new statistical models based on finite mixtures of semi-Markov chains to describe data collected with the TDS protocol, allowing different temporal perceptions of the same product within a population. The identifiability of the parameters of such mixture models is discussed. Sojourn time distributions are fitted with gamma distributions, and a penalty is added to the log-likelihood to ensure that the expectation–maximization algorithm converges to a non-degenerate solution. Information criteria are used to determine the number of mixture components. The individual qualitative trajectories are then clustered using the maximum a posteriori probability approach. A simulation study confirms the good behaviour of the proposed estimation procedure. The methodology is illustrated on an example of consumers’ perception of a Gouda cheese, where it is used to assess the existence of several behaviours in terms of perception of this product.
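
The clustering step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the mixture parameters have already been estimated, that sojourn times depend only on the current attribute, and that the last sojourn is fully observed (no censoring). The function names, the dictionary layout and the attribute labels "A" and "B" are hypothetical and chosen only for illustration.

```python
import math

def gamma_logpdf(t, shape, scale):
    """Log-density of a gamma(shape, scale) distribution at t > 0."""
    return ((shape - 1.0) * math.log(t) - t / scale
            - math.lgamma(shape) - shape * math.log(scale))

def trajectory_loglik(states, durations, comp):
    """Log-likelihood of one trajectory under one semi-Markov component.

    comp is a dict with hypothetical keys:
      'init'    : {state: initial probability}
      'trans'   : {(from_state, to_state): transition probability}
      'sojourn' : {state: (gamma shape, gamma scale)}
    All probabilities used by the trajectory must be strictly positive.
    """
    ll = math.log(comp["init"][states[0]])
    for k, (state, duration) in enumerate(zip(states, durations)):
        shape, scale = comp["sojourn"][state]
        ll += gamma_logpdf(duration, shape, scale)
        if k + 1 < len(states):
            ll += math.log(comp["trans"][(state, states[k + 1])])
    return ll

def posterior_probs(states, durations, weights, comps):
    """Posterior probability of each mixture component for one trajectory."""
    logs = [math.log(w) + trajectory_loglik(states, durations, c)
            for w, c in zip(weights, comps)]
    top = max(logs)  # subtract the maximum for numerical stability
    unnorm = [math.exp(v - top) for v in logs]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def map_cluster(states, durations, weights, comps):
    """Index of the component with the highest posterior probability."""
    probs = posterior_probs(states, durations, weights, comps)
    return max(range(len(probs)), key=probs.__getitem__)

# Illustrative (made-up) two-component mixture over two attributes.
init = {"A": 0.5, "B": 0.5}
trans = {("A", "B"): 1.0, ("B", "A"): 1.0}
short = {"init": init, "trans": trans,
         "sojourn": {"A": (2.0, 1.0), "B": (2.0, 1.0)}}
long_ = {"init": init, "trans": trans,
         "sojourn": {"A": (2.0, 5.0), "B": (2.0, 5.0)}}
weights = [0.5, 0.5]
comps = [short, long_]

quick_taster = map_cluster(["A", "B"], [2.0, 2.5], weights, comps)
slow_taster = map_cluster(["A", "B"], [9.0, 12.0], weights, comps)
```

In this toy setting the two components differ only in their sojourn time scale, so a trajectory with short dominance durations is assigned to the first component and one with long durations to the second. A full procedure along the lines of the description would wrap such posterior probabilities in a penalized expectation–maximization loop and compare fits with an information criterion.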

year

2019-08-02

https://dx.doi.org/10.48550/arxiv.1806.04420