0000000000985850

AUTHOR

Catherine Mercier

VARIABLE SELECTION FOR NOISY DATA APPLIED IN PROTEOMICS

International audience; The paper proposes a variable selection method for pro-teomics. It aims at selecting, among a set of proteins, those (named biomarkers) which enable to discriminate between two groups of individuals (healthy and pathological). To this end, data is available for a cohort of individuals: the biological state and a measurement of concentrations for a list of proteins. The proposed approach is based on a Bayesian hierarchical model for the dependencies between biological and instrumental variables. The optimal selection function minimizes the Bayesian risk, that is to say the selected set of variables maximizes the posterior probability. The two main contributions are: (…

research product

Mixed-model of ANOVA for measurement reproducibility in proteomics

This work is a statistical analysis of reproducibility of a MALDI-TOF mass spectrometry experiment. Its aim is to evaluate measurement variability and compare peak intensities from two types of MALDI-TOF platforms. We compared and commented on the abilities of Principal Component Analysis and mixed-model analysis of variance to evaluate the biological variability and the technical variability of peak intensities in different patients. The properties and hypotheses of both methods are summarized and applied to spectra from plasma of patients with Hodgkin lymphoma. Principal Component Analysis checks rapidly the balance between the two variabilities; however, a mixed-model analysis of varianc…

research product

Variance component analysis to assess protein quantification in biomarker discovery. Application to MALDI-TOF mass spectrometry.

International audience; Controlling the technological variability on an analytical chain is critical for biomarker discovery. The sources of technological variability should be modeled, which calls for specific experimental design, signal processing, and statistical analysis. Furthermore, with unbalanced data, the various components of variability cannot be estimated with the sequential or adjusted sums of squares of usual software programs. We propose a novel approach to variance component analysis with application to the matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) technology and use this approach for protein quantification by a classical signal processing algori…

research product