0000000000297593

AUTHOR

David Fournier

Principal components analysis: theory and application to gene expression data analysis

Advances in computational power have enabled research to generate significant amounts of data related to complex biological problems. Consequently, applying appropriate data analysis techniques has become paramount to tackle this complexity. However, theoretical understanding of statistical methods is necessary to ensure that the correct method is used and that sound inferences are made based on the analysis. In this article, we elaborate on the theory behind principal components analysis (PCA), which has become a favoured multivariate statistical tool in the field of omics-data analysis. We discuss the necessary prerequisites and steps to produce statistically valid results and provide gui…

research product

Evidence for the implication of the histone code in building the genome structure

International audience; Histones are punctuated with small chemical modifications that alter their interaction with DNA. One attractive hypothesis stipulates that certain combinations of these histone modifications may function, alone or together, as a part of a predictive histone code to provide ground rules for chromatin folding. We consider four features that relate histone modifications to chromatin folding: charge neutralisation, molecular specificity, robustness and evolvability. Next, we present evidence for the association among different histone modifications at various levels of chromatin organisation and show how these relationships relate to function such as transcription, repli…

research product

Imputation of posterior linkage probability relations reveals a significant influence of structural 3D constraints on linkage disequilibrium

Genetic association studies have become increasingly important in unraveling the genetics of diseases or complex traits. Despite their value for modern genetics, conflicting conclusions often arise through the difficulty of confirming and replicating experimental results. We argue that this problem is largely based on the application of statistical relation measures that are not appropriate for genomic data analysis and demonstrate that the standard measures used for Genome-wide association studies or genomics linkage analysis bear a statistic bias. This may come from the violation of underlying assumptions (such as independence or stationarity) as well as from other conceptual limitations …

research product

Histone Code and Higher-Order Chromatin Folding: A Hypothesis

AbstractHistone modifications alone or in combination are thought to modulate chromatin structure and function; a concept termed histone code. By combining evidence from several studies, we investigated if the histone code can play a role in higher-order folding of chromatin. Firstly using genomic data, we analyzed associations between histone modifications at the nucleosome level. We could dissect the composition of individual nucleosomes into five predicted clusters of histone modifications. Secondly, by assembling the raw reads of histone modifications at various length scales, we noticed that the histone mark relationships that exist at nucleosome level tend to be maintained at the high…

research product

Deciphering the histone code to build the genome structure

Histones are punctuated with small chemical modifications that alter their interaction with DNA. One attractive hypothesis stipulates that certain combinations of these histone modifications may function, alone or together, as a part of a predictive histone code to provide ground rules for chromatin folding. We consider four features that relate histone modifications to chromatin folding: charge neutralisation, molecular specificity, robustness and evolvability. Next, we present evidence for the association among different histone modifications at various levels of chromatin organisation and show how these relationships relate to function such as transcription, replication and cell division…

research product