Bayesian model to detect phenotype-specific genes for copy number data

6533b7dbfe1ef96bd1270284

RESEARCH PRODUCT

Bayesian model to detect phenotype-specific genes for copy number data

Carlos Abellan J.j. Abellán Juan R. González

subject

Male Genotype Gene Dosage HapMap Project Biology lcsh:Computer applications to medicine. Medical informatics Population stratification Bayesian inference Polymorphism Single Nucleotide Biochemistry 03 medical and health sciences Bayes' theorem 0302 clinical medicine Structural Biology medicine Humans Computer Simulation Genetic Predisposition to Disease Genetic Testing Copy-number variation International HapMap Project lcsh:QH301-705.5 Molecular Biology 030304 developmental biology Genetic testing Genetics 0303 health sciences Models Statistical Models Genetic medicine.diagnostic_test Methodology Article Applied Mathematics Confounding Bayes Theorem 3. Good health Computer Science Applications Phenotype lcsh:Biology (General)030220 oncology & carcinogenesis lcsh:R858-859.7 Female DNA microarray Algorithms

description

Abstract Background An important question in genetic studies is to determine those genetic variants, in particular CNVs, that are specific to different groups of individuals. This could help in elucidating differences in disease predisposition and response to pharmaceutical treatments. We propose a Bayesian model designed to analyze thousands of copy number variants (CNVs) where only few of them are expected to be associated with a specific phenotype. Results The model is illustrated by analyzing three major human groups belonging to HapMap data. We also show how the model can be used to determine specific CNVs related to response to treatment in patients diagnosed with ovarian cancer. The model is also extended to address the problem of how to adjust for confounding covariates (e.g., population stratification). Through a simulation study, we show that the proposed model outperforms other approaches that are typically used to analyze this data when analyzing common copy-number polymorphisms (CNPs) or complex CNVs. We have developed an R package, called bayesGen, that implements the model and estimating algorithms. Conclusions Our proposed model is useful to discover specific genetic variants when different subgroups of individuals are analyzed. The model can address studies with or without control group. By integrating all data in a unique model we can obtain a list of genes that are associated with a given phenotype as well as a different list of genes that are shared among the different subtypes of cases.

year	journal	country	edition	language
2012-01-23	BMC Bioinformatics

https://doi.org/10.1186/1471-2105-13-130