6533b826fe1ef96bd12846c5

RESEARCH PRODUCT

Multiple testing in candidate gene situations: a comparison of classical, discrete, and resampling-based procedures.

Gerhard HommelAnja VictorAmelie Elsäβer

subject

Statistics and ProbabilityCandidate geneContrast (statistics)computer.software_genrePolymorphism Single NucleotideSet (abstract data type)Computational MathematicsSample size determinationResamplingData Interpretation StatisticalSample SizeStatisticsMultiple comparisons problemGeneticsCochran–Armitage test for trendRange (statistics)HumansComputer SimulationDiseaseData miningMolecular BiologycomputerGenetic Association StudiesMathematics

description

In candidate gene association studies, usually several elementary hypotheses are tested simultaneously using one particular set of data. The data normally consist of partly correlated SNP information. Every SNP can be tested for association with the disease, e.g., using the Cochran-Armitage test for trend. To account for the multiplicity of the test situation, different types of multiple testing procedures have been proposed. The question arises whether procedures taking into account the discreteness of the situation show a benefit especially in case of correlated data. We empirically evaluate several different multiple testing procedures via simulation studies using simulated correlated SNP data. We analyze FDR and FWER controlling procedures, special procedures for discrete situations, and the minP-resampling-based procedure. Within the simulation study, we examine a broad range of different gene data scenarios. We show that the main difference in the varying performance of the procedures is due to sample size. In small sample size scenarios, the minP-resampling procedure though controlling the stricter FWER even had more power than the classical FDR controlling procedures. In contrast, FDR controlling procedures led to more rejections in higher sample size scenarios.

10.2202/1544-6115.1729https://pubmed.ncbi.nlm.nih.gov/22822520