关键词: Family-wise error rate Genome-wide association studies Multiple testing Post-selection inference

Mesh : Case-Control Studies Computer Simulation Data Interpretation, Statistical Genome-Wide Association Study / statistics & numerical data Humans Mass Screening / methods statistics & numerical data Monte Carlo Method Polymorphism, Single Nucleotide Risk Factors Stroke / genetics Time Factors

来  源:   DOI:10.1111/biom.12957   PDF(Pubmed)

Abstract:
Assessing the statistical significance of risk factors when screening large numbers of 2 × 2 tables that cross-classify disease status with each type of exposure poses a challenging multiple testing problem. The problem is especially acute in large-scale genomic case-control studies. We develop a potentially more powerful and computationally efficient approach (compared with existing methods, including Bonferroni and permutation testing) by taking into account the presence of complex dependencies between the 2 × 2 tables. Our approach gains its power by exploiting Monte Carlo simulation from the estimated null distribution of a maximally selected log-odds ratio. We apply the method to case-control data from a study of a large collection of genetic variants related to the risk of early onset stroke.
摘要:
暂无翻译
公众号