关键词: false discovery rate multiple testing power proportion of true null hypotheses sample size

Mesh : Sample Size Algorithms Software Research Design Genomics

来  源:   DOI:10.3390/genes15030344   PDF(Pubmed)

Abstract:
The false discovery rate (FDR) is a widely used metric of statistical significance for genomic data analyses that involve multiple hypothesis testing. Power and sample size considerations are important in planning studies that perform these types of genomic data analyses. Here, we propose a three-rectangle approximation of a p-value histogram to derive a formula to compute the statistical power and sample size for analyses that involve the FDR. We also introduce the R package FDRsamplesize2, which incorporates these and other power calculation formulas to compute power for a broad variety of studies not covered by other FDR power calculation software. A few illustrative examples are provided. The FDRsamplesize2 package is available on CRAN.
摘要:
错误发现率(FDR)是用于涉及多个假设检验的基因组数据分析的统计显著性的广泛使用的度量。在计划进行这些类型的基因组数据分析的研究中,功率和样本量的考虑非常重要。这里,我们提出了p值直方图的三矩形近似,以得出一个公式来计算涉及FDR的分析的统计能力和样本大小。我们还介绍了R软件包FDRsamplesize2,该软件包结合了这些和其他功率计算公式,以计算其他FDR功率计算软件未涵盖的各种研究的功率。提供了几个说明性示例。FDRsamplesize2软件包在CRAN上可用。
公众号