sample size calculation

样本量计算
  • 文章类型: Journal Article
    背景:中介分析,通常作为估计主要治疗效果的二次分析完成,调查暴露可能通过中介变量直接和间接影响结果的情况。尽管在调解分析中有很多关于权力的研究,其中大部分集中在检测间接影响的能力上。很少考虑调解途径的强度,即,分别为干预-中介路径和中介-结果路径,可能会影响检测总效应的能力,这与一项随机试验中的意向治疗效果相对应。
    方法:我们进行了一项模拟研究,以评估调解途径与测试总治疗效果的能力之间的关系,即,意向治疗效果。考虑一个样本大小,该样本大小是根据通常的公式计算的,用于测试双臂试验中的总效果。我们使用常规中介模型为连续中介和正常结果生成数据。我们使用简单线性回归估计总效应,并评估双侧检验的功效。我们通过改变中介路径的大小,同时保持总效果恒定来探索多个数据生成场景。
    结果:模拟显示,在预期的情况下,估计的总效果是无偏的,但是其标准误差的平均值随着中介-结果路径的大小和中介的残余误差的变化而增加,分别。因此,这会影响测试总效果的能力,当中介-结果路径是非平凡的并且采用了朴素的样本量时,它总是低于计划。分析解释证实,干预-中介者路径不会影响测试总效果的能力,但会影响中介者-结果路径。可以调整通常的效应大小考虑以考虑中介-结果路径的大小及其残余误差。
    结论:疗效和机制评估研究的样本量计算应考虑到中介-结果关联或检测总效果/意向治疗效果低于计划的风险。
    BACKGROUND: Mediation analysis, often completed as secondary analysis to estimating the main treatment effect, investigates situations where an exposure may affect an outcome both directly and indirectly through intervening mediator variables. Although there has been much research on power in mediation analyses, most of this has focused on the power to detect indirect effects. Little consideration has been given to the extent to which the strength of the mediation pathways, i.e., the intervention-mediator path and the mediator-outcome path respectively, may affect the power to detect the total effect, which would correspond to the intention-to-treat effect in a randomized trial.
    METHODS: We conduct a simulation study to evaluate the relation between the mediation pathways and the power of testing the total treatment effect, i.e., the intention-to-treat effect. Consider a sample size that is computed based on the usual formula for testing the total effect in a two-arm trial. We generate data for a continuous mediator and a normal outcome using the conventional mediation models. We estimate the total effect using simple linear regression and evaluate the power of a two-sided test. We explore multiple data generating scenarios by varying the magnitude of the mediation paths whilst keeping the total effect constant.
    RESULTS: Simulations show the estimated total effect is unbiased across the considered scenarios as expected, but the mean of its standard error increases with the magnitude of the mediator-outcome path and the variability in the residual error of the mediator, respectively. Consequently, this affects the power of testing the total effect, which is always lower than planned when the mediator-outcome path is non-trivial and a naive sample size was employed. Analytical explanation confirms that the intervention-mediator path does not affect the power of testing the total effect but the mediator-outcome path. The usual effect size consideration can be adjusted to account for the magnitude of the mediator-outcome path and its residual error.
    CONCLUSIONS: The sample size calculation for studies with efficacy and mechanism evaluation should account for the mediator-outcome association or risk the power to detect the total effect/intention-to-treat effect being lower than planned.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在具有时间至事件终点的临床试验中,相当大比例的患者被治愈(或长期存活)并不少见。如子宫内膜癌试验。在设计临床试验时,应使用混合物固化模型,以充分考虑固化分数。以前,混合固化模型样本量的计算基于组间潜伏期分布的比例风险假设,并采用对数秩检验推导样本量公式。在真正的研究中,两组的潜伏期分布通常不满足比例风险假设。本文推导了以有限平均生存时间为主要终点的混合固化模型的样本量计算公式,并进行了仿真和示例研究。受限平均生存时间检验不受比例风险假设的约束,并且所获得的治疗效果差异可以量化为生存时间增加或减少的年数(或月数),使临床患者与医生的沟通非常方便。模拟结果表明,无论比例风险假设是否满足,混合固化模型的受限平均生存时间检验估计的样本量都是准确的,并且在大多数情况下,在违反比例风险假设的情况下,样本量小于对数秩检验估计的样本量。
    It is not uncommon for a substantial proportion of patients to be cured (or survive long-term) in clinical trials with time-to-event endpoints, such as the endometrial cancer trial. When designing a clinical trial, a mixture cure model should be used to fully consider the cure fraction. Previously, mixture cure model sample size calculations were based on the proportional hazards assumption of latency distribution between groups, and the log-rank test was used for deriving sample size formulas. In real studies, the latency distributions of the two groups often do not satisfy the proportional hazards assumptions. This article has derived a sample size calculation formula for a mixture cure model with restricted mean survival time as the primary endpoint, and did simulation and example studies. The restricted mean survival time test is not subject to proportional hazards assumptions, and the difference in treatment effect obtained can be quantified as the number of years (or months) increased or decreased in survival time, making it very convenient for clinical patient-physician communication. The simulation results showed that the sample sizes estimated by the restricted mean survival time test for the mixture cure model were accurate regardless of whether the proportional hazards assumptions were satisfied and were smaller than the sample sizes estimated by the log-rank test in most cases for the scenarios in which the proportional hazards assumptions were violated.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    试点是小规模的初始实验,旨在指导未来的设计,更大的研究,以提高其效力。在本统计入门课程中,我们强调了五个常见的错误,这些错误限制了试点研究的实用性,并为避免此类错误并提高其有效性提供了实践指导。连接这些错误的共同点是对结果的规划不足和过度解释。这种方法损害了研究计划和未来实验级联的最终目标。为了支持我们的观点,过度解释是一种错误,我们提供了一个简单的模拟,以证明飞行员通常会对所研究的生物终点的变异性产生不准确的估计,并且频繁的低估将导致不确定的和不道德的后续实验.我们认为,计划良好的飞行员是研究级联的重要组成部分,仍然需要高标准实施。
    Pilots are small-scale initial experiments that are intended to guide the design of future, larger studies, with a view to increasing their effectiveness. In this statistical primer we highlight five common mistakes that limit the utility of pilot studies and provide practical guidance to avoid such errors and increase their effectiveness. The common thread connecting these mistakes is insufficient planning and over-interpretation of the results. This approach compromises the ultimate goals of the research programme and the future experimental cascade. In support of our view that over-interpretation is an error, we present a simple simulation to demonstrate that pilots will generally generate an inaccurate estimate of the variability of the biological endpoint under study and that frequent under-estimation will lead to inconclusive and unethical subsequent experiments. We argue that well planned pilots are an important part of the research cascade and still need to be implemented to a high standard.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在单独的阶梯式楔形随机试验(ISW-RT)中,受试者被分配到序列中,每个序列由一个控制期定义,然后是一个实验期。所有序列的总随访时间相同,但是控制期和实验期的持续时间在序列之间变化。据我们所知,与阶梯式楔形整群随机试验(SW-CRT)不同,ISW-RT没有经过验证的样本量计算公式.这项研究的目的是使SW-CRT使用的公式适应个体随机化的情况,并使用蒙特卡洛模拟研究验证这种适应。对于大多数情况,ISW-RT设计的拟议样本大小计算公式产生了令人满意的经验功率,除了边界附近具有操作特征值的情况(即,最小的周期数,非常高或非常低的自相关系数)。总的来说,这些结果为ISW-RT的样本量计算提供了有用的见解。
    In the individual stepped-wedge randomized trial (ISW-RT), subjects are allocated to sequences, each sequence being defined by a control period followed by an experimental period. The total follow-up time is the same for all sequences, but the duration of the control and experimental periods varies among sequences. To our knowledge, there is no validated sample size calculation formula for ISW-RTs unlike stepped-wedge cluster randomized trials (SW-CRTs). The objective of this study was to adapt the formula used for SW-CRTs to the case of individual randomization and to validate this adaptation using a Monte Carlo simulation study. The proposed sample size calculation formula for an ISW-RT design yielded satisfactory empirical power for most scenarios except scenarios with operating characteristic values near the boundary (i.e., smallest possible number of periods, very high or very low autocorrelation coefficient). Overall, the results provide useful insights into the sample size calculation for ISW-RTs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项研究引入了一个系统的框架,用于计算研究中的样本量,重点是奶牛的肠甲烷(CH4,g/kg的MI)产量降低。遵循系统审查和荟萃分析(PRISMA)指南的首选报告项目,我们在科学网进行了全面的搜索,Scopus,和PubMedCentral数据库,用于2012年至2023年发表的研究。纳入标准是:报告奶牛CH4产量及其变异性的研究,采用特定的实验设计(拉丁广场设计(LSD),交叉设计,随机完全区组设计(RCBD),和重复测量设计)和测量方法(开路呼吸测量室(RC),GreenFeed系统,和六氟化硫示踪技术),在加拿大进行,美国和欧洲。共150项研究,其中包括177份报告,符合我们的标准并被纳入数据库.我们使用数据库进行样本量计算的方法始于定义6个CH4产量降低水平(5、10、15、20、30和50%)。利用调整后的Cohen的f公式和功效分析,我们计算了从涉及3或4种治疗的研究中减少平衡LSD和RCBD报告所需的样本量。结果表明,受试者内研究(即,与受试者间研究相比,LSD)需要较小的样本量来检测CH4产量降低(即,RCBD)。尽管使用RC的实验通常需要较少的个体,因为它们具有较高的准确性,我们的结果表明,在4种治疗方法的RCBD研究报告中,这一预期优势并不明显.这项研究的一项关键创新是开发了一种基于Web的工具,该工具简化了样本量计算的过程(samplesizecalculator。ucdavis.edu)。使用Python开发,该工具利用广泛的数据库为特定的实验场景提供量身定制的样本量建议。它确保实验有足够的动力来检测CH4排放的有意义的差异,从而有助于科学严谨的研究在这个关键领域的环境和农业研究。凭借其用户友好的界面和强大的后端计算,该工具代表了在乳牛中计划和执行CH4排放研究的方法方面的重大进展,与全球可持续农业实践和环境保护努力保持一致。
    This research introduces a systematic framework for calculating sample size in studies focusing on enteric methane (CH4, g/kg of DMI) yield reduction in dairy cows. Adhering to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, we conducted a comprehensive search across the Web of Science, Scopus, and PubMed Central databases for studies published from 2012 to 2023. The inclusion criteria were: studies reporting CH4 yield and its variability in dairy cows, employing specific experimental designs (Latin Square Design (LSD), Crossover Design, Randomized Complete Block Design (RCBD), and Repeated Measures Design) and measurement methods (Open-circuit respirometry chambers (RC), the GreenFeed system, and the sulfur hexafluoride tracer technique), conducted in Canada, the United States and Europe. A total of 150 studies, which included 177 reports, met our criteria and were included in the database. Our methodology for using the database for sample size calculations began by defining 6 CH4 yield reduction levels (5, 10, 15, 20, 30, and 50%). Utilizing an adjusted Cohen\'s f formula and a power analysis we calculated the sample sizes required for these reductions in balanced LSD and RCBD reports from studies involving 3 or 4 treatments. The results indicate that within-subject studies (i.e., LSD) require smaller sample sizes to detect CH4 yield reductions compared with between-subject studies (i.e., RCBD). Although experiments using RC typically require fewer individuals due to their higher accuracy, our results demonstrate that this expected advantage is not evident in reports from RCBD studies with 4 treatments. A key innovation of this research is the development of a web-based tool that simplifies the process of sample size calculation (samplesizecalculator.ucdavis.edu). Developed using Python, this tool leverages the extensive database to provide tailored sample size recommendations for specific experimental scenarios. It ensures that experiments are adequately powered to detect meaningful differences in CH4 emissions, thereby contributing to the scientific rigor of studies in this critical area of environmental and agricultural research. With its user-friendly interface and robust backend calculations, this tool represents a significant advancement in the methodology for planning and executing CH4 emission studies in dairy cows, aligning with global efforts toward sustainable agricultural practices and environmental conservation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    已经提出了样本量公式,用于在配对设计下存在验证偏差的情况下比较两种灵敏度(特异性)。然而,现有的样本量公式涉及冗长的导数计算,实施起来太复杂。在本文中,我们为三个现有测试中的每一个提出了替代的样本量公式,两个Wald测试和一个加权McNemar测试。建议的样本量公式比现有的公式更直观,更易于实现。此外,通过比较基于三个测试计算的样本大小,我们可以证明,即使加权McNemar检验仅使用来自不一致对的数据,而两个Wald检验也使用来自一致对的额外数据,这三个检验具有相似的样本量。
    Sample size formulas have been proposed for comparing two sensitivities (specificities) in the presence of verification bias under a paired design. However, the existing sample size formulas involve lengthy calculations of derivatives and are too complicated to implement. In this paper, we propose alternative sample size formulas for each of three existing tests, two Wald tests and one weighted McNemar\'s test. The proposed sample size formulas are more intuitive and simpler to implement than their existing counterparts. Furthermore, by comparing the sample sizes calculated based on the three tests, we can show that the three tests have similar sample sizes even though the weighted McNemar\'s test only use the data from discordant pairs whereas the two Wald tests also use the additional data from accordant pairs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    In randomised controlled trials, the outcome of interest could be recurrent events, such as hospitalisations for heart failure. If mortality rates are non-negligible, both recurrent events and competing terminal events need to be addressed when formulating the estimand and statistical analysis is no longer trivial. In order to design future trials with primary recurrent event endpoints with competing risks, it is necessary to be able to perform power calculations to determine sample sizes. This paper introduces a simulation-based approach for power estimation based on a proportional means model for recurrent events and a proportional hazards model for terminal events. The simulation procedure is presented along with a discussion of what the user needs to specify to use the approach. The method is flexible and based on marginal quantities which are easy to specify. However, the method introduces a lack of a certain type of dependence. This is explored in a sensitivity analysis which suggests that the power is robust in spite of that. Data from a randomised controlled trial, LEADER, is used as the basis for generating data for a future trial. Finally, potential power gains of recurrent event methods as opposed to first event methods are discussed.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    集群随机试验是公共卫生和医学研究的重要设计,当出于科学或后勤原因,个体随机化不可行或不受欢迎时。然而,与具有相同总样本量的单独随机试验相比,集群内观察值之间的相关性导致统计功效降低.这种相关性-通常使用集群内相关系数进行量化-必须在样本量计算中考虑,以确保试验具有足够的动力。在本文中,我们首先描述了平行臂CRT的样本量计算原理,并解释如何将这些计算扩展到具有交叉设计的CRT,与基线测量和阶梯楔形设计。我们介绍了指导研究人员进行样本量计算的工具,并讨论了用于选择聚类内相关系数的先验估计的方法。我们还包括关于预期减员的额外考虑,少量的集群,以及协变量在随机化过程和分析中的使用。
    Cluster randomized trials are an essential design in public health and medical research, when individual randomization is infeasible or undesirable for scientific or logistical reasons. However, the correlation among observations within clusters leads to a decrease in statistical power compared to an individually randomised trial with the same total sample size. This correlation - often quantified using the intra-cluster correlation coefficient - must be accounted for in the sample size calculation to ensure that the trial is adequately powered. In this paper, we first describe the principles of sample size calculation for parallel-arm CRTs, and explain how these calculations can be extended to CRTs with cross-over designs, with a baseline measurement and stepped-wedge designs. We introduce tools to guide researchers with their sample size calculation and discuss methods to inform the choice of the a priori estimate of the intra-cluster correlation coefficient for the calculation. We also include additional considerations with respect to anticipated attrition, a small number of clusters, and use of covariates in the randomisation process and in the analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    有越来越多的潜在定量生物标志物可以允许早期评估治疗反应或疾病进展。然而,这类生物标志物的测量受到随机变异性的影响。因此,生物标志物在纵向测量中的差异不一定代表真正的变化,但可能是由这种随机测量变异性引起的.在纵向研究中使用定量生物标志物之前,因此,评估测量的可重复性是至关重要的。从测试-重测研究中获得的测量重复性可以通过重复性系数来量化,然后在随后的纵向研究中使用,以确定测量的差异是否代表实际变化或在预期的随机测量变异性范围内。重复性系数的点估计的质量,因此,直接控制纵向研究的评估质量。重复性系数估计精度取决于重测研究中的案例数,但是尽管它发挥了关键作用,不存在用于测试-重测研究的样本量计算的全面框架。为了解决这个问题,我们建立了这样一个框架,这允许测试-重测研究的灵活的样本量计算,基于纵向研究中新引入的关于评估质量的标准。这也允许对先前的测试-再测试研究进行回顾性评估。
    There is an increasing number of potential quantitative biomarkers that could allow for early assessment of treatment response or disease progression. However, measurements of such biomarkers are subject to random variability. Hence, differences of a biomarker in longitudinal measurements do not necessarily represent real change but might be caused by this random measurement variability. Before utilizing a quantitative biomarker in longitudinal studies, it is therefore essential to assess the measurement repeatability. Measurement repeatability obtained from test-retest studies can be quantified by the repeatability coefficient, which is then used in the subsequent longitudinal study to determine if a measured difference represents real change or is within the range of expected random measurement variability. The quality of the point estimate of the repeatability coefficient, therefore, directly governs the assessment quality of the longitudinal study. Repeatability coefficient estimation accuracy depends on the case number in the test-retest study, but despite its pivotal role, no comprehensive framework for sample size calculation of test-retest studies exists. To address this issue, we have established such a framework, which allows for flexible sample size calculation of test-retest studies, based upon newly introduced criteria concerning assessment quality in the longitudinal study. This also permits retrospective assessment of prior test-retest studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在临床试验中收集计数结果,用于几个治疗领域的新药开发,事件发生率通常用作单个主要终点。大于平均值的计数结果称为过度分散;因此,假设计数结果具有负二项分布。然而,在治疗哮喘和慢性阻塞性肺疾病(COPD)的临床试验中,监管机构建议,除了事件发生率外,还必须将与肺功能相关的连续终点作为主要终点进行评估.需要评估的两个共同主要终点包括过度分散的计数和连续的结果。一些研究人员在各种结果类型的共同主要终点的背景下提出了样本量计算方法。然而,具有两个共同主要终点的试验中样本量计算方法,包括过度分散的计数和连续的结果,计划治疗哮喘和COPD的临床试验时需要,仍有待提出。在这项研究中,我们旨在开发一种假设检验方法和相应的样本量计算方法,该方法具有两个共同的主要终点,包括过度分散的计数和连续结局.在模拟中,我们证明了所提出的样本量计算方法具有足够的功率精度。此外,我们说明了所提出的样本量计算方法在一项针对COPD患者的安慰剂对照3期试验中的应用.
    Count outcomes are collected in clinical trials for new drug development in several therapeutic areas and the event rate is commonly used as a single primary endpoint. Count outcomes that are greater than the mean value are termed overdispersion; thus, count outcomes are assumed to have a negative binomial distribution. However, in clinical trials for treating asthma and chronic obstructive pulmonary disease (COPD), a regulatory agency has suggested that a continuous endpoint related to lung function must be evaluated as a primary endpoint in addition to the event rate. The two co-primary endpoints that need to be evaluated include overdispersed count and continuous outcomes. Some researchers have proposed sample size calculation methods in the context of co-primary endpoints for various outcome types. However, methodologies for sample size calculation in trials with two co-primary endpoints, including overdispersed count and continuous outcomes, required when planning clinical trials for treating asthma and COPD, remain to be proposed. In this study, we aimed to develop a hypothesis-testing method and a corresponding sample size calculation method with two co-primary endpoints including overdispersed count and continuous outcomes. In a simulation, we demonstrated that the proposed sample size calculation method has adequate power accuracy. In addition, we illustrated an application of the proposed sample size calculation method to a placebo-controlled Phase 3 trial for patients with COPD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号