futility stopping

  • 文章类型: Journal Article
    条件功率(CP)是组顺序设计中徒劳监视的广泛使用的方法。然而,采用CP方法可能导致II型错误率在所需水平上的控制不足。在这项研究中,我们引入了一种灵活的beta支出函数,该函数可以调整II型错误率,同时基于预定的标准化效应大小采用CP进行无效监测(所谓的CP-beta支出函数).该功能描述了整个试验过程中第二类错误率的支出。与其他现有的beta支出函数不同,CP-β支出功能将β支出概念无缝地纳入CP框架,有助于在徒劳监测期间精确分阶段控制II型错误率。此外,从CP-β支出函数导出的停止边界可以通过类似于其他传统β支出函数方法的积分来计算。此外,拟议的CP-β支出函数在试验的不同阶段适应CP量表上的各种阈值,确保其在不同信息时间场景下的适应性。这些属性使CP-β支出函数在其他形式的β支出函数中具有竞争力,使其适用于任何试验组顺序设计与直接实施。仿真研究和来自急性缺血性卒中试验的示例都表明,所提出的方法准确地捕获了预期功率,即使最初确定的样本量不认为停止是徒劳的,并在保持整体I型错误率方面表现出良好的性能,以明显无效。
    Conditional power (CP) serves as a widely utilized approach for futility monitoring in group sequential designs. However, adopting the CP methods may lead to inadequate control of the type II error rate at the desired level. In this study, we introduce a flexible beta spending function tailored to regulate the type II error rate while employing CP based on a predetermined standardized effect size for futility monitoring (a so-called CP-beta spending function). This function delineates the expenditure of type II error rate across the entirety of the trial. Unlike other existing beta spending functions, the CP-beta spending function seamlessly incorporates beta spending concept into the CP framework, facilitating precise stagewise control of the type II error rate during futility monitoring. In addition, the stopping boundaries derived from the CP-beta spending function can be calculated via integration akin to other traditional beta spending function methods. Furthermore, the proposed CP-beta spending function accommodates various thresholds on the CP-scale at different stages of the trial, ensuring its adaptability across different information time scenarios. These attributes render the CP-beta spending function competitive among other forms of beta spending functions, making it applicable to any trials in group sequential designs with straightforward implementation. Both simulation study and example from an acute ischemic stroke trial demonstrate that the proposed method accurately captures expected power, even when the initially determined sample size does not consider futility stopping, and exhibits a good performance in maintaining overall type I error rates for evident futility.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    多阶段随机试验设计可以通过在研究期间实验臂与对照臂相比表现出低或高效能时允许提前终止试验来显著提高效率。然而,适当的推断方法是必要的,因为目标统计量的基本分布由于多级结构而变化。本文重点介绍具有二分法结果的多阶段随机II期试验,如治疗反应,并为比值比提出了精确的条件置信区间。通常的单阶段置信区间在多阶段试验中使用时无效。为了解决这个问题,我们提出了所有可能结果的线性排序。这种排序以每个阶段的响应者总数为条件,并利用结果的确切条件分布函数。这种方法能够估计考虑多阶段设计的精确置信区间。
    A multi-stage randomized trial design can significantly improve efficiency by allowing early termination of the trial when the experimental arm exhibits either low or high efficacy compared to the control arm during the study. However, proper inference methods are necessary because the underlying distribution of the target statistic changes due to the multi-stage structure. This article focuses on multi-stage randomized phase II trials with a dichotomous outcome, such as treatment response, and proposes exact conditional confidence intervals for the odds ratio. The usual single-stage confidence intervals are invalid when used in multi-stage trials. To address this issue, we propose a linear ordering of all possible outcomes. This ordering is conditioned on the total number of responders in each stage and utilizes the exact conditional distribution function of the outcomes. This approach enables the estimation of an exact confidence interval accounting for the multi-stage designs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    随机试验的多阶段设计是在研究期间发现实验臂与对照相比具有低或高的功效时允许提前终止研究。在这样的审判中,早期停止规则会导致治疗效果的最大似然估计存在偏差.我们考虑多阶段随机试验的二分结果,如治疗反应,并研究比值比的估计。通常,随机II期癌症临床试验具有小样本量的两阶段设计,这使得赔率比的估计更具挑战性。在本文中,我们评估了几种现有的比值比估计方法,并提出了随机多阶段试验的偏倚校正估计器,包括随机II期癌症临床试验。通过数值研究,所提出的估计器总体上具有较小的偏差和较小的均方误差。
    A multi-stage design for a randomized trial is to allow early termination of the study when the experimental arm is found to have low or high efficacy compared to the control during the study. In such a trial, an early stopping rule results in bias in the maximum likelihood estimator of the treatment effect. We consider multi-stage randomized trials on a dichotomous outcome, such as treatment response, and investigate the estimation of the odds ratio. Typically, randomized phase II cancer clinical trials have two-stage designs with small sample sizes, which makes the estimation of odds ratio more challenging. In this paper, we evaluate several existing estimation methods of odds ratio and propose bias-corrected estimators for randomized multi-stage trials, including randomized phase II cancer clinical trials. Through numerical studies, the proposed estimators are shown to have a smaller bias and a smaller mean squared error overall.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    UNASSIGNED:我们探索了贝叶斯自适应设计的频率运行特性,该特性允许为徒劳而连续提前停止。特别是,当累积的患者数量多于最初计划时,我们将重点放在功效与样本量的关系上.
    UNASSIGNED:我们考虑II期单臂研究和贝叶斯II期结果自适应随机化设计的情况。对于前者,分析计算是可能的;对于后者,进行了模拟。
    UNASSIGNED:两种情况的结果显示,随着样本量的增加,功率降低。看来,这种影响是由于错误地停止徒劳的累积概率增加。
    UNASSIGNED:为徒劳而错误停止的累积概率增加与早期停止的连续性有关,这增加了应计中期分析的数量。这个问题可以通过,例如,延迟测试的开始,减少要进行的徒劳测试的数量,或通过设定更严格的无效结论标准。
    We explore frequentist operating characteristics of a Bayesian adaptive design that allows continuous early stopping for futility. In particular, we focus on the power versus sample size relationship when more patients are accrued than originally planned.
    We consider the case of a phase II single-arm study and a Bayesian phase II outcome-adaptive randomization design. For the former, analytical calculations are possible; for the latter, simulations are conducted.
    Results for both cases show a decrease in power with an increasing sample size. It appears that this effect is due to the increasing cumulative probability of incorrectly stopping for futility.
    The increase in cumulative probability of incorrectly stopping for futility is related to the continuous nature of the early stopping, which increases the number of interim analyses with accrual. The issue can be addressed by, for instance, delaying the start of testing for futility, reducing the number of futility tests to be performed or by setting stricter criteria for concluding futility.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Clinical Trial
    进行II期试验以研究实验疗法是否足够有效以进行大规模III期试验。尽管设计和分析方法进步很快,单臂两阶段设计仍然是最受欢迎的II期癌症临床试验。在这篇评论文章中,我们讨论了两种常用于II期临床试验的设计和分析方法,但这可能会导致严重的偏见。一个是关于使用样本比例作为单臂两阶段试验的真实反应率的估计器。对于具有无效性临时测试的两阶段设计,忽略两阶段设计,样本比例呈负偏。另一个是关于针对患者人群的单臂II期试验的设计和分析,这些患者人群由具有不同反应率的多个亚群组成。在这种情况下,一种标准设计方法是预测每个亚群的患病率,并根据整个人群的预期应答率选择标准的两阶段设计。在这种情况下,通过使用非分层统计测试,如果观察到的患病率与预测的患病率有很大不同,则标准分析方法可能存在严重偏差。在本文中,我们回顾了为避免这些偏差来源而提出的适当的设计和分析方法。
    A phase II trial is conducted to investigate if an experimental therapy is efficacious enough to proceed to a large-scale phase III trial or not. In spite of the fast progress in design and analysis methods, single-arm two-stage design is still the most popular for phase II cancer clinical trials. In this review article, we discuss two design and analysis methods that are popularly used for phase II clinical trials, but that can cause serious bias. One is about using the sample proportion as the estimator of the true response rate from single-arm two-stage trials. For a two-stage design with a futility interim test, the sample proportion is negatively biased by ignoring the two-stage design. The other is about the design and analysis of single-arm phase II trials for patient populations consisting of multiple sub-populations with different response rates. In this case, a standard design method is to project the prevalence of each subpopulation and select a standard two-stage design based on the expected response rate for the whole population. By using an unstratified statistical testing in this case, the standard analysis method can be seriously biased if the observed prevalence is very different from the projected one. In this paper, we review appropriate design and analysis methods that are proposed to avoid these sources of bias.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Clinical Trial, Phase II
    In this paper, we propose a randomized Bayesian optimal phase II (RBOP2) design with a binary endpoint (e.g., response rate). A beta-binomial distribution is used to model the binary endpoint for a two-arm phase II trial. Posterior probabilities of the endpoint of interest are evaluated at each interim look and used in the decision to stop the trial due to futility. Compared with other Bayesian designs, the proposed RBOP2 design has the following merits: (i) strongly controls the type I error rate at a pre-defined level; (ii) optimizes the stopping boundaries, thus maximizing the power to detect treatment effects and minimizing the expected sample size for futile treatment; (iii) does not limit the number of interim looks, thus enabling frequent trial monitoring; and (iv) allows the stopping boundaries to be pre-defined in the protocol and is easy to implement. We conduct simulation studies to compare the proposed design with a group sequential design and other Bayesian randomized designs and evaluate its operating characteristics under different scenarios.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Stopping for futility is a useful tool in a clinical trial. It is widely used in single-arm trials in oncology and in many two-arm trials. We review three stopping rules for futility. We give recommendations for the optimal timing of futility looks in two-stage trials in terms of the information fraction and the probability of stopping under the alternative hypothesis. We discuss futility stopping in trials with substantial uncertainty about the variability of the outcome and in crossover trials.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    Designs incorporating more than one endpoint have become popular in drug development. One of such designs allows for incorporation of short-term information in an interim analysis if the long-term primary endpoint has not been yet observed for some of the patients. At first we consider a two-stage design with binary endpoints allowing for futility stopping only based on conditional power under both fixed and observed effects. Design characteristics of three estimators: using primary long-term endpoint only, short-term endpoint only, and combining data from both are compared. For each approach, equivalent cut-off point values for fixed and observed effect conditional power calculations can be derived resulting in the same overall power. While in trials stopping for futility the type I error rate cannot get inflated (it usually decreases), there is loss of power. In this study, we consider different scenarios, including different thresholds for conditional power, different amount of information available at the interim, different correlations and probabilities of success. We further extend the methods to adaptive designs with unblinded sample size reassessments based on conditional power with inverse normal method as the combination function. Two different futility stopping rules are considered: one based on the conditional power, and one from P-values based on Z-statistics of the estimators. Average sample size, probability to stop for futility and overall power of the trial are compared and the influence of the choice of weights is investigated.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    Two-arm group sequential designs have been widely used for over 40 years, especially for studies with mortality endpoints. The natural generalization of such designs to trials with multiple treatment arms and a common control (MAMS designs) has, however, been implemented rarely. While the statistical methodology for this extension is clear, the main limitation has been an efficient way to perform the computations. Past efforts were hampered by algorithms that were computationally explosive. With the increasing interest in adaptive designs, platform designs, and other innovative designs that involve multiple comparisons over multiple stages, the importance of MAMS designs is growing rapidly. This article provides break-through algorithms that can compute MAMS boundaries rapidly thereby making such designs practical. For designs with efficacy-only boundaries the computational effort increases linearly with number of arms and number of stages. For designs with both efficacy and futility boundaries the computational effort doubles with successive increases in number of stages.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号