continuous outcome

连续结果
  • 文章类型: Journal Article
    当研究使用不同的量表来衡量连续结果时,对数据进行荟萃分析需要标准化平均差(SMD)。然而,结局通常报告为终点或基线评分的变化.组合相应的SMD可能是有问题的,并且可用的指导建议反对这种做法。我们旨在研究将两种类型的SMD结合在抑郁症严重程度的荟萃分析中的影响。我们使用了药物干预的个体参与者数据(89项研究,27,409名参与者)和互联网提供的认知行为疗法(iCBT;61项研究,13,687名参与者)用于抑郁症,以比较研究水平的终点和基线SMD的变化。接下来,我们使用端点SMD进行了成对(PWMA)和网络荟萃分析(NMA),从基线SMD的变化,或者两者的混合物。从终点计算的特定研究SMD和基线数据的变化在很大程度上相似,尽管对于iCBT干预,3个月时25%的研究与研究特异性SMD之间的重要差异相关(中位数0.01,IQR-0.10,0.13),尤其是在基线失衡的较小试验中.然而,当汇集时,终点和变化SMD之间的差异可以忽略不计。仅合并两个SMD中更有利的部分不会对荟萃分析产生实质性影响,导致药理学和iCBT数据集中的合并SMD差异高达0.05和0.13,分别。我们的发现对抑郁症的荟萃分析有意义,其中我们表明,在估计SMD的终点和变化分数之间的选择对汇总荟萃分析估计没有实质性影响。未来的研究应该复制并将我们的分析扩展到抑郁症以外的领域。
    When studies use different scales to measure continuous outcomes, standardised mean differences (SMD) are required to meta-analyse the data. However, outcomes are often reported as endpoint or change from baseline scores. Combining corresponding SMDs can be problematic and available guidance advises against this practice. We aimed to examine the impact of combining the two types of SMD in meta-analyses of depression severity. We used individual participant data on pharmacological interventions (89 studies, 27,409 participants) and internet-delivered cognitive behavioural therapy (iCBT; 61 studies, 13,687 participants) for depression to compare endpoint and change from baseline SMDs at the study level. Next, we performed pairwise (PWMA) and network meta-analyses (NMA) using endpoint SMDs, change from baseline SMDs, or a mixture of the two. Study-specific SMDs calculated from endpoint and change from baseline data were largely similar, although for iCBT interventions 25% of the studies at 3 months were associated with important differences between study-specific SMDs (median 0.01, IQR -0.10, 0.13) especially in smaller trials with baseline imbalances. However, when pooled, the differences between endpoint and change SMDs were negligible. Pooling only the more favourable of the two SMDs did not materially affect meta-analyses, resulting in differences of pooled SMDs up to 0.05 and 0.13 in the pharmacological and iCBT datasets, respectively. Our findings have implications for meta-analyses in depression, where we showed that the choice between endpoint and change scores for estimating SMDs had immaterial impact on summary meta-analytic estimates. Future studies should replicate and extend our analyses to fields other than depression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在具有二元终点的集群随机试验中,治疗组和集群之间的异质性结果相关性越来越得到认可。已经开发了分析方法来研究这种异质性。然而,对于具有连续结局的整群随机试验,尚未研究整群特异性结局差异和相关性.
    本文提出了在贝叶斯设置中拟合的具有分层方差结构的模型,以量化跨集群的异质方差,并在结果连续时使用集群级别的协变量进行解释。该模型还可以扩展到分析单独随机组治疗试验中的异质性差异,使用手臂特定的集群级别的协变量,或部分嵌套的设计。进行了仿真研究,以验证新引入的模型在不同设置中的性能。
    仿真表明,总体而言,新推出的模型具有良好的性能,报告方差模型中的组内相关系数和回归参数的偏差较低,覆盖率约为95%.当差异是异质的时,与具有齐次方差的模型相比,我们提出的模型改进了模型拟合。当用于分析喀拉拉邦糖尿病预防计划研究的数据时,我们的模型识别了不同聚类的异质性方差和组内相关系数,并检查了与这种异质性相关的聚类水平特征.
    我们提出了新的分层贝叶斯方差模型,以适应集群随机试验中特定于集群的方差。新开发的方法有助于理解如何在集群中实施和传播干预策略,并有助于改进未来的试验设计。
    UNASSIGNED: Heterogeneous outcome correlations across treatment arms and clusters have been increasingly acknowledged in cluster randomized trials with binary endpoints, where analytical methods have been developed to study such heterogeneity. However, cluster-specific outcome variances and correlations have yet to be studied for cluster randomized trials with continuous outcomes.
    UNASSIGNED: This article proposes models fitted in the Bayesian setting with hierarchical variance structure to quantify heterogeneous variances across clusters and explain it with cluster-level covariates when the outcome is continuous. The models can also be extended to analyzing heterogeneous variances in individually randomized group treatment trials, with arm-specific cluster-level covariates, or in partially nested designs. Simulation studies are carried out to validate the performance of the newly introduced models across different settings.
    UNASSIGNED: Simulations showed that overall the newly introduced models have good performance, reporting low bias and approximately 95% coverage for the intraclass correlation coefficients and regression parameters in the variance model. When variances are heterogeneous, our proposed models had improved model fit over models with homogeneous variances. When used to analyze data from the Kerala Diabetes Prevention Program study, our models identified heterogeneous variances and intraclass correlation coefficients across clusters and examined cluster-level characteristics associated with such heterogeneity.
    UNASSIGNED: We proposed new hierarchical Bayesian variance models to accommodate cluster-specific variances in cluster randomized trials. The newly developed methods inform the understanding of how an intervention strategy is implemented and disseminated differently across clusters and can help improve future trial design.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    自从循证医学时代以来,在临床研究中使用统计数据来创建客观证据已成为理所当然的事情。作为延伸,在临床研究中,在开始研究之前计算正确的样本量以证明临床上的显著差异已变得至关重要.此外,因为样本量计算方法因研究设计而异,没有适用于所有设计的样本量计算公式。了解这一点对我们来说非常重要。在这次审查中,使用R程序(RFoundationforStatisticalComputing)介绍了适用于各种研究设计的每种样本量计算方法。为了临床研究人员根据未来的研究直接利用它,我们提出了实践守则,输出结果,并解释每种情况的结果。
    Since the era of evidence-based medicine, it has become a matter of course to use statistics to create objective evidence in clinical research. As an extension of this, it has become essential in clinical research to calculate the correct sample size to demonstrate a clinically significant difference before starting the study. Also, because sample size calculation methods vary from study design to study design, there is no formula for sample size calculation that applies to all designs. It is very important for us to understand this. In this review, each sample size calculation method suitable for various study designs was introduced using the R program (R Foundation for Statistical Computing). In order for clinical researchers to directly utilize it according to future research, we presented practice codes, output results, and interpretation of results for each situation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:可以从不同的平均差异(MD)和标准偏差(SD)计算出标准化的平均差异(SMD)。本研究旨在探讨临床试验是如何计算,报告并解释了SMD,并检查不同SMD之间的差异。
    方法:我们在PubMed中搜索了估计SMD的普通医学和精神病学的随机对照试验。我们探索了SMD是如何计算和解释的。我们基于不同的MD和SD计算了SMD,以及每个研究的这些SMD估计值的变化。
    结果:我们收录了161篇文章。使用各种MD和SD来计算SMD,然而,69.0%的研究未能提供足够的细节.一项研究中使用不同MD和SD的SMD估计值可能存在很大差异(绝对差异的中位数为0.3,四分位数间距IQR为0.17至0.53)。然而,68.3%的研究基于相同的参考解释了SMD,科恩的经验法则。在小样本量和大报告效应的研究中观察到最大的变化。
    结论:使用不同MD和SD的SMD可能会有很大差异,但是报告往往不够充分,解释过于简化。为了避免选择性报告偏见和误解,预先指定和报告方法并从多个角度解释结果是可取的。
    OBJECTIVE: The standardized mean difference (SMD) can be calculated from different mean differences (MDs) and standard deviations (SDs). This study aims to investigate how clinical trials calculated, reported and interpreted the SMD, and to examine the variation between different SMDs.
    METHODS: We searched the PubMed for randomized controlled trials of general medicine and psychiatry that estimated SMDs. We explored how the SMD was computed and interpreted. We calculated SMDs based on different MDs and SDs, and the variation in these SMD estimates for each study.
    RESULTS: We included 161 articles. Various MDs and SDs were used to calculate SMDs, yet 69.0% studies failed to provide sufficient details. Variations in SMD estimates using different MDs and SDs in one study could be substantial (median of the absolute differences was 0.3, interquartile range IQR 0.17 to 0.53). However, 68.3% studies interpreted the SMD based on the same reference, Cohen\'s rule of thumb. The largest variations were observed in studies with small sample sizes and large reported effects.
    CONCLUSIONS: SMDs using different MDs and SDs could vary considerably, but the report was often insufficient and the interpretation was oversimplified. To avoid selective reporting bias and misinterpretation, prespecifying and reporting the method and interpreting the result from multiple perspectives are desirable.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    未经评估:在非随机研究(NRS)中,连续结果变量(例如,抑郁症状)在基线和随访时进行评估,通常观察到治疗/暴露组和对照组之间基线值的不平衡.这可能会使研究产生偏差,从而导致荟萃分析(MA)估计。这些估计可能因用于处理此问题的统计方法而异。个体参与者数据(IPD)的分析允许跨研究的方法标准化。我们的目标是确定已发表的NRSIPD-MA中用于连续结果的方法,并使用来自甲状腺研究协作(TSC)的两个经验示例比较不同的方法来解释NRS的IPD-MA中结果变量的基线值。
    未经评估:对于我们在MEDLINE中系统地搜索的第一个目标,EMBASE,和Cochrane从成立到2021年2月,以确定已发布的NRS的IPD-MA,这些IPD-MA在连续结果分析中针对基线结果指标进行了调整。对于第二个目标,我们应用协方差分析(ANCOVA),更改分数,IPD-MA中的倾向评分和幼稚方法(忽略基线结果数据)来自NRS的亚临床甲状腺功能亢进与抑郁症状和肾功能之间的关联.我们估计了研究和荟萃分析平均差异(MD)和相对标准误差(SE)。我们使用了固定和随机效果MA。
    未经评估:纳入的18项研究中有10项(56%)使用了变化评分法,7项(39%)研究使用ANCOVA,1项倾向评分(5%).在基线对结果变量进行平衡的研究中,研究估计与方法相似,但在基线失衡的研究中有所不同。在我们的经验例子中,ANCOVA和变化评分显示了相同方向的研究结果,不是倾向得分。在我们的应用中,ANCOVA提供了更精确的估计,在研究和荟萃分析层面,与其他方法相比。当使用变化评分作为结果时,异质性更高,ANCOVA为中度,倾向评分为零。
    UNASSIGNED:ANCOVA在研究和荟萃分析水平上提供了最精确的估计,因此在非随机研究的IPD荟萃分析中似乎更可取。对于组间平衡良好的研究,更改分数,ANCOVA的表现类似。
    UNASSIGNED: In non-randomized studies (NRSs) where a continuous outcome variable (e.g., depressive symptoms) is assessed at baseline and follow-up, it is common to observe imbalance of the baseline values between the treatment/exposure group and control group. This may bias the study and consequently a meta-analysis (MA) estimate. These estimates may differ across statistical methods used to deal with this issue. Analysis of individual participant data (IPD) allows standardization of methods across studies. We aimed to identify methods used in published IPD-MAs of NRSs for continuous outcomes, and to compare different methods to account for baseline values of outcome variables in IPD-MA of NRSs using two empirical examples from the Thyroid Studies Collaboration (TSC).
    UNASSIGNED: For the first aim we systematically searched in MEDLINE, EMBASE, and Cochrane from inception to February 2021 to identify published IPD-MAs of NRSs that adjusted for baseline outcome measures in the analysis of continuous outcomes. For the second aim, we applied analysis of covariance (ANCOVA), change score, propensity score and the naïve approach (ignores the baseline outcome data) in IPD-MA from NRSs on the association between subclinical hyperthyroidism and depressive symptoms and renal function. We estimated the study and meta-analytic mean difference (MD) and relative standard error (SE). We used both fixed- and random-effects MA.
    UNASSIGNED: Ten of 18 (56%) of the included studies used the change score method, seven (39%) studies used ANCOVA and one the propensity score (5%). The study estimates were similar across the methods in studies in which groups were balanced at baseline with regard to outcome variables but differed in studies with baseline imbalance. In our empirical examples, ANCOVA and change score showed study results on the same direction, not the propensity score. In our applications, ANCOVA provided more precise estimates, both at study and meta-analytical level, in comparison to other methods. Heterogeneity was higher when change score was used as outcome, moderate for ANCOVA and null with the propensity score.
    UNASSIGNED: ANCOVA provided the most precise estimates at both study and meta-analytic level and thus seems preferable in the meta-analysis of IPD from non-randomized studies. For the studies that were well-balanced between groups, change score, and ANCOVA performed similarly.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    有必要正确处理总体缺失结果数据,以最大程度地减少系统评价结论中的偏见。已经提出了两阶段模式混合模型来解决聚合丢失的连续结果数据。虽然这种方法与排除缺失的连续结果数据和简单的插补方法相比更合适,它没有提供对缺失的连续结果数据的灵活建模,以彻底调查它们对结论的影响。因此,我们提出了贝叶斯框架下的一阶段模式混合模型方法,以解决干预网络中缺失的连续结果数据,并获得关于不同试验和干预中的错误过程的知识.我们扩展了一个汇总连续结果的分层网络元分析模型,以包含一个错误参数,该参数可以衡量与随机假设的偏离。我们考虑连续数据的各种效应大小估计,和两个信息错误参数,手段的信息性错误差异和手段的信息性错误比率。我们结合了先前对错误参数的信念,同时考虑了先前结构的几种可能性,以说明网络中错误过程可能不同的事实。该方法在来自已发表的评论的两个网络中进行了示例,这些评论包括不同数量的缺失连续结果数据。
    Appropriate handling of aggregate missing outcome data is necessary to minimise bias in the conclusions of systematic reviews. The two-stage pattern-mixture model has been already proposed to address aggregate missing continuous outcome data. While this approach is more proper compared with the exclusion of missing continuous outcome data and simple imputation methods, it does not offer flexible modelling of missing continuous outcome data to investigate their implications on the conclusions thoroughly. Therefore, we propose a one-stage pattern-mixture model approach under the Bayesian framework to address missing continuous outcome data in a network of interventions and gain knowledge about the missingness process in different trials and interventions. We extend the hierarchical network meta-analysis model for one aggregate continuous outcome to incorporate a missingness parameter that measures the departure from the missing at random assumption. We consider various effect size estimates for continuous data, and two informative missingness parameters, the informative missingness difference of means and the informative missingness ratio of means. We incorporate our prior belief about the missingness parameters while allowing for several possibilities of prior structures to account for the fact that the missingness process may differ in the network. The method is exemplified in two networks from published reviews comprising a different amount of missing continuous outcome data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    Power analysis is a key component for planning prospective studies such as clinical trials. However, some journals in biomedical and psychosocial sciences ask for power analysis for data already collected and analysed before accepting manuscripts for publication. In this report, post hoc power analysis for retrospective studies is examined and the informativeness of understanding the power for detecting significant effects of the results analysed, using the same data on which the power analysis is based, is scrutinised. Monte Carlo simulation is used to investigate the performance of posthoc power analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Comparative Study
    The rank-ordered logit (rologit) model was recently introduced as a robust approach for analysing continuous outcomes, with the linear exposure effect estimated by scaling the rank-based log-odds estimate. Here we extend the application of the rologit model to continuous outcomes with ties and ordinal outcomes treated as imperfectly-observed continuous outcomes. By identifying the functional relationship between survival times and continuous outcomes, we explicitly establish the equivalence between the rologit and Cox models to justify the use of the Breslow, Efron and perturbation methods in the analysis of continuous outcomes with ties. Using simulation, we found all three methods perform well with few ties. Although an increasing extent of ties increased the bias of the log-odds and linear effect estimates and resulted in reduced power, which was somewhat worse when the model was mis-specified, the perturbation method maintained a type I error around 5%, while the Efron method became conservative with heavy ties but outperformed Breslow. In general, the perturbation method had the highest power, followed by the Efron and then the Breslow method. We applied our approach to three real-life datasets, demonstrating a seamless analytical workflow that uses stratification for confounder adjustment in studies of continuous and ordinal outcomes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    In the medical literature, hundreds of prediction models are being developed to predict health outcomes in individuals. For continuous outcomes, typically a linear regression model is developed to predict an individual\'s outcome value conditional on values of multiple predictors (covariates). To improve model development and reduce the potential for overfitting, a suitable sample size is required in terms of the number of subjects (n) relative to the number of predictor parameters (p) for potential inclusion. We propose that the minimum value of n should meet the following four key criteria: (i) small optimism in predictor effect estimates as defined by a global shrinkage factor of ≥0.9; (ii) small absolute difference of ≤ 0.05 in the apparent and adjusted R2 ; (iii) precise estimation (a margin of error ≤ 10% of the true value) of the model\'s residual standard deviation; and similarly, (iv) precise estimation of the mean predicted outcome value (model intercept). The criteria require prespecification of the user\'s chosen p and the model\'s anticipated R2 as informed by previous studies. The value of n that meets all four criteria provides the minimum sample size required for model development. In an applied example, a new model to predict lung function in African-American women using 25 predictor parameters requires at least 918 subjects to meet all criteria, corresponding to at least 36.7 subjects per predictor parameter. Even larger sample sizes may be needed to additionally ensure precise estimates of key predictor effects, especially when important categorical predictors have low prevalence in certain categories.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Sample size justification is required for all clinical studies. However, to many biomedical and clinical researchers, power and sample size analysis seems like a magic trick of statisticians. In this note, we discuss power and sample size calculations and show that biomedical and clinical investigators play a significant role in making such analyses possible and meaningful. Thus, power analysis is really an interactive process and scientific researchers and statisticians are equal partners in the research enterprise.
    所有的临床研究都需要对样本量进行辨证。然 而,对于众多生物医学和临床研究人员来说,把握度 和样本量看起来就像一个统计学家的魔术。在本文中, 我们讨论了把握度和样本量的计算,并说明生物医学 和临床研究人员在该分析的可行性和意义中具有重要 作用。因此,把握度分析的确是一个互动的过程,并 且科学研究人员和统计人员在研究团队中是平等合作 的伙伴。.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号