External validation

外部验证
  • 文章类型: Journal Article
    胎儿生长受限与围产期发病率和死亡率相关。早期识别具有高危胎儿的妇女可以减少围产期不良结局。
    为了评估预测胎儿生长受限和出生体重的现有模型的预测性能,如果需要的话,使用个体参与者数据开发和验证新的多变量模型。
    国际妊娠并发症预测网络中队列的个体参与者数据荟萃分析,决策曲线分析和卫生经济学分析。
    孕妇预订。现有模型的外部验证(9个队列,441,415次怀孕);国际妊娠并发症预测模型的开发和验证(4个队列,237,228次怀孕)。
    产妇临床特征,生化和超声标记。
    胎儿生长受限定义为出生体重<10分,根据胎龄和死胎进行调整,新生儿死亡或分娩前32周出生体重。
    首先,我们使用个体参与者数据荟萃分析对现有模型进行了外部验证.如果需要,我们使用随机截距回归模型开发并验证了新的国际妊娠并发症预测模型,并对变量选择进行了反向剔除,并进行了内部-外部交叉验证.我们估计了具体研究的表现(c统计量,标定斜率,对每个模型进行大范围校准),并使用随机效应荟萃分析进行汇总。使用τ2和95%预测区间量化异质性。我们使用决策曲线分析评估胎儿生长受限模型的临床实用性,和卫生经济学分析基于国家卫生与护理卓越研究所2008模型。
    在119个已发布的模型中,可以验证一个出生体重模型(Poon)。根据我们的定义,没有报道胎儿生长受限。在所有队列中,Poon模型具有良好的汇总校准斜率0.93(95%置信区间0.90至0.96),略有过拟合,平均低估出生体重90.4g(95%置信区间37.9g至142.9g)。新开发的国际妊娠并发症预测-胎儿生长受限模型包括产妇年龄,高度,奇偶校验,吸烟状况,种族,和任何高血压病史,先兆子痫,先前的死产或小于胎龄的婴儿和分娩时的胎龄。这允许以分娩时假定的胎龄范围为条件的预测。合并的表观c统计量和校准为0.96(95%置信区间0.51至1.0),和0.95(95%置信区间0.67至1.23),分别。该模型显示,预测概率阈值在1%到90%之间,净收益为正。除了国际妊娠并发症预测-胎儿生长受限模型中的预测因子外,国际妊娠并发症预测-出生体重模型包括孕妇体重,糖尿病史和受孕方式。内部-外部交叉验证队列的平均校准斜率为1.00(95%置信区间0.78至1.23),没有过度拟合的证据。出生体重平均被低估9.7g(95%置信区间-154.3g至173.8g)。
    由于结果定义的差异,我们无法从外部验证大多数已发布的模型。我们的国际妊娠并发症预测-胎儿生长受限模型的内部-外部交叉验证受到纳入队列中事件少的限制。使用已发布的国家健康与护理卓越研究所2008模型进行的经济评估可能无法反映当前的做法,由于数据匮乏,无法进行全面的经济评估。
    国际妊娠并发症预测模型的性能需要在常规实践中进行评估,它们对决策和临床结果的影响需要评估。
    妊娠并发症的国际预测-胎儿生长受限和妊娠并发症的国际预测-出生体重模型可准确预测分娩时各种假定胎龄的胎儿生长受限和出生体重。这些可用于在预订时对风险状态进行分层,计划监控和管理。
    本研究注册为PROSPEROCRD42011135045。
    该奖项由美国国家卫生与护理研究所(NIHR)卫生技术评估计划(NIHR奖编号:17/148/07)资助,并在《卫生技术评估》中全文发布。28号14.有关更多奖项信息,请参阅NIHR资助和奖励网站。
    十个婴儿中就有一个出生时的年龄比他们小。三分之一这样的小婴儿被认为是“生长受限”,因为他们有并发症,如在子宫内死亡(死产)或出生后(新生儿死亡),脑瘫,或者需要长期住院。当胎儿怀疑生长受限时,他们被密切监测,并经常提前交付,以避免并发症。因此,重要的是,我们及早发现生长受限的婴儿,以便计划护理。我们的目标是提供对母亲生育生长受限婴儿的机会的个性化和准确估计,并预测婴儿在怀孕不同时间点分娩时的体重。要做到这一点,首先,我们测试了现有风险计算器(“预测模型”)在预测生长限制和出生体重方面的准确性。然后,我们开发了新的风险计算器,并研究了它们的临床和经济效益。我们通过在我们的大型数据库库(国际妊娠并发症预测)中访问单个孕妇及其婴儿的数据来做到这一点。已发布的风险计算器对生长限制有各种定义,没有人使用我们的定义来预测生长受限婴儿的机会。有人预测婴儿的出生体重。这个风险计算器表现很好,我们开发了两种新的风险计算器来预测生长受限的婴儿(国际妊娠并发症预测-胎儿生长受限)和出生体重(国际妊娠并发症预测-出生体重)。两个计算器都准确地预测了婴儿出生时生长受限的机会,和它的出生体重。出生体重低于9.7g。在预测低风险和高风险的两个母亲中,计算器表现良好。需要进一步的研究来确定在实践中使用这些计算器的影响,以及在实践中实施它们的挑战。国际妊娠并发症预测-胎儿生长受限和国际妊娠并发症预测-出生体重风险计算器都将告知医疗保健专业人员,并使父母能够就监测和分娩时机做出明智的决定。
    UNASSIGNED: Fetal growth restriction is associated with perinatal morbidity and mortality. Early identification of women having at-risk fetuses can reduce perinatal adverse outcomes.
    UNASSIGNED: To assess the predictive performance of existing models predicting fetal growth restriction and birthweight, and if needed, to develop and validate new multivariable models using individual participant data.
    UNASSIGNED: Individual participant data meta-analyses of cohorts in International Prediction of Pregnancy Complications network, decision curve analysis and health economics analysis.
    UNASSIGNED: Pregnant women at booking. External validation of existing models (9 cohorts, 441,415 pregnancies); International Prediction of Pregnancy Complications model development and validation (4 cohorts, 237,228 pregnancies).
    UNASSIGNED: Maternal clinical characteristics, biochemical and ultrasound markers.
    UNASSIGNED: fetal growth restriction defined as birthweight <10th centile adjusted for gestational age and with stillbirth, neonatal death or delivery before 32 weeks\' gestation birthweight.
    UNASSIGNED: First, we externally validated existing models using individual participant data meta-analysis. If needed, we developed and validated new International Prediction of Pregnancy Complications models using random-intercept regression models with backward elimination for variable selection and undertook internal-external cross-validation. We estimated the study-specific performance (c-statistic, calibration slope, calibration-in-the-large) for each model and pooled using random-effects meta-analysis. Heterogeneity was quantified using τ2 and 95% prediction intervals. We assessed the clinical utility of the fetal growth restriction model using decision curve analysis, and health economics analysis based on National Institute for Health and Care Excellence 2008 model.
    UNASSIGNED: Of the 119 published models, one birthweight model (Poon) could be validated. None reported fetal growth restriction using our definition. Across all cohorts, the Poon model had good summary calibration slope of 0.93 (95% confidence interval 0.90 to 0.96) with slight overfitting, and underpredicted birthweight by 90.4 g on average (95% confidence interval 37.9 g to 142.9 g). The newly developed International Prediction of Pregnancy Complications-fetal growth restriction model included maternal age, height, parity, smoking status, ethnicity, and any history of hypertension, pre-eclampsia, previous stillbirth or small for gestational age baby and gestational age at delivery. This allowed predictions conditional on a range of assumed gestational ages at delivery. The pooled apparent c-statistic and calibration were 0.96 (95% confidence interval 0.51 to 1.0), and 0.95 (95% confidence interval 0.67 to 1.23), respectively. The model showed positive net benefit for predicted probability thresholds between 1% and 90%. In addition to the predictors in the International Prediction of Pregnancy Complications-fetal growth restriction model, the International Prediction of Pregnancy Complications-birthweight model included maternal weight, history of diabetes and mode of conception. Average calibration slope across cohorts in the internal-external cross-validation was 1.00 (95% confidence interval 0.78 to 1.23) with no evidence of overfitting. Birthweight was underestimated by 9.7 g on average (95% confidence interval -154.3 g to 173.8 g).
    UNASSIGNED: We could not externally validate most of the published models due to variations in the definitions of outcomes. Internal-external cross-validation of our International Prediction of Pregnancy Complications-fetal growth restriction model was limited by the paucity of events in the included cohorts. The economic evaluation using the published National Institute for Health and Care Excellence 2008 model may not reflect current practice, and full economic evaluation was not possible due to paucity of data.
    UNASSIGNED: International Prediction of Pregnancy Complications models\' performance needs to be assessed in routine practice, and their impact on decision-making and clinical outcomes needs evaluation.
    UNASSIGNED: The International Prediction of Pregnancy Complications-fetal growth restriction and International Prediction of Pregnancy Complications-birthweight models accurately predict fetal growth restriction and birthweight for various assumed gestational ages at delivery. These can be used to stratify the risk status at booking, plan monitoring and management.
    UNASSIGNED: This study is registered as PROSPERO CRD42019135045.
    UNASSIGNED: This award was funded by the National Institute for Health and Care Research (NIHR) Health Technology Assessment programme (NIHR award ref: 17/148/07) and is published in full in Health Technology Assessment; Vol. 28, No. 14. See the NIHR Funding and Awards website for further award information.
    One in ten babies is born small for their age. A third of such small babies are considered to be ‘growth-restricted’ as they have complications such as dying in the womb (stillbirth) or after birth (newborn death), cerebral palsy, or needing long stays in hospital. When growth restriction is suspected in fetuses, they are closely monitored and often delivered early to avoid complications. Hence, it is important that we identify growth-restricted babies early to plan care. Our goal was to provide personalised and accurate estimates of the mother’s chances of having a growth-restricted baby and predict the baby’s weight if delivered at various time points in pregnancy. To do so, first we tested how accurate existing risk calculators (‘prediction models’) were in predicting growth restriction and birthweight. We then developed new risk-calculators and studied their clinical and economic benefits. We did so by accessing the data from individual pregnant women and their babies in our large database library (International Prediction of Pregnancy Complications). Published risk-calculators had various definitions of growth restriction and none predicted the chances of having a growth-restricted baby using our definition. One predicted baby’s birthweight. This risk-calculator performed well, but underpredicted the birthweight by up to 143 g. We developed two new risk-calculators to predict growth-restricted babies (International Prediction of Pregnancy Complications-fetal growth restriction) and birthweight (International Prediction of Pregnancy Complications-birthweight). Both calculators accurately predicted the chances of the baby being born with growth restriction, and its birthweight. The birthweight was underpredicted by <9.7 g. The calculators performed well in both mothers predicted to be low and high risk. Further research is needed to determine the impact of using these calculators in practice, and challenges to implementing them in practice. Both International Prediction of Pregnancy Complications-fetal growth restriction and International Prediction of Pregnancy Complications-birthweight risk calculators will inform healthcare professionals and empower parents make informed decisions on monitoring and timing of delivery.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:是否可以开发和验证简化的卵巢过度刺激综合征(OHSS)风险评估指数,并充分区分中度/重度OHSS与无OHSS的患者?
    结论:这个易于使用的OHSS风险评估指数在内部和外部验证队列中显示出良好的判别力和较高的校准准确性。
    背景:早期预警和风险分层对于预防OHSS的发生至关重要。我们之前已经开发了一个基于多阶段智能手机应用程序的预测模型来评估OHSS的风险,但是在许多主要机构中,应用程序使用可能不那么方便。需要简化的OHSS风险评估指数。
    方法:OHSS风险评估指数的培训和内部验证使用了2016年1月至2020年12月的回顾性队列数据。从2021年1月至2022年5月,使用前瞻性队列数据库进行外部验证。训练队列中有15.066个周期,内部验证队列中的6502个周期,和外部验证队列中的8097个周期。
    方法:本研究在某三甲医院生殖医学中心进行。包括接受卵巢刺激的不孕妇女。从具有详细医疗记录的本地数据库中提取数据。分多个阶段构建了多阶段风险评估指标。第一阶段是卵巢刺激开始前,第二个是在排卵触发之前,第三个是在取卵后,最后一个阶段是在胚胎移植日,如果新鲜胚胎移植计划。
    结果:我们为中度/重度OHSS建立了简化的多阶段风险评估指数,在培训以及内部和外部验证队列中,通过辨别和校准能力进一步评估了其性能.用C统计量确定OHSS风险评估指标的判别能力。培训中的C统计(分别为阶段1-4:0.631、0.692、0.751、0.788)以及内部(分别为阶段1-4:0.626、0.642、0.755、0.771)和外部验证(分别为阶段1-4:0.668、0.670、0.754、0.773)队列均从阶段1增加到阶段3,趋势相似,并且在阶段3和阶段4之间具有可比性。校准图显示所有三个队列中观察到的和预测的病例之间的高度一致性。基于不同风险分层的OHSS发生率(可忽略风险,低风险,中等风险,和高风险)为0%,0.6%,2.7%,在培训队列中占8.3%,0%,0.6%,3.3%,内部验证队列中的8.5%,和0.1%,1.1%,4.1%,在外部验证队列中为7.2%。
    结论:无法消除包括冷冻保存所有胚胎在内的临床干预措施的影响,因此某些风险因素,如触发日的雌激素水平,可能被分配为较低的风险评分。这项研究的另一个弱点是几种预防性治疗方法,例如口服阿司匹林和来曲唑,没有在模型中记录和评估。尽管OHSS评估指标具有强大的可靠性,该工具不能直接用于临床决策或作为诊断工具.它的价值在于它能够评估各种干预措施的预后并促进临床医生与患者之间的沟通。为了准确和个性化地管理OHSS,应考虑该工具以及进一步的症状和检查的组合。
    结论:可以实施OHSS风险评估指标,便于OHSS的个性化咨询和管理。
    背景:本研究得到了国家重点研发计划(2022YFC2702504)的资助,广东省医学研究基金(A2024003),和新疆支持广东省农村科技(特约记者)计划(KTPYJ2023014)。所有作者都没有什么可透露的。
    背景:不适用。
    OBJECTIVE: Can a simplified ovarian hyperstimulation syndrome (OHSS) risk assessment index be developed and validated with sufficient discrimination of moderate/severe OHSS from those without OHSS?
    CONCLUSIONS: This easy-to-use OHSS risk assessment index shows good discriminative power and high calibration accuracy in internal and external validation cohorts.
    BACKGROUND: An early alert and risk stratification is critical to prevent the occurrence of OHSS. We have previously developed a multi-stage smartphone app-based prediction model to evaluate the risk of OHSS, but app use might not be so convenient in many primary institutions. A simplified OHSS risk assessment index has been required.
    METHODS: This training and internal validation of an OHSS risk assessment index used retrospective cohort data from January 2016 to December 2020. External validation was performed with a prospective cohort database from January 2021 to May 2022. There were 15 066 cycles in the training cohort, 6502 cycles in the internal validation cohort, and 8097 cycles in the external validation cohort.
    METHODS: This study was performed in the reproductive medicine center of a tertiary hospital. Infertile women who underwent ovarian stimulation were included. Data were extracted from the local database with detailed medical records. A multi-stage risk assessment index was constructed at multiple stages. The first stage was before the initiation of ovarian stimulation, the second was before the ovulation trigger, the third was after oocyte retrieval, and the last stage was on the embryo transfer day if fresh embryo transfer was scheduled.
    RESULTS: We established a simplified multi-stage risk assessment index for moderate/severe OHSS, the performance of which was further evaluated with discrimination and calibration abilities in training and internal and external validation cohorts. The discrimination abilities of the OHSS risk assessment index were determined with C-statistics. C-statistics in training (Stages 1-4: 0.631, 0.692, 0.751, 0.788, respectively) and internal (Stages 1-4: 0.626, 0.642, 0.755, 0.771, respectively) and external validation (Stages 1-4: 0.668, 0.670, 0.754, 0.773, respectively) cohorts were all increased from Stage 1 to 3 with similar trends, and were comparable between Stages 3 and 4. Calibration plots showed high agreement between observed and predicted cases in all three cohorts. Incidences of OHSS based on diverse risk stratification (negligible risk, low risk, medium risk, and high risk) were 0%, 0.6%, 2.7%, and 8.3% in the training cohort, 0%, 0.6%, 3.3%, and 8.5% in the internal validation cohort, and 0.1%, 1.1%, 4.1%, and 7.2% in the external validation cohort.
    CONCLUSIONS: The influence from clinical interventions including cryopreservation of all embryos cannot be eliminated and thus certain risk factors like estrogen level on trigger day might be assigned with a lower risk score. Another weakness of the study is that several preventive treatments, for instance oral aspirin and letrozole, were not recorded and evaluated in the model. Despite the robust reliability of OHSS assessment index, this tool cannot be used directly for clinical decision-making or as a diagnostic tool. Its value lies in its capacity to evaluate the prognosis of various interventions and to facilitate clinician-patient communication. The combination of this tool and further symptoms and examinations should be all taken into consideration for accurate and personalized management of OHSS.
    CONCLUSIONS: The OHSS risk assessment index can be implemented to facilitate personalized counseling and management of OHSS.
    BACKGROUND: This study was supported by National Key R&D Program of China (2022YFC2702504), Medical Research Fund Guangdong Provincial (A2024003), and Xinjiang Support Rural Science and Technology (Special Correspondent) Program in Guangdong Province (KTPYJ 2023014). All authors had nothing to disclose.
    BACKGROUND: N/A.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    蛋白质能量消耗(PEW)在成年血液透析患者中发生率很高,是指蛋白质和能量物质减少的状态。已经证明PEW高度影响生存质量并增加死亡风险。然而,临床诊断标准复杂。简化成人血液透析患者PEW的诊断方法,我们之前建立了一个新的临床预测模型,该模型在内部使用自举进行了很好的验证.在这项多中心横断面研究中,我们旨在在一个新的成人血液透析患者队列中对该列线图进行外部验证.
    通过将四个独立变量与国际肾脏营养与代谢学会(ISRNM)诊断标准(包括白蛋白)的一部分相结合,建立了新颖的预测模型。总胆固醇,体重指数(BMI)。我们使用判别(一致性指数)评估了新模型的性能,校准图,和临床影响曲线,以评估其预测效用。
    从9月1日起,2022年8月31日2023年,在上海的五个医疗中心对1158名患者进行了筛查。622例(53.7%)血液透析患者纳入分析。PEW预测模型与0.777(95%CI0.741-0.814)的曲线下面积的鉴别是可接受的。此外,该模型显示了拟合良好的校准曲线。McNemar检验显示新模型具有与金标准诊断方法相似的诊断功效(p>0.05)。
    我们从这个横断面外部验证研究中得到的结果进一步表明,新模型是有效识别成人血液透析患者PEW的有效工具。
    UNASSIGNED: Protein Energy Wasting (PEW) has high incidence in adult hemodialysis patients and refers to a state of decreased protein and energy substance. It has been demonstrated that PEW highly affects the quality of survival and increases the risk of death. Nevertheless, its diagnostic criteria are complex in clinic. To simplify the diagnosis method of PEW in adult hemodialysis patients, we previously established a novel clinical prediction model that was well-validated internally using bootstrapping. In this multicenter cross-sectional study, we aimed to externally validate this nomogram in a new cohort of adult hemodialysis patients.
    UNASSIGNED: The novel prediction model was built by combining four independent variables with part of the International Society of Renal Nutrition and Metabolism (ISRNM) diagnostic criteria including albumin, total cholesterol, and body mass index (BMI). We evaluated the performance of the new model using discrimination (Concordance Index), calibration plots, and Clinical Impact Curve to assess its predictive utility.
    UNASSIGNED: From September 1st, 2022 to August 31st, 2023, 1,158 patients were screened in five medical centers in Shanghai. 622 (53.7%) hemodialysis patients were included for analysis. The PEW predictive model was acceptable discrimination with the area under the curve of 0.777 (95% CI 0.741-0.814). Additionally, the model revealed well-fitted calibration curves. The McNemar test showed the novel model had similar diagnostic efficacy with the gold standard diagnostic method (p > 0.05).
    UNASSIGNED: Our results from this cross-sectional external validation study further demonstrate that the novel model is a valid tool to identify PEW in adult hemodialysis patients effectively.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    区分不复杂和复杂阑尾炎的评分系统有利于确定急性阑尾炎的最佳治疗方法。我们开发了一种评分系统来区分不复杂和复杂的阑尾炎,并使用外部验证评估评分系统的临床实用性。回顾性分析了299例急性阑尾炎患者。一百九十九名患者被分配到模型开发组,而其他100例患者被分配到外部验证组.使用具有六个独立预测因子的最终多变量逻辑回归模型创建了复杂阑尾炎的评分系统。评分系统的受试者工作特征曲线下面积为0.882(95%置信区间:0.835-0.929)。评分系统的分界点为12,敏感性和特异性分别为82.9%和86.2%,分别。在外部验证组中,评分系统的受试者工作特征曲线下面积为0.868(95%置信区间0.794-0.942),两组受试者工作特征曲线下面积差异无统计学意义(P=0.750)。我们新开发的评分系统可能有助于迅速确定急性阑尾炎的最佳治疗方法。
    A scoring system to discriminate between uncomplicated and complicated appendicitis is beneficial to determine the optimal treatment for acute appendicitis. We developed a scoring system to discriminate between uncomplicated and complicated appendicitis and assessed the clinical usefulness of the scoring system using external validation. A total of 299 patients with acute appendicitis were retrospectively reviewed. One hundred and ninety-nine patients were assigned to the model development group, while the other 100 patients were assigned to an external validation group. A scoring system for complicated appendicitis was created using a final multivariate logistic regression model with six independent predictors. The area under the receiver operating characteristic curve of the scoring system was 0.882 (95% confidence interval: 0.835-0.929). The cutoff point of the scoring system was 12, and the sensitivity and specificity were 82.9% and 86.2%, respectively. In the external validation group, the area under the receiver operating characteristic curve of the scoring system was 0.868 (95% confidence interval 0.794-0.942), and there was no significant difference between the groups in the area under the receiver operating characteristic curve (P = 0.750). Our newly developed scoring system may contribute to prompt determination of the optimal treatment for acute appendicitis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:在美国,严重的孕产妇发病率正在增加。有几种工具和评分可以对个人严重孕产妇发病率的风险进行分层。
    目的:我们试图研究和比较四种评分系统预测严重产妇发病率的有效性。
    方法:这是一项关于安全劳动力数据集联盟中所有个体的回顾性队列研究,从2002年到2008年进行。如果个人缺少有关风险因素的信息,则将其排除在外。严重的产妇发病率是根据疾病控制和预防中心定义的,不包括输血。由于担心ICD代码对该指标的特异性及其可变的临床意义,因此排除了输血。使用围产期卓越评估计算每位参与者的风险评分,加州产妇优质护理合作,产科合并症指数,和改良的产科合并症指数。我们根据风险评分计算了严重孕产妇发病率的可能性。通过接收器工作特征曲线下的面积及其95%置信区间来检查预测得分的判别性能。使用自举重新采样比较每个分数的曲线下面积。为每个分数开发校准图以检查拟合优度。使用一致性概率方法来定义最佳表现得分的最佳截止点。
    结果:在153,463个人中,1,115(0.7%)有严重的孕产妇发病率。与围产期卓越评估评分系统相比,加州产妇优质护理协作评分系统的曲线下面积[95%置信区间](0.78[0.77-0.80])明显更高,产科合并症指数和改良的产科合并症指数评分系统0.75[0.73-0.76],.0.67[0.65-0.68],0.66[0.70-0.73];P<0.001)。校准图显示,围产期卓越评分评估系统和产科合并症指数的预测和实际严重孕产妇发病率之间具有极好的一致性(两个Hosmer-Lemeshow检验P值=1.00,表明拟合优度)。
    结论:这项研究验证了四种风险评分系统来预测严重的产妇发病率。加州产妇质量护理协作和围产期卓越评估评分系统都具有良好的区别性,可以预测严重的产妇发病率。围产期卓越评分和产科合并症指数评估具有拟合优度。在理想计算的截止点,围产期卓越评分评估在四个评分中敏感度最高,为71%,这表明仍需要更好的评分系统来预测严重的孕产妇发病率。
    BACKGROUND: Severe maternal morbidity is increasing in the United States. Several tools and scores exist to stratify an individual\'s risk of severe maternal morbidity.
    OBJECTIVE: We sought to examine and compare the validity of four scoring systems for predicting severe maternal morbidity.
    METHODS: This was a retrospective cohort study of all individuals in the Consortium on Safe Labor dataset, which was conducted from 2002 to 2008. Individuals were excluded if they had missing information on risk factors. Severe maternal morbidity was defined based on the Centers for Disease Control and Prevention excluding blood transfusion. Blood transfusion was excluded due to concerns regarding the specificity of ICD codes for this indicator and its variable clinical significance. Risk scores were calculated for each participant using the Assessment of Perinatal Excellence, California Maternal Quality Care Collaborative, Obstetric Comorbidity Index, and Modified Obstetric Comorbidity Index. We calculated the probability of severe maternal morbidity according to the risk scores. The discriminative performance of the prediction score was examined by the areas under receiver operating characteristic curves and their 95% confidence intervals. The area under the curve for each score was compared using the bootstrap resampling. Calibration plots were developed for each score to examine the goodness-of-fit. The concordance probability method was used to define an optimal cutoff point for the best-performing score.
    RESULTS: Of 153, 463 individuals, 1,115 (0.7%) had severe maternal morbidity. The California Maternal Quality Care Collaborative scoring system had a significantly higher area under the curve [95% confidence interval] (0.78 [0.77-0.80]) compared to the Assessment of Perinatal Excellence scoring system, Obstetric Comorbidity Index and Modified Obstetric Comorbidity Index scoring systems 0.75 [0.73-0.76],. 0.67 [0.65-0.68], 0.66 [0.70-0.73]; P < 0.001). Calibration plots showed excellent concordance between the predicted and actual severe maternal morbidity for the Assessment of Perinatal Excellence scoring system and Obstetric Comorbidity Index (both Hosmer-Lemeshow test P-values = 1.00, suggesting goodness-of-fit).
    CONCLUSIONS: This study validated four risk-scoring systems to predict severe maternal morbidity. Both California Maternal Quality Care Collaborative and Assessment of Perinatal Excellence scoring systems had good discrimination to predict severe maternal morbidity. The Assessment of Perinatal Excellence score and the Obstetric Comorbidity Index had goodness-of-fit. At ideal calculated cut-off points, the Assessment of Perinatal Excellence score had the highest sensitivity of the four scores at 71%, indicating that better scoring systems are still needed for predicting severe maternal morbidity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:本研究旨在建立预测重症监护病房(ICU)COVID-19患者是否需要有创机械通气(IMV)的预后模型,并将其表现与呼吸频率-氧分压(ROX)指数进行比较。
    方法:使用2020年3月至2021年8月在里约热内卢的三家医院收集的数据进行了一项回顾性队列研究。巴西。对18岁及以上诊断为COVID-19的ICU患者进行筛查。排除标准是在ICU入住的前24小时内接受IMV的患者,怀孕,最低限度临终关怀的临床决策和缺少的主要结局数据.收集临床和实验室变量。采用多因素logistic回归分析选择预测变量。模型基于最低Akaike信息标准(AIC)和具有显著p值的最低AIC。对预测性能进行评估以进行区分和校准。使用DeLong算法比较曲线下面积(AUC)。使用国际数据库对模型进行了外部验证。
    结果:在接受筛查的656名患者中,纳入346例患者;155例需要IMV(44.8%),191没有(55.2%),207例患者为男性(59.8%)。根据最低的AIC,动脉高血压,糖尿病,肥胖,序贯器官衰竭评估(SOFA)评分,心率,呼吸频率,外周血氧饱和度(SpO2),温度,呼吸努力信号,和白细胞在入院时被确定为IMV的预测因子。根据具有显著p值的AIC,SOFA得分,SpO2和呼吸努力信号是IMV的最佳预测因子;比值比(95%置信区间):1.46(1.07-2.05),0.81(0.72-0.90),9.13(3.29-28.67),分别。IMV组入院时的ROX指数低于非IMV组(7.3[5.2-9.8]vs9.6[6.8-12.9],p分别<0.001)。在外部验证群体中,ROX指数曲线下面积(AUC)为0.683(准确率63%),AIC模型显示AUC为0.703(准确率69%),具有显著p值的最低AIC模型的AUC为0.725(准确率79%)。
    结论:在患有COVID-19的ICU患者的发展人群中,SOFA评分,SpO2和呼吸努力信号比ROX指数更好地预测IMV的需求。在外部验证群体中,尽管AUC没有显着差异,使用SOFA评分时,准确性更高,与ROX指数相比,SpO2和呼吸努力信号。这表明这些变量可能更有助于预测ICUCOVID-19患者对IMV的需求。
    结果:
    NCT05663528。
    BACKGROUND: This study aimed to develop prognostic models for predicting the need for invasive mechanical ventilation (IMV) in intensive care unit (ICU) patients with COVID-19 and compare their performance with the Respiratory rate-OXygenation (ROX) index.
    METHODS: A retrospective cohort study was conducted using data collected between March 2020 and August 2021 at three hospitals in Rio de Janeiro, Brazil. ICU patients aged 18 years and older with a diagnosis of COVID-19 were screened. The exclusion criteria were patients who received IMV within the first 24 h of ICU admission, pregnancy, clinical decision for minimal end-of-life care and missing primary outcome data. Clinical and laboratory variables were collected. Multiple logistic regression analysis was performed to select predictor variables. Models were based on the lowest Akaike Information Criteria (AIC) and lowest AIC with significant p values. Assessment of predictive performance was done for discrimination and calibration. Areas under the curves (AUC)s were compared using DeLong\'s algorithm. Models were validated externally using an international database.
    RESULTS: Of 656 patients screened, 346 patients were included; 155 required IMV (44.8%), 191 did not (55.2%), and 207 patients were male (59.8%). According to the lowest AIC, arterial hypertension, diabetes mellitus, obesity, Sequential Organ Failure Assessment (SOFA) score, heart rate, respiratory rate, peripheral oxygen saturation (SpO2), temperature, respiratory effort signals, and leukocytes were identified as predictors of IMV at hospital admission. According to AIC with significant p values, SOFA score, SpO2, and respiratory effort signals were the best predictors of IMV; odds ratios (95% confidence interval): 1.46 (1.07-2.05), 0.81 (0.72-0.90), 9.13 (3.29-28.67), respectively. The ROX index at admission was lower in the IMV group than in the non-IMV group (7.3 [5.2-9.8] versus 9.6 [6.8-12.9], p < 0.001, respectively). In the external validation population, the area under the curve (AUC) of the ROX index was 0.683 (accuracy 63%), the AIC model showed an AUC of 0.703 (accuracy 69%), and the lowest AIC model with significant p values had an AUC of 0.725 (accuracy 79%).
    CONCLUSIONS: In the development population of ICU patients with COVID-19, SOFA score, SpO2, and respiratory effort signals predicted the need for IMV better than the ROX index. In the external validation population, although the AUCs did not differ significantly, the accuracy was higher when using SOFA score, SpO2, and respiratory effort signals compared to the ROX index. This suggests that these variables may be more useful in predicting the need for IMV in ICU patients with COVID-19.
    RESULTS:
    UNASSIGNED: NCT05663528.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:Stockholm3是一种综合的血液检测融合蛋白生物标志物,遗传指标,和临床数据来预测具有临床意义的前列腺癌风险(活检后国际泌尿外科病理学学会≥2级)。我们的研究旨在从外部验证Stockholm3,并将其性能与使用前列腺特异性抗原(PSA)和鹿特丹前列腺癌风险计算器(RPCRC)进行临床重要的前列腺癌检测进行比较。
    方法:我们收集了在Martini-Klinik接受前列腺活检的男性的数据,德国,2014年至2017年。根据PSA水平升高或可疑的直肠指检来选择参与者,所有患者均接受10-12核心系统活检,但未进行磁共振成像靶向活检.我们评估了Stockholm3和RPCRC在临床上有意义的前列腺癌检测中的性能。此外,我们比较了建议使用Stockholm3和RPCRC进行活检和活检结果的男性比例和PSA≥3ng/ml.
    我们的研究涵盖了405名活检男性,年龄中位数为66岁(四分位数间距[IQR]:60-72),PSA水平为7ng/ml(IQR:5.2-10.8),和Stockholm3得分在18(IQR:10-34)。其中,128名男性(31%)接受了有临床意义的前列腺癌诊断。采用推荐的Stockholm3阈值(≥15)可以将不必要的活检减少52%,与使用PSA≥3ng/ml作为活检标准相比,检测到92%的有临床意义的病例。Stockholm3和RPCRC都表现出强烈的鉴别力,曲线下面积为0.80(95%置信区间[CI]:0.76-0.85)和0.75(95%CI:0.70-0.80),分别。Stockholm3校准良好,而与观察到的结果相比,RPCRC低估了风险。此外,Stockholm3产生了积极的临床净收益,而RPCRC对临床相关阈值的净获益为负.
    结论:使用Stockholm3可以检测到92%的有临床意义的前列腺癌病例,同时减少52%的不必要的活检,与PSA≥3ng/ml标准相比,根据我们对接受系统活检的男性队列的分析.
    结果:在405名男性的德国临床队列中,Stockholm3,一种早期前列腺癌检测的血液检测,表现出良好的临床效益。它确定了大量临床上有意义的病例,同时在没有这种疾病的男性和临床上无意义的前列腺癌患者中减少了一半以上不必要的活检。
    OBJECTIVE: Stockholm3 is a comprehensive blood test amalgamating protein biomarkers, genetic indicators, and clinical data to predict clinically significant prostate cancer risk (International Society of Urological Pathology grade ≥2 upon biopsy). Our study aims to externally validate Stockholm3 and compare its performance with the use of prostate-specific antigen (PSA) and the Rotterdam Prostate Cancer Risk Calculator (RPCRC) for clinically significant prostate cancer detection.
    METHODS: We gathered data from men subjected to prostate biopsies at the Martini-Klinik, Germany, between 2014 and 2017. Participants were selected based on elevated PSA levels or suspicious digital rectal examinations, all undergoing a 10-12-core systematic biopsy without a magnetic resonance imaging-targeted biopsy. We assessed Stockholm3 and RPCRC performance for clinically significant prostate cancer detection. Furthermore, we compared the proportion of men recommended for biopsy and biopsy outcomes with Stockholm3 and RPCRC against PSA ≥3 ng/ml.
    UNASSIGNED: Our study encompassed 405 biopsied men, with a median age of 66 yr (interquartile range [IQR]: 60-72), PSA levels at 7 ng/ml (IQR: 5.2-10.8), and Stockholm3 scores at 18 (IQR: 10-34). Among them, 128 men (31%) received clinically significant prostate cancer diagnoses. Employing the recommended Stockholm3 threshold (≥15) could have reduced unnecessary biopsies by 52%, while detecting 92% of clinically significant cases compared with using PSA ≥3 ng/ml as a biopsy criterion. Both Stockholm3 and RPCRC exhibited strong discrimination, with area under the curve values of 0.80 (95% confidence interval [CI]: 0.76-0.85) and 0.75 (95% CI: 0.70-0.80), respectively. Stockholm3 demonstrated good calibration, while RPCRC underestimated the risk compared with observed outcomes. Moreover, Stockholm3 yielded positive clinical net benefits, whereas RPCRC yielded negative net benefits for clinically relevant thresholds.
    CONCLUSIONS: Stockholm3 utilization could detect 92% of clinically significant prostate cancer cases while simultaneously reducing unnecessary biopsies by 52%, compared with the PSA ≥3 ng/ml criterion, based on our analysis within a cohort of men who underwent systematic biopsies.
    RESULTS: In a German clinical cohort of 405 men, Stockholm3, a blood test for early prostate cancer detection, exhibited favorable clinical benefits. It identified a substantial number of clinically significant cases while reducing unnecessary biopsies by over half in men without the disease and those with clinically nonsignificant prostate cancer.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:心房颤动(AF)在重症监护病房(ICU)患者中很常见,并显着提高了住院死亡率。现有的评分系统或模型对ICU中的AF患者具有有限的预测能力。我们的研究开发并验证了机器学习模型,以预测ICU房颤患者的院内死亡风险。
    结果:分析了重症监护医学信息集市(MIMIC)-IV数据集和eICU合作研究数据库(eICU-CRD)。在比较的十个分类器中,适应性增强(AdaBoost)在预测房颤患者的全因死亡率方面表现更好.开发并验证了具有15个特征的紧凑模型。所有可变和紧凑模型均表现出出色的性能,训练集中的接收器工作特征曲线(AUC)下面积为1(95%置信区间[CI]:1.0-1.0)。在MIMIC-IV测试集中,所有可变模型和紧凑模型的AUC分别为0.978(95%CI:0.973-0.982)和0.977(95%CI:0.972-0.982),分别。在外部验证集中,所有可变和紧凑模型的AUC分别为0.825(95%CI:0.815-0.834)和0.807(95%CI:0.796-0.817),分别。
    结论:基于AdaBoost的预测模型经过内部和外部验证,强调其在评估ICU房颤患者院内死亡风险方面的强大预测能力。
    BACKGROUND: Atrial fibrillation (AF) is common among intensive care unit (ICU) patients and significantly raises the in-hospital mortality rate. Existing scoring systems or models have limited predictive capabilities for AF patients in ICU. Our study developed and validated machine learning models to predict the risk of in-hospital mortality in ICU patients with AF.
    RESULTS: Medical Information Mart for Intensive Care (MIMIC)-IV dataset and eICU Collaborative Research Database (eICU-CRD) were analyzed. Among ten classifiers compared, adaptive boosting (AdaBoost) showed better performance in predicting all-cause mortality in AF patients. A compact model with 15 features was developed and validated. Both the all variable and compact models exhibited excellent performance with area under the receiver operating characteristic curves (AUCs) of 1(95%confidence interval [CI]: 1.0-1.0) in the training set. In the MIMIC-IV testing set, the AUCs of the all variable and compact models were 0.978 (95% CI: 0.973-0.982) and 0.977 (95% CI: 0.972-0.982), respectively. In the external validation set, the AUCs of all variable and compact models were 0.825 (95% CI: 0.815-0.834) and 0.807 (95% CI: 0.796-0.817), respectively.
    CONCLUSIONS: An AdaBoost-based predictive model was subjected to internal and external validation, highlighting its strong predictive capacity for assessing the risk of in-hospital mortality in ICU patients with AF.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    睡眠唤醒(SW)周期检测是从活动图提取时间睡眠指标的关键步骤。已经开发了各种监督学习算法,然而,它们从传感器到传感器或从研究到研究的普遍性值得怀疑。在本文中,我们详细介绍并验证了一种无监督算法-CircaCP-用于从活动记录中检测SW周期。它首先使用稳健的cosinor模型来估计昼夜节律,然后在每个昼夜节律周期内搜索单个变化点(CP)。使用CircaCP,我们从MESA睡眠研究中的2125个人数据中估计了睡眠/觉醒开始时间(S/WOT),并将估计的S/WOT与自我报告的S/WOT事件标记进行了比较,使用Bland-Altman分析以及方差成分分析。平均而言,CircaCP估计的SOT比事件标记报告的SOT落后3.6分钟,CircaCP和WOTs比标记物报告的WOTs少1分钟。这些差异在S/WOT中占不到0.2%的可变性,考虑到主体之间差异的其他来源。根植于人类昼夜节律的基本原理,我们的算法从儿童的臀部佩戴的ActiGraph数据无缝地转移到老年成人的手腕佩戴的Actiwatch数据。我们的算法的通用性表明,它可以广泛应用于其他传感器和研究收集的活动图。
    Sleep-wake (SW) cycle detection is a key step for extracting temporal sleep metrics from actigraphy. Various supervised learning algorithms have been developed, yet their generalizability from sensor to sensor or study to study is questionable. In this paper, we detail and validate an unsupervised algorithm-CircaCP-for detecting SW cycles from actigraphy. It first uses a robust cosinor model to estimate circadian rhythm, then searches for a single change point (CP) within each circadian cycle. Using CircaCP, we estimated sleep/wake onset times (S/WOTs) from 2125 individuals\' data in the MESA sleep study and compared the estimated S/WOTs against self-reported S/WOT event markers, using Bland-Altman analysis as well as variance component analysis. On average, SOTs estimated by CircaCP were 3.6 min behind those reported by event markers, and WOTs by CircaCP were less than 1 min behind those reported by markers. These differences accounted for less than 0.2% variability in S/WOTs, considering other sources of between-subject variations. Rooted in first principles of human circadian rhythms, our algorithm transferred seamlessly from children\'s hip-worn ActiGraph data to ageing adults\' wrist-worn Actiwatch data. The generalizability of our algorithm suggests that it can be widely applied to actigraphy collected by other sensors and studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号