reference standards

参考标准
  • 文章类型: Journal Article
    比较和组合来自不同实验室和不同年份的稳定同位素数据集对许多研究领域至关重要,比如同位素水文学,温室气体观测,食物研究,同位素取证,古重建,等。数据兼容性(即组合数据的能力)与数据质量有关。数据可比性的先决条件是基于具有准确分配的值和不确定性的可靠参考材料(RM),将数据标准化为通用的稳定同位素标度(通常称为校准)。尽管如此,这并不保证数据兼容性(相互协议)。尽管与数据兼容性和测量不确定度相关的计量概念已被开发并应用于一般的分析化学,这些概念尚未完全应用于稳定同位素研究。这可能会影响日常校准,分析数据和,因此,数据兼容性。此外,IRMS用户通常自己准备不同的实验室标准。此后,然后,用户应该理解用于分配RM价值和不确定性的当代概念,以及与RM相关的限制和潜在问题。RM的历史,准备报告以及过去的一些问题提供了教训。其中包括LSVEC的δ13C漂移(2017年之前δ13C尺度上的第二个锚点),对值分配原则的修订,引入LSVEC的替代品,相关争议和潜在的低估了次级RM的不确定性。这篇综述描述了与同位素尺度相关的计量概念,RM和校准层次结构以及数据兼容性。主要RM及其不确定性通过计量学概念的镜头进行了回顾。进一步关注δ13C的VPDB量表和量表不连续性问题,这可以显著降低δ13C中的数据兼容性。在日常实践中,应将RM的值和不确定性分配的给定示例视为值和不确定性计算的示例。
    Comparing and combining stable isotope datasets from different laboratories and different years is essential for many research areas, such as isotope hydrology, greenhouse gas observations, food studies, isotope forensics, palaeo-reconstructions, etc. Data compatibility (i.e. the ability to combine data) is related to the data quality. The prerequisite for data comparability is data normalization to a common stable isotope scale (often referred to as calibration) based on reliable reference materials (RMs) with accurately assigned values and uncertainties. Still, that does not guarantee the data compatibility (mutual agreement). Albeit metrological concepts related to data compatibility and measurement uncertainty have been developed and applied to analytical chemistry in general, these concepts have not yet been fully applied to stable isotope research. This can affect daily calibrations, analytical data and, therefore, data compatibility. In addition, IRMS users often prepare different laboratory standards themselves. Thereafter, users should then understand the contemporary concepts used for assigning RM value and uncertainty, as well as the limitations and potential problems associated with RMs. The history of RMs, preparation reports and also some problems in the past provide lessons to be learned. These include the δ13C drift of LSVEC (the second anchor on the δ13C scale before 2017), revisions to the value assignment principles, the introduction of replacements for LSVEC, related disputes and the potential underestimation of uncertainties for secondary RMs. The review describes metrological concepts related to isotopic scales, RMs and calibration hierarchies and data compatibility. The main RMs and their uncertainties are reviewed through the lens of metrology concepts. Additional focus is given to the VPDB scale for δ13C and issues of scale discontinuity, which can significantly reduce data compatibility in δ13C. The given examples of value and uncertainty assignment for RMs should be viewed as an example of value and uncertainty calculation in daily practice.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Systematic Review
    在通过生物电阻抗分析(BIA)估算身体成分时,预测方程的适当使用取决于所使用的设备和受试者的年龄,地理祖先,健康状况,身体活动水平和性别。然而,文献中许多孤立的预测方程的存在使得正确的选择具有挑战性,因为用户可能无法区分其适当性。因此,本系统综述旨在根据所使用的独立参数对每个预测方程进行分类.通过对国际电子数据库的系统搜索,确定了1988年至2023年之间发表的64项研究。我们包括提供从准则方法得出的预测方程的研究,比如脂肪的多隔间模型,无脂和瘦的软质,全身水和细胞外水的稀释技术,身体细胞质量的全身钾,和磁共振成像或计算机断层扫描的骨骼肌质量。如果采用非标准方法或开发的预测方程涉及回归模型中没有特定代码或变量的混合种群,则排除了研究。共检索到106个预测方程;86个预测方程基于手脚,20个基于分段技术,没有方程使用手到手和腿到腿。对受试者的特征进行分类,19岁的是未成年人,成人2619运动员,老年人26人,患有疾病的人16人,包括两性。从业者现在拥有更新的预测方程列表,用于使用BIA评估身体成分。鼓励研究人员为当前文献中未涵盖的场景生成新的预测方程。PROSPERO中的注册码:CRD42023467894。
    The appropriate use of predictive equations in estimating body composition through bioelectrical impedance analysis (BIA) depends on the device used and the subject\'s age, geographical ancestry, healthy status, physical activity level and sex. However, the presence of many isolated predictive equations in the literature makes the correct choice challenging, since the user may not distinguish its appropriateness. Therefore, the present systematic review aimed to classify each predictive equation in accordance with the independent parameters used. Sixty-four studies published between 1988 and 2023 were identified through a systematic search of international electronic databases. We included studies providing predictive equations derived from criterion methods, such as multi-compartment models for fat, fat-free and lean soft mass, dilution techniques for total-body water and extracellular water, total-body potassium for body cell mass, and magnetic resonance imaging or computerized tomography for skeletal muscle mass. The studies were excluded if non-criterion methods were employed or if the developed predictive equations involved mixed populations without specific codes or variables in the regression model. A total of 106 predictive equations were retrieved; 86 predictive equations were based on foot-to-hand and 20 on segmental technology, with no equations used the hand-to-hand and leg-to-leg. Classifying the subject\'s characteristics, 19 were for underaged, 26 for adults, 19 for athletes, 26 for elderly and 16 for individuals with diseases, encompassing both sexes. Practitioners now have an updated list of predictive equations for assessing body composition using BIA. Researchers are encouraged to generate novel predictive equations for scenarios not covered by the current literature.Registration code in PROSPERO: CRD42023467894.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    人工智能辅助(AI)龋齿诊断的准确性可能会因许多因素而有很大差异。这篇评论旨在评估AI模型在bitewingX射线照片上进行龋齿检测和分类的诊断准确性。2010年以后的出版物在五个数据库中进行了筛选。开发了定制的偏倚风险(RoB)评估工具,并将其应用于935篇参考文献中符合纳入标准的14篇文章。数据集大小范围从112到3686射线照片。虽然86%的研究报告了一个准确率≥80%的模型,大多数表现出不明确或偏倚风险高.三项研究将模型的诊断性能与牙医进行了比较,其中模型始终显示出更高的平均灵敏度。五项研究被纳入双变量诊断随机效应荟萃分析,用于整体龋齿检测。诊断比值比为55.8(95%CI=28.8-108.3),敏感性和特异性分别为0.87(0.76-0.94)和0.89(0.75-0.960),分别。进行了牙本质和釉质龋齿检测的独立荟萃分析,显示灵敏度为0.84(0.80-0.87)和0.71(0.66-0.75),分别。尽管AI模型的诊断性能很有希望,缺乏高质量,充分报告,外部验证的研究突出了当前的挑战和未来的研究需求。
    The accuracy of artificial intelligence-aided (AI) caries diagnosis can vary considerably depending on numerous factors. This review aimed to assess the diagnostic accuracy of AI models for caries detection and classification on bitewing radiographs. Publications after 2010 were screened in five databases. A customized risk of bias (RoB) assessment tool was developed and applied to the 14 articles that met the inclusion criteria out of 935 references. Dataset sizes ranged from 112 to 3686 radiographs. While 86 % of the studies reported a model with an accuracy of ≥80 %, most exhibited unclear or high risk of bias. Three studies compared the model\'s diagnostic performance to dentists, in which the models consistently showed higher average sensitivity. Five studies were included in a bivariate diagnostic random-effects meta-analysis for overall caries detection. The diagnostic odds ratio was 55.8 (95 % CI= 28.8 - 108.3), and the summary sensitivity and specificity were 0.87 (0.76 - 0.94) and 0.89 (0.75 - 0.960), respectively. Independent meta-analyses for dentin and enamel caries detection were conducted and showed sensitivities of 0.84 (0.80 - 0.87) and 0.71 (0.66 - 0.75), respectively. Despite the promising diagnostic performance of AI models, the lack of high-quality, adequately reported, and externally validated studies highlight current challenges and future research needs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:药物审查(MR)是医疗保健专业人员对患者药物安全性和有效性的系统评估。用于描述MR活动的语言,例如停药和增加剂量,应在研究中保持一致,以帮助研究人员比较不同服务的运作方式并确定其影响机制。
    目的:开发与药物评价相关的标准化术语和活性定义的国际分类法。
    方法:这是一项与国际药物审评专家进行的基于Delphi的三阶段共识研究。系统审查为调查提供了MR活动术语。专家们在李克特量表上对每个活动术语及其定义进行了评估,并提供了书面反馈。共识是75%的小组协议。在每个阶段,共识要素被保留,并使用反馈来修改定义。
    结果:从四个国家招募了七名专家(应答率15.2%):英国(n=4),新西兰(n=1),澳大利亚(n=1),马来西亚(n=1)。以下术语达成共识:术语药物作为MR术语的描述符;停止药物治疗,开始用药,剂量增加,剂量减少,剂型改变,和药物安全性和有效性监测器,以描述MR活动;教育描述医疗保健专业人员和患者/护理人员教育的交付。
    结论:标准化药物审查活动术语和定义已在未来的所有MR研究中被普遍采用,以促进不同环境下过程评估的有意义的比较。
    BACKGROUND: Medication review (MR) is the systematic assessment of a patient\'s medications for safety and effectiveness by a healthcare professional. The language used to describe MR activity, such as stopped medicine and increased dose, should be consistent across studies to assist researchers compare how different services operate and identify their mechanism of impact.
    OBJECTIVE: To develop an international taxonomy of standardized terms and activity definitions related to medication reviews.
    METHODS: This was a three-stage Delphi-based consensus study with international medication review experts. A systematic review provided MR activity terms for the survey. Experts rated their consensus on each activity term and its definition on a Likert scale and provided written feedback. The consensus was 75% panel agreement. At each stage, consensus elements were retained, and feedback was used to revise definitions.
    RESULTS: Seven experts were recruited for the study (response rate 15.2%) from four countries: the United Kingdom (n = 4), New Zealand (n = 1), Australia (n = 1), and Malaysia (n = 1). The following terms achieved consensus: the term Medication as a descriptor for MR terms; discontinue medication, start medication, dose increase, dose decrease, dosage form change, and medication safety and efficacy monitor to describe MR activity; Educate to describe the delivery of healthcare professionals and patients/carers education.
    CONCLUSIONS: Standardized medication review activity terms and definitions have been selected for universal adoption in all future MR research to facilitate a meaningful comparison of process evaluations within different settings.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Systematic Review
    在当前的神经外科领域,一直强调提供最好的护理和最大的价值。这项工作不仅需要手术,而且需要围手术期服务的不断优化。最近的工作证明了标准化技术在限制并发症同时促进最佳结果的能力。在这篇评论文章中,讨论了常见小儿神经外科手术的手术和围手术期护理方案。这些文章讨论了各种机构如何通过标准化优化程序。我们的目标是通过优化方案来改善患者的预后。
    In the current neurosurgical field, there is a constant emphasis on providing the best care with the most value. Such work requires the constant optimization of not only surgical but also perioperative services. Recent work has demonstrated the power of standardized techniques in limiting complication while promoting optimal outcomes. In this review article, protocols addressing operative and perioperative care for common pediatric neurosurgical procedures are discussed. These articles address how various institutions have optimized procedures through standardization. Our objective is to improve patient outcomes through the optimization of protocols.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Systematic Review
    目的:本系统综述旨在阐明与序列分析(SA)相关的方法学实践和报告标准,以识别现实世界中的临床路径。使用常规收集的数据。
    方法:我们进行了方法学系统综述,搜索五个医疗和健康数据库:MEDLINE,PsycINFO,CINAHL,EMBASE和WebofScience。搜索涵盖了从这些数据库开始到2023年2月28日的文章。搜索策略包括两组独特的搜索术语,特别侧重于序列分析和临床路径。
    结果:19项研究符合本系统评价的资格标准。近60%的纳入研究发表于2021年或之后,其中很大一部分来自加拿大(n=7)和法国(n=5)。90%的研究坚持基本的SA步骤。最佳匹配(OM)方法是最常用的相异度量(63%),而使用Ward连锁的凝聚层次聚类是首选的聚类算法(53%)。然而,必须强调的是,大多数研究未充分报告与SA相关的关键方法学决策.
    结论:本综述强调了在报告数据管理程序和SA流程中的关键方法学选择方面提高透明度的必要性。制定报告指南和为评估SA质量而量身定制的强大评估工具对于该领域的研究人员来说是无价的。
    OBJECTIVE: This systematic review aims to elucidate the methodological practices and reporting standards associated with sequence analysis (SA) for the identification of clinical pathways in real-world scenarios, using routinely collected data.
    METHODS: We conducted a methodological systematic review, searching five medical and health databases: MEDLINE, PsycINFO, CINAHL, EMBASE and Web of Science. The search encompassed articles from the inception of these databases up to February 28, 2023. The search strategy comprised two distinctive sets of search terms, specifically focused on sequence analysis and clinical pathways.
    RESULTS: 19 studies met the eligibility criteria for this systematic review. Nearly 60% of the included studies were published in or after 2021, with a significant proportion originating from Canada (n = 7) and France (n = 5). 90% of the studies adhered to the fundamental SA steps. The optimal matching (OM) method emerged as the most frequently employed dissimilarity measure (63%), while agglomerative hierarchical clustering using Ward\'s linkage was the preferred clustering algorithm (53%). However, it is imperative to underline that a majority of the studies inadequately reported key methodological decisions pertaining to SA.
    CONCLUSIONS: This review underscores the necessity for enhanced transparency in reporting both data management procedures and key methodological choices within SA processes. The development of reporting guidelines and a robust appraisal tool tailored to assess the quality of SA would be invaluable for researchers in this field.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Systematic Review
    目的:血管内技术创新需要在高质量随机对照试验(RCTs)中进行严格的评估。然而,由于许多方法上的挑战,评估血管内介入的随机对照试验是复杂的,并且可能难以设计,行为,和报告。本系统评价旨在评估下肢外周动脉疾病(PAD)血管内介入治疗的RCT报告质量。
    方法:对Medline,Embase,从开始到2021年12月的CochraneLibrary数据库用于确定RCT,包括接受任何腹股沟下下肢血管内介入治疗的PAD患者.提取研究数据,并根据非药物治疗报告试验扩展合并标准(CONSORT-NPT)和干预描述和复制模板(TIDeR)清单进行评估。描述性统计用于总结一般研究细节和试验报告标准。
    结果:在筛选了6567篇摘要和526篇全文之后,确定了112项符合条件的研究,报告228种不同的血管内装置和技术。提供了47项(21%)干预措施的详细信息,这些细节被认为足以复制所调查的干预措施。有人认为不清楚该描述是否在另外56(24%)中报告了足够的细节,125人(55%)的描述被判断为不充分。提供了184项干预描述(81%),细节水平可变(有些在134[59%],精确在50[22%])。在25项(22%)试验中报告了干预措施或某些方面的标准化,但只有一个人指定将监测对研究方案的依从性。
    结论:研究下肢血管内治疗的随机对照报告标准的质量受到严重限制,因为干预措施描述不佳,标准化,并报告。
    OBJECTIVE: Endovascular technology innovation requires rigorous evaluation in high quality randomised controlled trials (RCTs). However, due to numerous methodological challenges, RCTs evaluating endovascular interventions are complex and potentially difficult to design, conduct, and report. This systematic review aimed to assess the quality of reporting of RCTs for endovascular interventions for lower limb peripheral arterial disease (PAD).
    METHODS: A systematic review of Medline, Embase, and the Cochrane Library databases from inception to December 2021 was performed to identify RCTs including participants with PAD undergoing any infrainguinal lower limb endovascular intervention. Study data were extracted and assessed against the Consolidating Standards of Reporting Trials extension for Non-Pharmacological Treatments (CONSORT-NPT) and the Template for Intervention Description and Replication (TIDieR) checklists. Descriptive statistics were used to summarise general study details and reporting standards of the trials.
    RESULTS: After screening 6 567 abstracts and 526 full text articles, 112 eligible studies were identified, reporting on 228 different endovascular devices and techniques. Details judged sufficient to replicate the investigated intervention were provided for 47 (21%) interventions. It was unclear whether the description was reported with sufficient details in a further 56 (24%), and the description was judged inadequate in 125 (55%). Any intervention descriptions were provided for 184 (81%), with variable levels of detail (some in 134 [59%] and precise in 50 [22%]). Standardisation of intervention or some aspect of this was reported in 25 (22%) trials, but only one specified that adherence to the study protocol would be monitored.
    CONCLUSIONS: The quality of the reporting standards of RCTs investigating lower limb endovascular treatments is severely limited because the interventions are poorly described, standardised, and reported. PROSPERO registration number: CRD42022288214.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    研究人员越来越多地测量癌症治疗期间骨骼肌(SM)和脂肪组织(AT)的变化,以了解对患者预后的影响。最近的荟萃分析报道了这篇文献中的高度异质性,表示所得估计的不确定性。以姑息性化疗为范例,我们旨在系统地总结评估癌症治疗过程中SM和AT变化的研究中变异性的来源,并为未来的研究提出标准,以便进行可靠的荟萃分析.包括在实体瘤姑息性化疗期间测量成年患者计算机断层扫描定义的SM和/或AT变化的研究,没有日期或地理限制。在按摘要/标题筛选的2496种出版物中,全文回顾83篇,抽取38篇,代表8个肿瘤部位的34个独特队列。基线测量的时间通常定义为治疗前,而终点时间从治疗开始后6周到进展时间不等。小于50%指定了测量之间的实际时间间隔。测量误差很少讨论(8/34)。单个公制(cm2/m2,cm2或%)用于描述18/34队列中的SM变化,而10/34提供了多个指标,6/34没有描述性指标。10/34队列的AT变化指标和性别特异性报告可用。在24种出版物中评估了SM丢失和总生存期之间的关联。SM损失的分类范围从任何损失到可变时间间隔内>14%的损失。年龄和性别是最常见的协变量,50%的模型有疾病反应。尽管有大量的数据和努力,研究设计中的异质性,报告和统计分析阻碍了关于癌症治疗期间SM和AT变化的严重程度和结局的证据综合.研究设计的拟议标准包括选择同质队列,明确基线/终点时间的定义,并注意测量误差。标准报告应包括按性别分列的基线SM和AT,实际扫描间隔,SM和AT使用多个度量和观察到的变化范围的可视化变化。按性别报告将增进对SM和AT变化中性二态性的理解。评估组织变化对结果的影响需要调整相关的协变量和并发疾病反应。研究人员和出版商采用这些标准将改变当前的范式,以实现对未来研究的荟萃分析,并将该领域推向SM和AT变更在临床护理中的有意义的应用。
    Investigators are increasingly measuring skeletal muscle (SM) and adipose tissue (AT) change during cancer treatment to understand impact on patient outcomes. Recent meta-analyses have reported high heterogeneity in this literature, representing uncertainty in the resulting estimates. Using the setting of palliative-intent chemotherapy as an exemplar, we aimed to systematically summarize the sources of variability among studies evaluating SM and AT change during cancer treatment and propose standards for future studies to enable reliable meta-analysis. Studies that measured computed tomography-defined SM and/or AT change in adult patients during palliative-intent chemotherapy for solid tumours were included, with no date or geographical limiters. Of 2496 publications screened by abstract/title, 83 were reviewed in full text and 38 included for extraction, representing 34 unique cohorts across 8 tumour sites. The timing of baseline measurement was frequently defined as prior to treatment, while endpoint timing ranged from 6 weeks after treatment start to time of progression. Fewer than 50% specified the actual time interval between measurements. Measurement error was infrequently discussed (8/34). A single metric (cm2 /m2 , cm2 or %) was used to describe SM change in 18/34 cohorts, while multiple metrics were presented for 10/34 and no descriptive metrics for 6/34. AT change metrics and sex-specific reporting were available for 10/34 cohorts. Associations between SM loss and overall survival were evaluated in 24 publications, with classification of SM loss ranging from any loss to >14% loss over variable time intervals. Age and sex were the most common covariates, with disease response in 50% of models. Despite a wealth of data and effort, heterogeneity in study design, reporting and statistical analysis hinders evidence synthesis regarding the severity and outcomes of SM and AT change during cancer treatment. Proposed standards for study design include selection of homogenous cohorts, clear definition of baseline/endpoint timing and attention to measurement error. Standard reporting should include baseline SM and AT by sex, actual scan interval, SM and AT change using multiple metrics and visualization of the range of change observed. Reporting by sex would advance understanding of sexual dimorphism in SM and AT change. Evaluating the impact of tissue change on outcomes requires adjustment for relevant covariates and concurrent disease response. Adoption of these standards by researchers and publishers would alter the current paradigm to enable meta-analysis of future studies and move the field towards meaningful application of SM and AT change to clinical care.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Systematic Review
    目的:概述各种败血症国际疾病分类(ICD)编码方法及其诊断准确性。
    方法:我们在1991年至2020年之间进行了系统的范围审查(搜索词:脓毒症,编码,和流行病学),包括报告脓毒症ICD编码方法准确性的研究。研究按ICD编码方法分组,诊断准确性参数的数量,ICD版本,参考标准,设计,国家,设置,数据集类型和败血症定义。ICD编码方法分为显式或隐式,与明确的方法进一步分为宽的和窄的组。描述性统计用于提供数据。
    结果:我们分析了17项研究,其中16人(94.1%)使用回顾性医学图表审查作为临床败血症的参考标准,8人(47.1%)使用医院管理数据来识别脓毒症.对各种ICD编码方法进行了53次评估,其中32种(60.4%)是显式方法,21种(39.6%)是隐式方法。编码方法的灵敏度中位数<75%,但特异性中位数>85%。然而,所有ICD编码方法的诊断准确性参数差异很大.大多数研究显示出较高的方法学质量。
    结论:目前的ICD编码方法都不是鉴别脓毒症的最佳方法。
    OBJECTIVE: To provide an overview of various sepsis International Classification of Diseases (ICD) coding methods and their diagnostic accuracy.
    METHODS: We undertook a systematic scoping review between 1991 and 2020 (search terms: sepsis, coding, and epidemiology) to include studies reporting the accuracy of a sepsis ICD coding method. Studies were grouped by ICD coding method, number of diagnostic accuracy parameters, ICD version, reference standard, design, country, setting, type of dataset and sepsis definition. ICD coding methods were categorised as explicit or implicit, with the explicit methods further divided into wide and narrow groups. Descriptive statistics were used to present data.
    RESULTS: We analysed 17 studies, of which 16 (94.1%) used retrospective medical chart review as the reference standard for clinical sepsis, and eight (47.1%) used hospital administrative data to identify sepsis. There were 53 assessments of various ICD coding methods, with 32 (60.4%) of them being explicit and 21 (39.6%) implicit methods. The coding methods had a median sensitivity of <75% but a median specificity of >85%. However, a wide variation was noted in the diagnostic accuracy parameters of all ICD coding methods. Most of the studies showed high methodological quality.
    CONCLUSIONS: None of the current ICD coding methods is optimal for identifying sepsis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    大多数诊断研究从分析中排除缺失值和不确定的结果,或者应用简单的方法导致有偏差的准确性估计。这可能是由于缺乏可用性或对适当方法的认识。本范围审查旨在概述诊断准确性研究中参考标准或指标测试中处理缺失值和不确定结果的策略。在MEDLINE进行系统的文献检索,科克伦图书馆,和WebofScience,我们可以识别出许多文章,这些文章提出了解决参考标准中缺失值的方法。还有几篇文章描述了有关索引测试中缺失值或不确定结果的方法。后者包括归责,频率论和贝叶斯可能性,基于模型,和潜在的类方法。虽然参考标准中缺失值的方法在实践中经常使用,对于解决索引测试中的缺失值和不确定结果的方法,情况并非如此。我们对可用方法的全面概述和描述可能会进一步提高对这些方法的认识,并将增强其应用。未来的研究需要比较这些方法在不同条件下的性能,以便为它们在各种诊断准确性研究场景中的使用提供有效和可靠的建议。
    Most diagnostic studies exclude missing values and inconclusive results from the analysis or apply simple methods resulting in biased accuracy estimates. This may be due to the lack of availability or awareness of appropriate methods. This scoping review aimed to provide an overview of strategies to handle missing values and inconclusive results in the reference standard or index test in diagnostic accuracy studies. Conducting a systematic literature search in MEDLINE, Cochrane Library, and Web of Science, we could identify many articles proposing methods for addressing missing values in the reference standard. There are also several articles describing methods regarding missing values or inconclusive results in the index test. The latter encompass imputation, frequentist and Bayesian likelihood, model-based, and latent class methods. While methods for missing values in the reference standard are regularly applied in practice, this is not true for methods addressing missing values and inconclusive results in the index test. Our comprehensive overview and description of available methods may raise further awareness of these methods and will enhance their application. Future research is needed to compare the performance of these methods under different conditions to give valid and robust recommendations for their usage in various diagnostic accuracy research scenarios.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号