Observer variation

观察者变化
  • 文章类型: Journal Article
    光化性角化病(AK)是一种常见的癌前皮肤疾病,主要影响皮肤白皙和大量紫外线暴露的老年男性。AK的临床意义与其恶性转化和发展为鳞状细胞癌(SCC)的潜力有关。AK的准确诊断对于适当的治疗至关重要,疗效评估,降低发展SCC的风险。然而,由于当前诊断工具的主观性质,临床医生的变异性对实现一致和可靠的AK诊断提出了重大挑战。因此,没有普遍接受的测量AK的标准。这篇综述评估了当前用于评估和诊断AK的方法,通过观察者之间和观察者之间的共识,关注临床医生的变异性。八项经过同行评审的研究调查了AK评估的各种方法的可靠性,表明观察者之间或观察者之间的共识存在很大差异。大多数方法只显示轻微到中等的可靠性。一些人认为,共识讨论和简化的评定量表可以适度提高诊断的可靠性。然而,仍然存在变异性以及缺乏普遍接受的AK测量标准,这凸显了需要更稳健和标准化的诊断和评估方法.审查强调需要改进诊断工具和标准化方法,以提高AK评估的准确性和可靠性。它还提出了一种使用1,3-二羟基丙酮(DHA)染色的新型检查方法,可以改善AK病变的可视化和识别。这些领域的进步具有巨大的潜力,在AK管理中有希望更好的临床实践和患者预后。
    Actinic keratosis (AK) is a common precancerous skin condition predominantly affecting older males with fair skin and significant UV exposure. The clinical significance of AK is related to its potential for malignant transformation and progression to squamous cell carcinoma (SCC). Accurate diagnosis of AK is essential for adequate treatment, evaluation of therapeutic efficacy, and mitigating the risk of developing SCC. However, clinician variability due to the subjective nature of current diagnostic tools presents significant challenges to achieving consistent and reliable AK diagnoses. Thus, there is no universally accepted standard for measuring AK.This review evaluates current methods for evaluating and diagnosing AK, focusing on clinician variability through inter- and intraobserver agreement. Eight peer-reviewed studies investigating the reliability of various approaches for AK evaluation show substantial variability in interobserver or intraobserver agreement, with most methods demonstrating only slight to moderate reliability. Some suggest that consensus discussions and simplified rating scales can modestly improve diagnostic reliability. However, remaining variability and the lack of a universally accepted standard for measuring AK underscore the need for more robust and standardized diagnostic and evaluation methods.The review emphasizes the need for improved diagnostic tools and standardized methods to enhance the accuracy and reliability of AK assessments. It also proposes applying a novel examination approach using 1,3-dihydroxyacetone (DHA) staining which may improve the visualization and identification of AK lesions. Advancements in these areas have significant potential, promising better clinical practices and patient outcomes in AK management.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:评估在测量慢性颈部疼痛患者颈部活动范围时,测角法和屈光度的测试重测和评估者间的可靠性。
    方法:可靠性研究。选择30例慢性颈痛患者。在相隔7天的两个时间点通过测角法和屈光度法测量颈椎的活动范围。为了表征样品,我们使用了数字疼痛评定量表,疼痛相关的灾难性思维量表,颈部残疾指数。组内相关系数(ICC),计算测量标准误差(SEM)和最小可检测变化(MDC)。使用Spearman相关系数(rho)进行测角和屈光度测量之间的相关性。
    结果:对于测角,我们发现了优异的重测可靠性(ICC≥0.986,SEM≤1.89%,MDC≤5.23%)和评估者间可靠性(ICC≥0.947,SEM≤3.91%,MDC≤10.84%)。同样,我们发现了优异的重测可靠性(ICC≥0.969,SEM≤2.71%,MDC≤7.52%)和评估者间可靠性(ICC≥0.981,SEM≤1.88%,MDC≤5.20%)用于屈光度测定。最后,我们观察到所有颈椎运动(rho≥0.993)的测角和屈伸之间有很强的相关性.
    结论:测角和屈光度测量对于评估慢性颈痛患者的颈椎活动范围是可靠的。
    OBJECTIVE: To assess the test-retest and inter-rater reliability of goniometry and fleximetry in measuring cervical range of motion in individuals with chronic neck pain.
    METHODS: A reliability study. Thirty individuals with chronic neck pain were selected. Cervical range of motion was measured by goniometry and fleximetry at two time points 7 days apart. To characterize the sample, we used the numerical pain rating scale, Pain-Related Catastrophizing Thoughts Scale, and Neck Disability Index. Intraclass correlation coefficient (ICC), standard error of measurement (SEM) and minimum detectable change (MDC) were calculated. Correlations between goniometry and fleximetry measurements were performed using Spearman\'s correlation coefficient (rho).
    RESULTS: For goniometry, we found excellent test-retest reliability (ICC ≥ 0.986, SEM ≤ 1.89%, MDC ≤ 5.23%) and inter-rater reliability (ICC ≥ 0.947, SEM ≤ 3.91%, MDC ≤ 10.84%). Similarly, we found excellent test-retest reliability (ICC ≥ 0.969, SEM ≤ 2.71%, MDC ≤ 7.52%) and inter-rater reliability (ICC ≥ 0.981, SEM ≤ 1.88%, MDC ≤ 5.20%) for fleximetry. Finally, we observed a strong correlation between the goniometry and the fleximetry for all cervical movements (rho ≥ 0.993).
    CONCLUSIONS: Goniometry and fleximetry measurements are reliable for assessing cervical range of motion in individuals with chronic neck pain.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:本研究评估了性腺尿道道轴(GMS)尿道下裂评分和尿道下裂客观阴茎评估(HOPE)评分的评分者之间的协议,旨在标准化疾病分类,以在尿道下裂的临床相关特征中保持一致。
    方法:从两个独立的机构收集儿童尿道下裂的照片。三名评估者使用GMS和HOPE对照片进行评分,由于照片限制,不包括希望中的阴茎扭转和曲率评估。
    结果:共包括528张照片。有了GMS,Fleiss\'多评分者kappa对龟头尿道板的一致性为0.745,0.869用于鼻道,和0.745轴。希望分数,鼻道位置的协议是0.888,鼻道形状为0.669,0.730的龟头形状,和0.708的皮肤形状。鼻道评估形状的一致性较低可能归因于HOPE中缺乏定量分类方法。专家依赖于他们基于所提供的示例照片和他们的索引患者的主观判断。
    结论:虽然专家在使用GMS和HOPE评分标准评估尿道下裂时达成了很高的共识,只有鼻道的位置达成了近乎完美的一致,突出表明当前的评分系统在疾病分类中需要主观因素.
    OBJECTIVE: This study evaluates the inter-rater agreements of both the Glans-Urethral Meatus-Shaft (GMS) hypospadias score and Hypospadias Objective Penile Evaluation (HOPE) score, aiming to standardize disease classification for consistent agreement in clinically relevant characteristics of hypospadias.
    METHODS: Photos of hypospadias in children were collected from two separate institutions. Three raters scored the photos using GMS and HOPE, excluding penile torsion and curvature assessment in HOPE due to photo limitations.
    RESULTS: A total of 528 photos were included. With GMS, Fleiss\' multi-rater kappa showed an agreement of 0.745 for glans-urethral plate, 0.869 for meatus, and 0.745 for shaft. For HOPE scores, the agreements were 0.888 for position of meatus, 0.669 for shape of meatus, 0.730 for shape of glans, and 0.708 for the shape of the skin. The lower agreement in the shape of the meatus evaluation may be attributed to the lack of a quantitative classification method in HOPE. Experts rely on their subjective judgment based on the provided example photos and their index patient.
    CONCLUSIONS: While there is high agreement among experts when evaluating hypospadias using the GMS and HOPE scoring criteria, only the position of the meatus achieved nearly perfect agreement highlighting that the current scoring systems entail a subjective element in disease classification.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    研究了将冠状动脉造影解释为诊断工具的可靠性。此外,我们评估了观察者间冠状动脉病变变异对临床决策的影响.我们进行这项研究的动机之一是研究空白,我们的目标是获得有关不同心脏病专家之间观察者间变异性的最新信息。
    我们的目标是量化独立看过血管造影照片的心脏病专家的观察者间变异性。在先前的研究中,心脏病专家在对冠状动脉狭窄的侵入性冠状动脉造影的视觉评估中存在分歧并不少见。三位在冠状动脉造影方面有丰富经验的心脏病专家,包括每个病人的初级心脏病专家,独立阅读多伦多总医院200名患者的血管造影照片。
    我们的研究表明,所有参与的观察者之间的平均一致性为77.4%;因此,冠状动脉造影解释的观察者间变异性为22.6%.
    冠状动脉造影仍然是指导冠状动脉病变的金标准技术。有时候,冠状动脉造影结果低估或高估病变的功能严重程度。在通过有创冠状动脉造影解释冠状动脉狭窄的严重程度时,还应考虑观察者之间的变异性。这项研究表明,关于冠状动脉造影的观察者间变异性仍然存在(22.6%)。
    通俗易懂的语言总结:诊断冠状动脉狭窄的金标准方法,有创冠状动脉造影也有一些挑战。这些挑战之一是各种心脏病专家在确定每种冠状动脉狭窄的严重程度方面的差异。在这项研究中,我们重点研究了冠状动脉造影解释中观察者间变异性的差异.三名有冠状动脉造影经验的心脏病专家分别阅读了每位患者的冠状动脉造影照片。总的来说,选择多伦多总医院有血管造影史的患者200例。研究表明,所有参与的心脏病专家对冠状动脉造影结果的总体一致性为77.4%。换句话说,在读者中观察到22.6%的观察者间变异性。
    UNASSIGNED: The reliability of interpretation of coronary angiography as a diagnostic tool was investigated. Furthermore, the impact of interobserver variability of coronary lesions on clinical decision-making was assessed. One of our motivations to do this research was the research gaps and our aim to have up-to-date information regarding interobserver variability among different cardiologists.
    UNASSIGNED: Our objective was to quantify interobserver variability among cardiologists who have seen angiograms independently. Disagreement among cardiologists in the visual assessment of invasive coronary angiography of coronary artery stenosis is not uncommon in previous studies. Three cardiologists with extensive experience in coronary angiography, including the primary cardiologist of each patient, read the angiograms of 200 patients from Toronto General Hospital independently.
    UNASSIGNED: Our research showed the mean agreement among all participating observers was 77.4%; therefore, the interobserver variability of coronary angiography interpretation was 22.6%.
    UNASSIGNED: Coronary angiography is still the gold-standard technique for guidance regarding coronary lesions. Sometimes, coronary angiography results in underestimation or overestimation of a lesion\'s functional severity. Interobserver variability should also be considered when interpreting the severity of coronary stenoses via invasive coronary angiography. This research shows that interobserver variability regarding coronary angiograms is still present (22.6%).
    Plain language summary: The gold-standard method for diagnosing coronary stenosis, invasive coronary angiography has some challenges too. One of these challenges has been the difference among various cardiologists regarding determination of severity of each coronary stenosis. In this study, we focused on differences in interobserver variability in coronary angiography interpretation. Three cardiologists who were experienced in coronary angiography read each patient’s coronary angiogram separately. Overall, 200 patients with a history of angiography at Toronto General Hospital were selected randomly. The research showed that overall agreement among all participating cardiologists with regard to the reading of coronary angiograms was 77.4%. In other words, interobserver variability of 22.6% was seen among the readers.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:本研究旨在研究二分法前后视觉可访问伦勃朗图像(VASARI)特征集的观察者内部和观察者之间的一致性,以及胶质母细胞瘤(GBM)患者的双向VASARI特征与总生存期(OS)之间的关系。
    方法:这项回顾性研究纳入了2016年1月至2022年6月间经病理证实为IDH1野生型GBM的351例患者。首先,VASARI特征由四位具有不同经验水平的放射科医师在二分法之前和之后进行评估。计算科恩的卡帕系数(κ)来衡量观察者内部和观察者之间的一致性。然后,在使用倾向得分匹配对混杂因素进行调整后,Kaplan-Meier曲线用于比较每个二分VASARI特征的OS差异。接下来,以3:2的比例将患者随机分为训练集(n=211)和测试集(n=140).根据训练集,采用Cox比例风险回归分析建立联合模型和临床模型来预测OS,并用测试集评估了模型的性能。
    结果:在二分法后,κ值为0.61-0.8的11个VASARI特征显示出几乎完美的一致性,所有读取器的κ值范围为0.874-1.000。7个VASARI特征与GBM患者OS相关。对于操作系统预测,组合模型在两个训练集中都优于临床模型(C指数,0.762vs.0.723)和测试集(C指数,0.812vs.0.702)。
    结论:二分性VASARI特征表现出优异的观察者间和观察者内一致性。组合模型在OS预测方面优于临床模型。
    OBJECTIVE: This study aimed to investigate the intra- and inter-observer consistency of the Visually Accessible Rembrandt Images (VASARI) feature set before and after dichotomization, and the association between dichotomous VASARI features and the overall survival (OS) in glioblastoma (GBM) patients.
    METHODS: This retrospective study included 351 patients with pathologically confirmed IDH1 wild-type GBM between January 2016 and June 2022. Firstly, VASARI features were assessed by four radiologists with varying levels of experience before and after dichotomization. Cohen\'s kappa coefficient (κ) was calculated to measure the intra- and inter-observer consistency. Then, after adjustment for confounders using propensity score matching, Kaplan-Meier curves were used to compare OS differences for each dichotomous VASARI feature. Next, patients were randomly stratified into a training set (n = 211) and a test set (n = 140) in a 3:2 ratio. Based on the training set, Cox proportional hazards regression analysis was adopted to develop combined and clinical models to predict OS, and the performance of the models was evaluated with the test set.
    RESULTS: Eleven VASARI features with κ value of 0.61-0.8 demonstrated almost perfect agreement after dichotomization, with the range of κ values across all readers being 0.874-1.000. Seven VASARI features were correlated with GBM patient OS. For OS prediction, the combined model outperformed the clinical model in both training set (C-index, 0.762 vs. 0.723) and test set (C-index, 0.812 vs. 0.702).
    CONCLUSIONS: The dichotomous VASARI features exhibited excellent inter- and intra-observer consistency. The combined model outperformed the clinical model for OS prediction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:卡尺功能用于全厚度黄斑孔(FTMHs)的手动测量。我们旨在研究是否可以在两个常用制造商之间的观察者间差异之外检测到可重复的差异,这些制造商在光谱域光学相干断层扫描(OCT)中的手动卡尺设备中与FTMH相关的指标。
    方法:这是非干预性的,回顾性,观察性研究。两名独立的观察者检查了8只眼睛(16个OCT)扫描和128个测量值(最小线性直径(MLD),FTMHs的基底直径和两侧的孔高度),采用海德堡光谱和TopconTriton(OCT机器)。通过Bland-Altman图和组内相关系数(ICC)分析,分析了测量的观察者间协议和OCT机器协议。Spectralis和Triton的水平b扫描间距为125µm和50µm,分别。
    结果:总体而言,我们报告了观察者间(ICC0.991(95%CI0.985~0.995,p<0.001))和OCT机器(ICC0.993(95%CI0.987~0.996,p<0.001))变异性的绝对一致性.与Spectralis相比,Triton的较低水平分辨率会导致观察者之间的差异,在较小的水平测量。如果选择不同的参考扫描,Spectralis中较低的水平扫描密度会导致观察者之间相对较大的变化,和一贯较小的MLD测量比Triton。没有1:1比例的垂直测量导致不准确的夸大的倾斜垂直测量。Calliper功能以其他方式显示相同的校准。
    结论:我们报告了观察者和OCT机器在测量方面的良好一致性。然而,本文展示了几个可能影响FTMHs眼睛测量结果可靠性的因素,例如孔的尺寸以及不同的图像分辨率,密度扫描协议或OCT机器观察平台的垂直缩放。
    OBJECTIVE: The calliper function is used for manual measurements of full thickness macular holes (FTMHs). We aimed to investigate whether a reproducible difference could be detected beyond interobserver variability between two commonly used manufacturers in their manual calliper facility in spectral domain optical coherence tomography (OCT) for metrics related to FTMH.
    METHODS: This is a non-interventional, retrospective, observational study. Two independent observers examined 8 eyes (16 OCT) scans and 128 measurements (minimal linear diameter (MLD), basal diameter and hole height on both sides) of FTMHs, taken on Heidelberg Spectralis and Topcon Triton (OCT machines). The interobserver agreement and OCT machine agreement of measurements were analysed by Bland-Altman plots and intraclass correlation coefficient (ICC) analysis. Spectralis and Triton had 125 µm and 50 µm horizontal b-scan spacing, respectively.
    RESULTS: Overall, we report high absolute agreement in interobserver (ICC 0.991 (95% CI 0.985 to 0.995, p<0.001)) and OCT machine (ICC 0.993 (95% CI 0.987 to 0.996, p<0.001)) variability. Lower horizontal resolution in Triton compared with Spectralis leads to interobserver variability, in smaller horizontal measurements. Lower horizontal scanning density in Spectralis lead to relatively large interobserver variation if different reference scans were chosen, and consistently smaller MLD measurements than Triton. Vertical measurements without 1:1 scaling lead to inaccurate exaggerated oblique vertical measurements. Calliper function appears otherwise identically calibrated.
    CONCLUSIONS: We report excellent interobserver and OCT machine agreement in measurements. However, the paper shows several factors that could influence the reliability of measurements acquired in eyes with FTMHs, such as the dimension of the hole as well as different image resolution, density scanning protocols or vertical scaling of the OCT machines viewing platform.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:计算特定尺寸剂量估计值(SSDE)需要根据计算机断层扫描(CT)图像测量患者的前后(AP)和侧向厚度。然而,由于观察者之间和观察者之间的差异,这些测量可能会发生变化。本研究旨在研究这些变化对计算的SSDE准确性的影响。
    方法:邀请了4名具有1-10年经验的放射技师测量30个胸部的AP和外侧厚度,腹部,和骨盆CT图像。图像来自基于互联网的数据库,并匿名用于分析。使用MicroDicom软件训练观察者进行测量,并要求1周后重复测量。这项研究获得了Taibah大学机构审查委员会的批准,并从观察员处获得书面知情同意书.使用Python库Pingouin(版本0.5.3)进行统计分析,Seaborn(版本0.12.2),和Matplotlib(版本3.7.1)。
    结果:研究表明,对于计算的有效直径和AP厚度测量,观察者之间具有出色的一致性,组内相关系数(ICC)值分别为0.95和0.96。横向厚度测量的一致性较低,ICC值为0.89。第二轮测量产生了几乎相同水平的观察员之间的协议,有效直径的ICC值为0.97,1.0用于AP厚度,横向厚度为0.88。当观察者的一致性被检查时,计算的有效直径具有出色的一致性,所有观察者的ICC值范围为0.91至1.0。尽管横向厚度测量的一致性较低,但仍观察到了这一点。其ICC值范围为0.78至1.0。
    结论:研究结果表明,计算SSDE所需的测量结果对于观察者之间和观察者之间的差异是稳健的。这对于SSDE的临床使用为CT扫描设置诊断参考水平很重要。
    BACKGROUND: Calculating size-specific dose estimates (SSDEs) requires measurement of the patient\'s anteroposterior (AP) and lateral thickness based on computed tomography (CT) images. However, these measurements can be subject to variation due to inter-observer and intra-observer differences. This study aimed to investigate the impact of these variations on the accuracy of the calculated SSDE.
    METHODS: Four radiographers with 1-10 years of experience were invited to measure the AP and lateral thickness on 30 chest, abdomen, and pelvic CT images. The images were sourced from an internet-based database and anonymized for analysis. The observers were trained to perform the measurements using MicroDicom software and asked to repeat the measurements 1 week later. The study was approved by the institutional review board at Taibah University, and written informed consent was obtained from the observers. Statistical analyses were performed using Python libraries Pingouin (version 0.5.3), Seaborn (version 0.12.2), and Matplotlib (version 3.7.1).
    RESULTS: The study revealed excellent inter-observer agreement for the calculated effective diameter and AP thickness measurements, with Intraclass correlation coefficients (ICC) values of 0.95 and 0.96, respectively. The agreement for lateral thickness measurements was lower, with an ICC value of 0.89. The second round of measurements yielded nearly the same levels of inter-observer agreement, with ICC values of 0.97 for the effective diameter, 1.0 for AP thickness, and 0.88 for lateral thickness. When the consistency of the observer was examined, excellent consistency was found for the calculated effective diameter, with ICC values ranging from 0.91 to 1.0 for all observers. This was observed despite the lower consistency in the lateral thickness measurements, which had ICC values ranging from 0.78 to 1.0.
    CONCLUSIONS: The study\'s findings suggest that the measurements required for calculating SSDEs are robust to inter-observer and intra-observer differences. This is important for the clinical use of SSDEs to set diagnostic reference levels for CT scans.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • DOI:
    文章类型: Journal Article
    无人值守的自动办公室血压测量(uAOBP)开始被认为是在实践中血压测量的首选方法。支持这一立场的证据越来越多。uAOBP通过减少测量误差来降低观察者内部和观察者之间的变异性。此外,它降低了白大衣的效果。目前验证和使用最多的三种协议,与不同的国家和/或设置偏好不同的协议:BpTRU协议,SPRINT协议和30分钟OBP协议。在本概述中,所有三个协议都是根据当前可用的证据进行广泛讨论的,包括赞成和反对,准确性和预后价值。
    Unattended automatic office blood pressure measurement (uAOBP) is starting to become recognised as the preferred method of blood pressure measurement in practice. The body of evidence to support this position is growing. uAOBP decreases intra- and interobserver variability by reducing measurement error. In addition it reduces the white coat effect. Currently three protocols are validated and used the most, with different countries and/or settings preferring different protocols: BpTRU protocol, SPRINT protocol and 30-minute OBP protocol. In this overview all three protocols are discussed extensively based on current available evidence including pro and con\'s, accuracy and prognostic value.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    骨质疏松性椎体压缩性骨折(OVF)通常会导致诊断问题,并且与转移性椎体压缩性骨折(MVF)的年龄段相吻合。尽管射线照相是第一种诊断技术,它通常是不准确的描绘脱矿和软组织病变。磁共振成像(MRI)是诊断的选择。OVF最相关的体征是椎体内积液或积液信号,其他椎体畸形无水肿及高龄。MVF诊断最相关的发现是软组织肿块和椎弓根强度信号不对称。然而,这些发现在临床实践中的可重复性是中等的。
    Osteoporotic vertebral compression fractures (OVF) usually lead to problems of diagnosis and coincide with the age group of metastatic vertebral compression fractures (MVF). Although radiography is the first diagnostic technique, it is generally inaccurate for depicting demineralization and soft tissue lesions. Magnetic resonance imaging (MRI) is the diagnostic choice. The most relevant signs of OVF are Intravertebral fluid collection or fluid signal, other vertebral deformities without edema and older age. Among the most relevant findings for MVF diagnosis are soft tissue mass and pedicle intensity signal asymmetries. However, reproducibility of these findings in clinical practice is moderate.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目标:开发一种简单的,主观,以及可靠的异维甲酸诱导的动脉造影变化的分级量表。
    方法:在分析了从全身异维甲酸使用者获得的囊图图像后,提出了一个分级量表,并命名为“细部图健康评分”。“评分范围从1到3,腺体反射率和可识别的边缘降低。共有11名医学专业人员被要求使用拟议的量表对从异维A酸使用者获得的10个眼图图像进行分级,并分为三组:(A)具有眼图经验的眼科医生,(B)没有眼动脉造影经验的眼科医生,和(C)放射科医师。确定了kappa统计量以测试评估者间的可靠性。
    结果:总κ约为0.64。A组的kappa得分,B,C分别为0.78、0.59和0.90。2年级的kappa分数最低(A为0.62、0.35和0.82,B,C,分别)和3级最高(A为0.78、0.90和1.0,B和C,分别)。此外,C组的kappa得分最高,B组最低。
    结论:指数健康评分表现出良好的评分者间可靠性,特别是在严重的情况下。
    OBJECTIVE: To develop a simple, subjective, and reliable grading scale for isotretinoin-induced meibography changes.
    METHODS: After analyzing meibography images obtained from systemic isotretinoin users, a grading scale was proposed and named \"meibography health score.\" The score ranged from 1 to 3, with decreasing gland reflectivity and identifiable margins. A total of 11 medical professionals were asked to grade 10 meibography images obtained from isotretinoin users using the proposed scale and were divided into three groups: (A) ophthalmologists with experience with meibography, (B) ophthalmologists with no experience with meibography, and (C) radiologists. The kappa statistic was determined to test interrater reliability.
    RESULTS: The overall kappa was approximately 0.64. The kappa scores for Groups A, B, and C were 0.78, 0.59, and 0.90, respectively. Grade 2 had the lowest kappa scores (0.62, 0.35, and 0.82 for A, B, and C, respectively) and grade 3 the highest (0.78, 0.90, and 1.0 for A, B and C, respectively). Furthermore, Group C had the highest kappa scores and Group B the lowest.
    CONCLUSIONS: The meibography health score exhibited good interrater reliability, particularly in severe cases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号