Interobserver agreement

观察员间协定
  • 文章类型: Journal Article
    背景:由于皮肤病理学领域并不是一门精确的科学,它容易产生个人主观性,有时会导致对某些组织学特征的诊断和评估存在分歧。在黑色素瘤的情况下,一些变量,如回归与低观察者之间的一致性有关。相反,其他变量,如Breslow厚度的测量显示出高再现性。
    目的:我们研究的主要目的是调查60例连续黑色素瘤的多个特征,以建立观察者间的可重复性。
    结果:我们在Manises医院进行了一项观察性和描述性研究,瓦伦西亚,西班牙,IVO基金会,瓦伦西亚,西班牙,和12号医院,马德里,西班牙。所有研究变量的平均一致性水平中等(Cohen的kappa系数统计值=0.5)。最高的一致性对应于息肉形态,色素沉着,溃疡,和太阳弹性沉着症。另一方面,对于细胞多态性和肿瘤坏死的存在,达成了最低水平的协议。
    结论:我们的平均协议水平中等,这反映了一些测量的特征,如细胞多态性或坏死的存在不能用于未来的研究或必须重新定义和再现性,重建。在进行研究时,有必要对研究变量进行分析,以证明其衡量或分类某一特征的有效性。还建议保证变量是可重复的,以便能够将它们用于其他研究或常规临床实践。
    BACKGROUND: Since the field of dermatopathology is not an exact science, it is prone to personal subjectivity, which sometimes causes disagreements on the diagnosis and assessment of some histological features. In the case of melanoma, some variables such as regression are associated with low interobserver agreement. On the contrary, other variables such as the measurement of Breslow thickness show high reproducibility.
    OBJECTIVE: The main objective of our study was to investigate multiple features of 60 consecutive cases of melanoma to establish interobserver reproducibility.
    RESULTS: We conducted an observational and descriptive study at Hospital de Manises, Valencia, Spain, IVO Foundation, Valencia, Spain, and Hospital 12 de Octubre, Madrid, Spain. The mean level of agreement of all study variables was moderate (Cohen\'s kappa coefficient statistic = 0.5). The highest agreement corresponded to polypoid morphology, pigmentation, ulceration, and solar elastosis. On the other hand, the lowest level agreement was reached for the presence of cellular pleomorphism and tumor necrosis.
    CONCLUSIONS: Our mean level of agreement was moderate, which reflects that some of the measured characteristics such as cellular pleomorphism or the presence of necrosis cannot be used for future studies or must be redefined and their reproducibility, reestablished. When conducting a research study, it is necessary to analyze the study variables to demonstrate their validity to measure or classify a certain feature. It is also advisable to warrant that that the variables are reproducible to be able to use them for other studies or in the routine clinical practice.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    程序忠诚是指评估或干预程序的程度(即,独立变量)的实施与规定的协议一致。程序保真度是证明实验和临床治疗的内部有效性的重要因素。先前的评论评估了在已发表的经验文章中是否包含程序忠实,这表明在特定期刊中对程序忠实程序和措施的报道不足。我们对言语行为分析(TAVB)进行了系统评价,以评估2007年至2021年程序保真度报告的趋势。在本报告所述期间,在TAVB发表的253篇文章中,144篇文章(168项研究)符合进一步分析的包容性标准。我们的结果显示,54%的研究报告了程序保真度数据,略高于以前的评论。相比之下,在审查的研究中报告了高百分比的观察者协议数据(即,93%)。包括对结果和应用研究意义的进一步讨论。
    Procedural fidelity refers to the degree to which procedures for an assessment or intervention (i.e., independent variables) are implemented consistent with the prescribed protocols. Procedural fidelity is an important factor in demonstrating the internal validity of an experiment and clinical treatments. Previous reviews evaluating the inclusion of procedural fidelity in published empirical articles demonstrated underreporting of procedural fidelity procedures and measures within specific journals. We conducted a systematic review of The Analysis of Verbal Behavior (TAVB) to evaluate the trends in procedural fidelity reporting from 2007 to 2021. Of the 253 articles published in TAVB during the reporting period, 144 of the articles (168 studies) met inclusionary criteria for further analysis. Our results showed that 54% of studies reported procedural fidelity data, which is slightly higher than previous reviews. In comparison, interobserver-agreement data were reported for a high percentage of studies reviewed (i.e., 93%). Further discussion of results and applied research implications are included.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:脂肪垫细针抽吸标本的刚果红染色是一种用于评估淀粉样蛋白沉积的方法。然而,这些标本可能对细胞病理学家提出诊断挑战。作为正在进行的内部质量改进措施的一部分,这项研究的目的是评估这些标本的部门内部观察者之间的一致性,并确定影响解释变异性的因素。
    方法:有7名参与者,其中包括3名学员,3个细胞病理学家,和1位细胞技术专家.每位参与者回顾了50张刚果红染色的脂肪垫细针抽吸载玻片。解释分为3组:阴性,不确定/可疑,和积极的。与会者还注意到他们在每个案件中遇到的任何口译挑战。
    结果:所有参与者之间仅有轻微的观察者共识(κ=0.133)。按参与者组分层,学员之间的观察者之间的共识与穷人(κ=0.028)略有接近,而细胞病理学家之间的共识是公平的(κ=0.249)。2位观察者之间的最高一致性是2位细胞病理学家之间的一致性,并且一致性水平是中等水平(κ=0.426)。仅有3例(6.0%)观察者意见完全一致,而在25例(50.0%)中,解释有2个类别的差异。参与者报告的主要诊断挑战是当遇到弱或局灶性双折射时,以及由于染色质量差和过度染色而复杂化的病例。
    结论:我们发现所有研究参与者之间仅有轻微的观察者共识。挑战的主要领域是双折射较弱的情况,导致参与者之间的解释差异很大。
    BACKGROUND: Congo red staining of fat pad fine needle aspiration specimens is a method utilized for evaluation of amyloid deposition. However, these specimens can pose diagnostic challenges for cytopathologists. As part of ongoing internal quality improvement measures, the objective of this study was to evaluate the intradepartmental interobserver agreement of these specimens and to identify factors that affect the variability of the interpretations.
    METHODS: There were 7 participants, which included 3 trainees, 3 cytopathologists, and 1 cytotechnologist. Each participant reviewed 50 Congo red stained fat pad fine needle aspiration slides. The interpretations were categorized into 3 groups: negative, indeterminate/suspicious, and positive. The participants also noted any interpretation challenges they encountered for each case.
    RESULTS: There was only slight interobserver agreement among all participants (κ = 0.133). Stratified by participant group, the interobserver agreement among the trainees was slight bordering on poor (κ = 0.028) and among cytopathologists was fair (κ = 0.249). The highest agreement between 2 observers was between 2 cytopathologists and the level of agreement was moderate bordering on fair (κ = 0.426). There were only 3 cases (6.0%) with full agreement among observers, while in 25 cases (50.0%), there were 2 category differences in interpretations. The primary diagnostic challenge reported by participants was when weak or focal birefringence was encountered as well as cases complicated by poor stain quality and overstaining.
    CONCLUSIONS: We found only slight interobserver agreement among all study participants. A major area of challenge was cases with weak birefringence resulting in high variance of interpretation among participants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    评估和比较可能用于诊断2019年冠状病毒病(COVID-19)的磁共振成像(MRI)序列。
    该研究包括42例患者,他们在CT后24小时内因任何原因接受了COVID-19肺炎的胸部计算机断层扫描(CT)和胸部MRI检查。T2加权快速自旋回波周期性旋转重叠的平行线与增强重建(PROPELLER)(T2W-FSE-P),采用稳态采集的快速成像,T2脂肪饱和FSE,在显示COVID-19发现的能力方面,比较了轴向T1肝脏采集与体积加速(LAVA)和单次FSE图像。
    患者的平均年龄为47.2±24岁。在患者中,男性22人(52.4%),女性20人(47.6%)。图像质量评分的观察者间组内系数(ICC)在T2W-FSE-P序列中最高,在T1LAVA序列中最低。所有基于病变的观察者间协议评估均具有统计学意义,kappa值在0.798和0.998之间变化。
    研究中评估的所有5个序列都成功显示了COVID-19的实质发现。由于T2W-FSE-P序列在观察者之间的一致性和ICC中的图像质量得分都是最好的,认为可将其纳入胸部MRI检查以辅助COVID-19的诊断。
    UNASSIGNED: To evaluate and compare magnetic resonance imaging (MRI) sequences that could potentially be used in the diagnosis of coronavirus disease 2019 (COVID-19).
    UNASSIGNED: Included in the study were 42 patient who underwent thorax computed tomography (CT) for COVID-19 pneumonia and thorax MRI for any reason within 24 h after CT. The T2-weighted fast spin echo periodically rotated overlapping parallel lines with enhanced reconstruction (PROPELLER) (T2W-FSE-P), fast imaging employing steady-state acquisition, T2 fat-saturated FSE, axial T1 liver acquisition with volume acceleration (LAVA) and single-shot FSE images were compared in terms of their ability to show COVID-19 findings.
    UNASSIGNED: The mean age of the patients was 47.2 ± 24 years. Of the patients, 22 were male (52.4%) and 20 (47.6%) were female. The interobserver intraclass coefficient (ICC) for the image quality score was the highest in the T2W-FSE-P sequence and lowest in the T1 LAVA sequence. All of the lesion-based evaluations of the interobserver agreement were statistically significant, with the kappa value varying between 0.798 and 0.998.
    UNASSIGNED: All 5 sequences evaluated in the study were successful in showing the parenchymal findings of COVID-19. Since the T2W-FSE-P sequence had the best scores in both interobserver agreement and ICC for the image quality score, it was considered that it can be included in thorax MRI examinations to assist the diagnosis of COVID-19.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:胆汁细胞学(SSBC)评分系统旨在提高胆汁细胞学诊断的准确性。这里,SSBC的实用性得到了多个细胞技术人员的验证。
    方法:由24名细胞技术人员使用SSBC评估胆汁细胞学标本。在使用SSBC(首次评估)之前,根据三类对样品进行评估:良性,不确定,和恶性。然后使用SSBC进行第一评分评估(FSE);将评分系统中的每个项目分类为存在或不存在。分发包含诊断标准的说明书后,使用SSBC进行第二次评分评估(SSE).使用诊断准确性以及观察者之间和观察者之间的共识来评估每种方法。
    结果:在首次评估中,几个样品被评估为不确定。虽然SSE的特异性提高了,与FSE相比,灵敏度和准确性下降。总体观察员间协议对所有参数都是公平的,包括异常染色质,不规则的核间距离,不规则重叠的细胞核,不规则的簇边缘,并在FSE和SSE中进行最终评估。在FSE和SSE中,通过组织学类型进行的最终评估对高分化的管状腺癌略有一致,对低分化的管状腺癌几乎完美一致。对于中分化管状腺癌,协议在FSE是温和的,在SSE是公平的。对于胆管炎,在FSE中观察到轻微的协议,在上交所提高到公平。
    结论:尽管SSBC有望提高特异性,关于SSBC标准和个体间评估差异存在歧义。因此,客观评估方法应该修改。
    BACKGROUND: The scoring system for bile cytology (SSBC) aims to improve bile cytology diagnostic accuracy. Here, the practicality of SSBC was verified by multiple cytotechnologists.
    METHODS: Bile cytological specimens were evaluated by 24 cytotechnologists using SSBC. The samples were assessed before using the SSBC (first-time assessment) according to three categories: benign, indeterminate, and malignant. A first scoring evaluation (FSE) was then performed using SSBC; each item in the scoring system was classified as present or absent. After distributing an instruction sheet with diagnostic criteria, a second scoring evaluation (SSE) was performed using SSBC. Each method was evaluated using diagnostic accuracy and interobserver and intraobserver agreement.
    RESULTS: Several samples were assessed as indeterminate in the first-time assessment. Although the specificity of the SSE improved, the sensitivity and accuracy decreased compared with those of the FSE. The overall interobserver agreement was fair for all parameters, including abnormal chromatin, irregular internuclear distances, irregularly overlapped nuclei, irregular cluster margins, and final evaluation in the FSE and SSE. The final evaluation by histological type exhibited slight agreement for well-differentiated tubular adenocarcinoma and almost perfect agreement for poorly differentiated tubular adenocarcinoma in the FSE and SSE. For moderately differentiated tubular adenocarcinoma, agreement was moderate in the FSE and fair in the SSE. For cholangitis, a slight agreement was observed in the FSE, which improved to fair in the SSE.
    CONCLUSIONS: Although the SSBC is expected to improve specificity, there exists ambiguity regarding SSBC criteria and interindividual assessment differences. Therefore, the objective assessment method should be revised.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:2024年,世界卫生组织(WHO)计划发布WHO软组织细胞病理学报告系统(WHORSSTC)。这个系统建立具有明确定义的类别,标准,并估计软组织肿瘤的恶性肿瘤(ROM)风险。对ROM的估计是基于相对较少的已发表研究。尽管报告系统的可重复性非常重要,但报告系统中并未解决观察者间的可重复性。
    方法:在1985年1月1日至2022年12月31日期间,对所有软组织肿瘤的细胞学标本进行了作者个人咨询文件和教学集(L.J.L.)的手动搜索。本研究仅包括有手术病理随访记录的病例。每个病例的载玻片由三名细胞病理学家独立评估,每个病例被分配到WHORSSTC类别之一。计算每个WHORSSTC类别的ROM。通过kappa和加权kappa统计数据评估了观察者之间的协议。
    结果:按类别划分的恶性肿瘤风险为:类别1:0%,类别2:28%,类别3:57%,第四类:47%,第5类:63%,第6类:88%。评估者之间协议的Kappa统计数据从0.2183到0.3465不等,加权kappa从0.3778到0.5217不等。
    结论:WHORSSTC显示恶性风险从“良性”类别(28%)发展到“恶性”类别(88%)。观察员之间的协议是公平的。
    BACKGROUND: In 2024, the World Health Organization (WHO) is scheduled to publish the WHO Reporting System for Soft Tissue Cytopathology (WHORSSTC). This system establishes categories with well-defined definitions, criteria, and estimated risks of malignancy (ROMs) for soft tissue tumors. The estimates of ROM are based on a relatively small number of published studies. Interobserver reproducibility is not addressed in the reporting system even though reproducibility of a reporting system is highly important.
    METHODS: A manual search of one authors personal consultation files and teaching set (L.J.L.) was conducted for all cytologic specimens of soft tissue tumors accessioned between January 1, 1985 and December 31, 2022. Only cases with documented surgical pathology follow-up were included in the study. Slides from each case were evaluated independently by three cytopathologists with each case assigned to one of the WHORSSTC categories. A ROM for each of the WHORSSTC categories was calculated. Interobserver agreement was evaluated by the kappa and weighted kappa statistics.
    RESULTS: Risk for malignancy by category were: Category 1: 0%, Category 2: 28%, Category 3: 57%, Category 4: 47%, Category 5: 63%, and Category 6: 88%. Kappa statistics for agreement between raters varied from 0.2183 to 0.3465 and weighted kappa varied from 0.3778 to 0.5217.
    CONCLUSIONS: The WHORSSTC showed a progression of malignancy risk from the category \"benign\" (28%) to the category \"malignant\" (88%). Interobserver agreement was only fair.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:很少有细胞学不确定的甲状腺细针穿刺(FNA)带有BRAFV600E。这里,我们评估了Bethesda甲状腺细胞病理学报告系统(TBSRTC)III类(非典型性意义[AUS])具有BRAFV600E的FNA的观察者之间的共识,并将其特征与非BRAFV600E改变的特征进行了对比。注意细胞病理学经验。
    方法:7名评审员评估了5个带有BRAFV600E的AUS甲状腺FNA。对盲人评论家来说,病例与19个属于其他TBSRTC类别的FNA混合,其中除BRAFV600E以外的遗传改变已被鉴定(共24个FNA)。计算了针对“索引”和最流行(“模式”)诊断的观察者间协议。对另外4例BRAFV600E病例进行了独立审查。
    结果:评审人员包括3名学员和3名美国病理学委员会(董事会)认证的细胞病理学家。董事会认证的细胞病理学家,其经验范围从2到15个以上的亚专业实践年,知道AUS费率。在260个澳大利亚FNA中的5个(2%)中鉴定出BRAFV600E。具有更多经验的细胞病理学家之间的观察者共识更高。在11例具有RAS样改变的病例中,有6例的模式诊断与指标诊断不同;在5个BRAFV600EFNA中的4个中,模式诊断为AUS。
    结论:携带BRAFV600E的甲状腺FNA的不确定意义的异常并不常见,但相对可重复,特别是有经验的病理学家。在这种情况下,建议在V600上对BRAF进行排序。
    OBJECTIVE: Few cytologically indeterminate thyroid fine-needle aspirations (FNAs) harbor BRAF V600E. Here, we assess interobserver agreement for The Bethesda System for Reporting Thyroid Cytopathology (TBSRTC) category III (atypia of undetermined significance [AUS]) FNAs harboring BRAF V600E and contrast their features with those harboring non-BRAF V600E alterations, with attention to cytopathology experience.
    METHODS: Seven reviewers evaluated 5 AUS thyroid FNAs harboring BRAF V600E. To blind reviewers, cases were intermixed with 19 FNAs falling within other TBSRTC categories and in which genetic alterations other than BRAF V600E had been identified (24 FNAs total). Interobserver agreement against both \"index\" and most popular (\"mode\") diagnoses was calculated. Four additional BRAF V600E cases were independently reviewed.
    RESULTS: Reviewers included 3 trainees and 3 American Board of Pathology (board)-certified cytopathologists. Board-certified cytopathologists, whose experience ranged from 2 to more than 15 subspecialty practice years, had known AUS rates. BRAF V600E was identified in 5 of 260 (2%) AUS FNAs. Interobserver agreement was higher among cytopathologists with more experience. Mode diagnosis differed from index diagnosis in 6 of 11 cases harboring RAS-like alterations; mode diagnosis was AUS in 4 of 5 BRAF V600E FNAs.
    CONCLUSIONS: Atypia of undetermined significance of thyroid FNAs harboring BRAF V600E is uncommon yet relatively reproducible, particularly among pathologists with experience. It is advisable to sequence BRAF across V600 in such cases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目标:在荷兰,产前心脏造影(aCTG)评估胎儿健康是在产科医生主导的护理中进行的.一项创新举措开始评估aCTG是否用于特定适应症-减少胎动,外部头部版本,或过期妊娠-在非产科医生主导的独立初级保健助产士的护理环境中是可行的。在重组和转移任务和职责时,质量评估至关重要。因此,我们的目的是评估参与荷兰产科护理的四个专业群体之间和内部关于aCTG评估的观察者之间和内部关于aCTG评估的共识。
    方法:这是一项针对47名荷兰初级保健助产士的前瞻性研究,医院的助产士,居民,和产科医生。以1个月的间隔两次评估十个aCTG迹线。为了确保有代表性的样品,我们使用了两组不同的10条aCTG迹线。我们用协议的比例计算了协议的程度。
    结果:四个专业组之间和内部就aCTG分类达成的观察者共识的比例从0.82到0.94不等。观察员内部(0.86-0.94)的每个专业小组的协议比例略高于观察员之间的协议。对于各种aCTG组件,观察者之间达成一致的比例从0.64(存在收缩)到0.98(基线心率)不等.
    结论:在健康女性的aCTG痕迹分类中,产妇护理专业人员之间和内部的一致水平比例相当。这意味着这些专业团体同样能够很好地对健康孕妇的aCTG进行分类。
    OBJECTIVE: In the Netherlands, antenatal cardiotocography (aCTG) to assess fetal well-being is performed in obstetrician-led care. An innovative initiative was started to evaluate whether aCTG for specific indications-reduced fetal movements, external cephalic version, or postdate pregnancy-is feasible in non-obstetrician-led care settings by independent primary care midwives. Quality assessment is essential when reorganizing and shifting tasks and responsibilities. Therefore, we aimed to assess the inter- and intraobserver agreement for aCTG assessments between and within four professional groups involved in Dutch maternity care regarding the overall classification and assessment of the various components of aCTG.
    METHODS: This was a prospective study among 47 Dutch primary care midwives, hospital-based midwives, residents, and obstetricians. Ten aCTG traces were assessed twice at a 1 month interval. To ensure a representative sample, we used two different sets of 10 aCTG traces each. We calculated the degree of agreement using the proportions of agreement.
    RESULTS: The proportions of agreement for interobserver agreement on the classification of aCTG between and within the four professional groups varied from 0.82 to 0.94. The proportions of agreement for each professional group were slightly higher for intraobserver (0.86-0.94) than for interobserver agreement. For the various aCTG components, the proportions of agreement for interobserver agreement varied from 0.64 (presence of contractions) to 0.98 (baseline heart frequency).
    CONCLUSIONS: The proportion of agreement levels between and within the maternity care professionals in the classification of aCTG traces among healthy women were comparable. This means that these professional groups are equally well able to classify aCTGs in healthy pregnant women.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:改良Rutgeerts评分(mRS)广泛用于评估回结肠切除术后克罗恩病(CD)的内镜术后复发(ePOR),以指导治疗决策。为了提高这种内镜评估的有效性和预后价值,提出了两个新的分数。这项研究评估了当前(mRS)和新的内窥镜评分对CD中ePOR的观察者之间的一致性。
    方法:16位荷兰学术和非学术IBD专家评估了从9个荷兰中心检索的术后CD患者(n=66)的内镜录像(n=71)。四个胃肠病学家使用mRS和新提议的内窥镜评分评估每个视频的炎症程度:REMIND评分(吻合和新末端回肠的单独评分)和更新的Rutgeerts评分(吻合线的病变评估,回肠入口,回肠体和新回肠)。此外,回肠体的病变,回肠入口,新发回肠,分别评估结肠和/或回肠盲环.使用Fleiss\'加权kappa评估了观察员之间的协议。
    结果:MRS的Fleiss\'加权κ为0.67(95%置信区间[CI]0.59-0.74)。新末端回肠病变的REMIND评分加权κ为0.73(95%CI0.65-0.80),吻合口病变为0.46(95%CI0.35-0.58)。更新的Rutgeerts评分的加权kappa为0.69(95%CI0.62-0.77)。回肠体病变的加权κ,回肠入口,新发回肠,结肠和回肠盲环为0.61(95%CI0.49-0.73),0.63(95%CI0.54-0.72),0.61(95%CI0.49-0.74),0.83(95%CI0.62-1.00)和0.68(95%CI0.46-0.89)。
    结论:mRS的观察者间协议是实质性的。同样,观察员之间的协议对于更新的Rutgeerts分数是实质性的。根据REMIND评分,对于新回肠的病变,观察者之间的共识是实质性的,而仅中度吻合口病变。由于临床实践中的治疗决策是基于这些评估,并且这些评分在临床研究中用作结果度量,进一步完善观察员间协定至关重要。
    OBJECTIVE: The modified Rutgeerts score (mRS) is widely used for the assessment of endoscopic postoperative recurrence (ePOR) in Crohn\'s disease (CD) after ileocolic resection to guide therapeutic decisions. To improve the validity and prognostic value of this endoscopic assessment, 2 new scores have been proposed. This study assessed the interobserver agreement of the current score (mRS) and the new endoscopic score for ePOR in CD.
    METHODS: Sixteen Dutch academic and nonacademic inflammatory bowel disease specialists assessed endoscopic videos (n = 71) of postoperative CD patients (n = 66) retrieved from 9 Dutch centers. Each video was assessed for degree of inflammation by 4 gastroenterologists using the mRS and the new proposed endoscopic score: the REMIND score (separate score of anastomosis and neoterminal ileum) and the updated Rutgeerts score (assessment of lesions at the anastomotic line, ileal inlet, ileal body, and neoterminal ileum). In addition, lesions at the ileal body, ileal inlet, neoterminal ileum, and colonic and/or ileal blind loop were separately assessed. Interobserver agreement was assessed by using Fleiss\' weighted kappa.
    RESULTS: Fleiss\' weighted kappa for the mRS was .67 (95% confidence interval [CI], .59-.74). The weighted kappa for the REMIND score was .73 (95% CI, .65-.80) for lesions in the neoterminal ileum and .46 (95% CI, .35-.58) for anastomotic lesions. The weighted kappa for the updated Rutgeerts score was .69 (95% CI, .62-.77). The weighted kappa for lesions in the ileal body, ileal inlet, neoterminal ileum, and colonic and ileal blind loop was .61 (95% CI, .49-.73), .63 (95% CI, .54-.72), .61 (95% CI, .49-.74), .83 (95% CI, .62-1.00) and .68 (95% CI, .46-.89), respectively.
    CONCLUSIONS: The interobserver agreement of the mRS is substantial. Similarly, the interobserver agreement is substantial for the updated Rutgeerts score. According to the REMIND score, the interobserver agreement was substantial for lesions in the neoterminal ileum, although only moderate for anastomotic lesions. Because therapeutic decisions in clinical practice are based on these assessments, and these scores are used as outcome measure in clinical studies, further improvement of the interobserver agreement is essential.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:肛门细胞学是高危人群肛门癌筛查的工具。除了准确性,解释的可重复性至关重要.作者评估了两名细胞病理学家之间肛门细胞学解释的一致性。
    方法:两名具有至少10年宫颈细胞学专业知识的读者对人类免疫缺陷病毒(HIV)阴性男男性行为者(MSM)的液基细胞学切片进行了评估。对解释不一致的病例进行了审查,达成了共识。使用专有的HPV基因分型试验进行人乳头瘤病毒(HPV)基因分型。计算未加权和加权Cohenκ和95%置信区间(CI)值。
    结果:总体而言,评估了713张足以解释的幻灯片(MSM:中位年龄,33年)。对620个样品(87.0%)进行HPV测试。考虑二分法解释(上皮内病变或恶性肿瘤与阴性意义不明或更差的非典型鳞状细胞),两位读者之间的粗略一致性为93.3%(kappa=0.82;95%CI,0.77-0.87).一旦就不和谐案件达成共识,发现上皮内病变或恶性肿瘤类别阴性的最佳一致性(528个样本中的511个;96.8%),而意义未确定的非典型鳞状细胞显示出最低的一致性(117个样本中的90个,76.9%)。考虑到各个细胞学类别,总体一致性为92.1%(κ=0.85;95%CI,0.81-0.89).不一致的解释与高危型HPV感染无关。HPV16感染,或MSM年龄。
    结论:在这项研究中,结果表明观察者之间达成了极好的共识,证实了在人类免疫缺陷病毒阴性MSM的情况下使用肛门细胞学。
    BACKGROUND: Anal cytology represents a tool for anal cancer screening in high-risk populations. In addition to accuracy, the reproducibility of the interpretation is of key importance. The authors evaluated the agreement of anal cytologic interpretation between two cytopathologists.
    METHODS: Liquid-based cytologic slides from human immunodeficiency virus (HIV)-negative men who have sex with men (MSM) were evaluated by two readers with at least 10 years of expertise in cervical cytology. Cases with a discordant interpretation were reviewed, and a consensus was reached. Human papillomavirus (HPV) genotyping was performed using a proprietary HPV genotyping test. Unweighted and weighted Cohen kappa and 95% confidence interval (CI) values were calculated.
    RESULTS: Overall, 713 slides that were adequate for interpretation were evaluated (MSM: median age, 33 years). An HPV test was performed on 620 samples (87.0%). Considering a dichotomous interpretation (negative for intraepithelial lesion or malignancy vs. atypical squamous cells of undetermined significance or worse), the crude agreement between the two readers was 93.3% (kappa = 0.82; 95% CI, 0.77-0.87). Once a consensus for discordant cases was reached, the best agreement was found for the negative for intraepithelial lesion or malignancy category (511 of 528 samples; 96.8%), whereas the atypical squamous cells of undetermined significance category showed the lowest agreement (90 of 117 samples, 76.9%). Considering the individual cytologic categories, overall agreement was 92.1% (kappa = 0.85; 95% CI, 0.81-0.89). The discordant interpretations were not associated with high-risk HPV infection, HPV16 infection, or MSM age.
    CONCLUSIONS: The results indicating excellent interobserver agreement in this study substantiate the use of anal cytology in the setting of human immunodeficiency virus-negative MSM.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号