关键词: Accuracy Artificial intelligence ChatGPT Optic disc drusen Patient information

来  源:   DOI:10.1007/s40123-023-00800-2   PDF(Pubmed)

Abstract:
BACKGROUND: Optic disc drusen (ODD) are acellular deposits in the optic nerve head, which are most often benign and asymptomatic. Patients may develop visual field defects and be at increased risk of ischemic co-morbidities. As ODD can be difficult to distinguish from papilledema, patients are at risk of unnecessary clinical workups. Patient information is a key aspect of ODD management. In this study, we explored the accuracy of ChatGPT responses for typical patient questions on ODD.
METHODS: Two content experts reached consensus on 20 typical patient questions. We retrieved five separate responses for each question from ChatGPT, totaling 100 responses. Each response was evaluated on a 5-point Likert-scale on accuracy by each content expert in an individual fashion.
RESULTS: The two experts were in fair/substantial agreement in the evaluation of responses (Cronbach\'s alpha: 0.64). Of the 100 responses, 17 were relevant and without any inaccuracies, 78 were relevant and with inaccuracies not being harmful, and five were relevant and with inaccuracies potentially harmful. The lowest accuracy scores were obtained for questions dealing with treatment and prognosis.
CONCLUSIONS: ChatGPT often provides relevant answers for patient questions on ODD, but inaccuracies become potentially harmful when questions deal with treatment and prognosis.
摘要:
背景:视盘玻璃疣(ODD)是视神经乳头中的无细胞沉积物,最常见的是良性和无症状。患者可能会出现视野缺损,并且缺血性并发症的风险增加。因为ODD很难与乳头水肿区分开来,患者有不必要的临床检查风险.患者信息是ODD管理的关键方面。在这项研究中,我们探讨了ChatGPT回答ODD典型患者问题的准确性.
方法:两位内容专家就20个典型患者问题达成共识。我们从ChatGPT为每个问题检索了五个单独的回答,共100个回复。每个内容专家以个人方式在5点Likert量表上评估每个响应的准确性。
结果:两位专家在评估回答时达成了相当/实质性的一致(Cronbach的alpha:0.64)。在100个回答中,17个是相关的,没有任何不准确之处,78个是相关的,不准确的是无害的,五个是相关的,并且不准确,可能有害。对于处理治疗和预后的问题,获得了最低的准确性得分。
结论:ChatGPT通常为患者关于ODD的问题提供相关答案,但是当问题涉及治疗和预后时,不准确会变得潜在有害。
公众号