目标:评估人工智能(AI)语言模型(ChatGPT-4、BARD、MicrosoftCopilot)简化放射学报告,评估可读性,可理解性,可操作性,紧急分类。
方法:本研究评估了这些AI模型在将放射学报告翻译成患者友好语言并提供可理解和可操作的建议和紧迫性分类方面的有效性。使用人工智能工具处理了30份放射学报告,并评估其输出的可读性(FleschReadingEase,Flesch-Kincaid等级),可理解性(PEMAT),和紧迫性分类的准确性。进行方差分析和卡方检验以比较模型的性能。
结果:所有三种AI模型都成功地将医学术语转化为更易于理解的语言。与BARD显示优越的可读性得分。在可理解性方面,所有模型的得分都在70%以上,ChatGPT-4和BARD领先(p<0.001,两者)。然而,人工智能模型在紧迫性建议的准确性方面各不相同,差异无统计学意义(p=0.284)。
结论:AI语言模型已被证明在简化放射学报告方面是有效的,从而潜在地提高患者对健康决策的理解和参与度。然而,他们根据放射学报告评估医疗状况的紧迫性的准确性表明需要进一步完善.
结论:将AI纳入放射学交流可以赋予患者权力,但是进一步的发展对于全面和可行的患者支持至关重要。
OBJECTIVE: Evaluate Artificial Intelligence (AI)
language models (ChatGPT-4, BARD, Microsoft Copilot) in simplifying radiology
reports, assessing readability, understandability, actionability, and urgency classification.
METHODS: This study evaluated the effectiveness of these AI models in translating radiology reports into patient-friendly
language and providing understandable and actionable suggestions and urgency classifications. Thirty radiology reports were processed using AI tools, and their outputs were assessed for readability (Flesch Reading Ease, Flesch-Kincaid Grade Level), understandability (PEMAT), and the accuracy of urgency classification. ANOVA and Chi-Square tests were performed to compare the models\' performances.
RESULTS: All three AI models successfully transformed medical jargon into more accessible
language, with BARD showing superior readability scores. In terms of understandability, all models achieved scores above 70%, with ChatGPT-4 and BARD leading (p < 0.001, both). However, the AI models varied in accuracy of urgency recommendations, with no significant statistical difference (p = 0.284).
CONCLUSIONS: AI
language models have proven effective in simplifying radiology
reports, thereby potentially improving patient comprehension and engagement in their health decisions. However, their accuracy in assessing the urgency of medical conditions based on radiology reports suggests a need for further refinement.
CONCLUSIONS: Incorporating AI in radiology communication can empower patients, but further development is crucial for comprehensive and actionable patient support.