BACKGROUND: ChatGPT is a free artificial intelligence (AI) language model developed and released by OpenAI in late 2022. This study aimed to evaluate how accurately ChatGPT answers the clinical questions (CQs) in the Guideline for the Management of Blepharoptosis, published by the American Society of Plastic Surgeons (ASPS) in 2022.
METHODS: The CQs in the guideline were used as question sources in both English and Japanese. For each question, ChatGPT's output was assessed on five components: the answer to the CQ, evidence quality, recommendation strength, reference match, and answer word count. We compared ChatGPT's performance on each component between English and Japanese queries.
RESULTS: A total of 11 questions were included in the final analysis, and ChatGPT answered 61.3% of these correctly. Compared with its Japanese answers, ChatGPT's English answers had a higher accuracy rate (76.4% versus 46.4%; p = 0.004) and a higher word count (123 words versus 35.9 words; p = 0.004). No statistically significant differences were noted for evidence quality, recommendation strength, or reference match. ChatGPT proposed a total of 697 references, but only 216 of them (31.0%) existed.
CONCLUSIONS: ChatGPT demonstrates potential as an adjunctive tool in the management of blepharoptosis. However, it is crucial to recognize that the existing AI model has distinct limitations, and its primary role should be to complement the expertise of medical professionals.
LEVEL OF EVIDENCE: Observational study under respected authorities. This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors www.springer.com/00266.