METHODS: 49 clinical cases of pediatric sialadenitis were retrospectively reviewed. ChatGPT was given patient data, and it offered differential diagnoses, proposed further tests, and suggested treatments. The decisions made by the treating otolaryngologists were contrasted with the answers provided by ChatGPT. Analysis was done on ChatGPT response consistency and interrater reliability.
RESULTS: ChatGPT showed 78.57% accuracy in primary diagnosis, and 17.35% of cases were considered likely. On the other hand, otolaryngologists recommended fewer further examinations than ChatGPT (111 vs. 60, p < 0.001). For additional exams, poor agreement was found between ChatGPT and otolaryngologists. Only 28.57% of cases received a pertinent and essential treatment plan via ChatGPT, indicating that the platform\'s treatment recommendations were frequently lacking. For treatment ratings, judges\' interrater reliability was greatest (Kendall\'s tau = 0.824, p < 0.001). For the most part, ChatGPT\'s response constancy was high.
CONCLUSIONS: Although ChatGPT has the potential to correctly diagnose pediatric sialadenitis, there are a number of noteworthy limitations with regard to its ability to suggest further testing and treatment regimens. Before widespread clinical use, more research and confirmation are required. To guarantee that chatbots are utilized properly and effectively to supplement human expertise rather than to replace it, a critical viewpoint is required.
方法:对49例小儿涎腺炎的临床资料进行回顾性分析。ChatGPT被给予患者数据,它提供了鉴别诊断,提出了进一步的测试,并建议治疗。治疗的耳鼻喉科医生做出的决定与ChatGPT提供的答案进行了对比。对ChatGPT响应一致性和评分者间可靠性进行了分析。
结果:ChatGPT在主要诊断中显示78.57%的准确率,17.35%的病例被认为是可能的。另一方面,耳鼻喉科医师推荐的进一步检查比ChatGPT少(111vs.60,p<0.001)。对于额外的考试,ChatGPT和耳鼻喉科医师之间的一致性较差。只有28.57%的病例通过ChatGPT接受了相关和必要的治疗计划,这表明该平台的治疗建议经常缺乏。对于治疗评级,法官之间的可靠性最高(肯德尔的tau=0.824,p<0.001)。在大多数情况下,ChatGPT的反应恒定性很高。
结论:尽管ChatGPT有可能正确诊断小儿涎腺炎,关于其建议进一步检测和治疗方案的能力,存在许多值得注意的局限性.在广泛临床使用之前,需要更多的研究和确认。为了保证聊天机器人得到适当和有效的利用,以补充人类的专业知识,而不是取代它。需要一个批判性的观点。