聊天机器人可以在临床实践中加强小儿唾液腺炎的管理吗？Can chatbots enhance the management of pediatric sialadenitis in clinical practice?-医云文献数字医云科研云海量医学决策数据服务

Abstract：

OBJECTIVE: The purpose of this study was to assess how well ChatGPT, an AI-powered chatbot, performed in helping to manage pediatric sialadenitis and identify when sialendoscopy was necessary.
METHODS: 49 clinical cases of pediatric sialadenitis were retrospectively reviewed. ChatGPT was given patient data, and it offered differential diagnoses, proposed further tests, and suggested treatments. The decisions made by the treating otolaryngologists were contrasted with the answers provided by ChatGPT. Analysis was done on ChatGPT response consistency and interrater reliability.
RESULTS: ChatGPT showed 78.57% accuracy in primary diagnosis, and 17.35% of cases were considered likely. On the other hand, otolaryngologists recommended fewer further examinations than ChatGPT (111 vs. 60, p < 0.001). For additional exams, poor agreement was found between ChatGPT and otolaryngologists. Only 28.57% of cases received a pertinent and essential treatment plan via ChatGPT, indicating that the platform\'s treatment recommendations were frequently lacking. For treatment ratings, judges\' interrater reliability was greatest (Kendall\'s tau = 0.824, p < 0.001). For the most part, ChatGPT\'s response constancy was high.
CONCLUSIONS: Although ChatGPT has the potential to correctly diagnose pediatric sialadenitis, there are a number of noteworthy limitations with regard to its ability to suggest further testing and treatment regimens. Before widespread clinical use, more research and confirmation are required. To guarantee that chatbots are utilized properly and effectively to supplement human expertise rather than to replace it, a critical viewpoint is required.

摘要：

目的：本研究的目的是评估ChatGPT的水平，一个人工智能驱动的聊天机器人，在帮助治疗小儿唾液腺炎和确定何时需要进行鼻内镜检查时进行。
方法:对49例小儿涎腺炎的临床资料进行回顾性分析。ChatGPT被给予患者数据，它提供了鉴别诊断，提出了进一步的测试，并建议治疗。治疗的耳鼻喉科医生做出的决定与ChatGPT提供的答案进行了对比。对ChatGPT响应一致性和评分者间可靠性进行了分析。
结果：ChatGPT在主要诊断中显示78.57％的准确率，17.35%的病例被认为是可能的。另一方面,耳鼻喉科医师推荐的进一步检查比ChatGPT少（111vs.60，p<0.001)。对于额外的考试，ChatGPT和耳鼻喉科医师之间的一致性较差。只有28.57％的病例通过ChatGPT接受了相关和必要的治疗计划，这表明该平台的治疗建议经常缺乏。对于治疗评级，法官之间的可靠性最高(肯德尔的tau=0.824，p<0.001)。在大多数情况下,ChatGPT的反应恒定性很高。
结论：尽管ChatGPT有可能正确诊断小儿涎腺炎，关于其建议进一步检测和治疗方案的能力,存在许多值得注意的局限性.在广泛临床使用之前，需要更多的研究和确认。为了保证聊天机器人得到适当和有效的利用，以补充人类的专业知识，而不是取代它。需要一个批判性的观点。