%0 Journal Article
%T A content-aware chatbot based on GPT 4 provides trustworthy recommendations for Cone-Beam CT guidelines in dental imaging.
%A Russe MF
%A Rau A
%A Ermer MA
%A Rothweiler R
%A Wenger S
%A Klöble K
%A Schulze RKW
%A Bamberg F
%A Schmelzeisen R
%A Reisert M
%A Semper-Hogg W
%J Dentomaxillofac Radiol
%V 53
%N 2
%D 2024 Feb 8
%M 38180877
%F 3.525
%R 10.1093/dmfr/twad015
%X <B>OBJECTIVE: </B>To develop a content-aware chatbot based on GPT-3.5-Turbo and GPT-4 with specialized knowledge on the German S2 Cone-Beam CT (CBCT) dental imaging guideline and to compare the performance against humans.<BR><B>METHODS: </B>The LlamaIndex software library was used to integrate the guideline context into the chatbots. Based on the CBCT S2 guideline, 40 questions were posed to content-aware chatbots and early career and senior practitioners with different levels of experience served as reference. The chatbots' performance was compared in terms of recommendation accuracy and explanation quality. Chi-square test and one-tailed Wilcoxon signed rank test evaluated accuracy and explanation quality, respectively.<BR><B>RESULTS: </B>The GPT-4 based chatbot provided 100% correct recommendations and superior explanation quality compared to the one based on GPT3.5-Turbo (87.5% vs. 57.5% for GPT-3.5-Turbo; P = .003). Moreover, it outperformed early career practitioners in correct answers (P = .002 and P = .032) and earned higher trust than the chatbot using GPT-3.5-Turbo (P = 0.006).<BR><B>CONCLUSIONS: </B>A content-aware chatbot using GPT-4 reliably provided recommendations according to current consensus guidelines. The responses were deemed trustworthy and transparent, and therefore facilitate the integration of artificial intelligence into clinical decision-making.