dermatology resident

皮肤科住院医师
  • 文章类型: Journal Article
    背景:皮肤科患者教育材料(PEM)的书写水平通常高于全国平均水平的七至八年级阅读水平。ChatGPT-3.5,GPT-4,DermGPT,和DocsGPT是响应用户提示的大型语言模型(LLM)。我们的项目评估了它们在指定阅读水平下生成皮肤病学PEM的用途。
    目的:本研究旨在评估在未指定和指定的阅读水平下,选择LLM在常见和罕见皮肤病学中产生PEM的能力。Further,该研究旨在评估这些LLM生成的PEM的意义保存情况,由皮肤科住院医师评估。
    方法:当前美国皮肤病学会PEMs的Flesch-Kincaid阅读水平(FKRL)评估了4种常见(特应性皮炎,寻常痤疮,牛皮癣,和带状疱疹)和4例罕见(大疱性表皮松解症,大疱性类天疱疮,层状鱼鳞病,和扁平苔藓)皮肤病。我们提示ChatGPT-3.5,GPT-4,DermGPT,和DocsGPT以“在[FKRL]中创建关于[条件]的患者教育讲义”,以在未指定的五年级和七年级FKRL中每个条件迭代生成10个PEM,使用MicrosoftWord可读性统计进行评估。由2名皮肤科住院医师评估了LLM中意义的保留。
    结果:当前的美国皮肤病学会PEMs对常见和罕见疾病的平均(SD)FKRL为9.35(1.26)和9.50(2.3),分别。对于常见疾病,LLM生产的PEM的FKRL介于9.8和11.21之间(未指定提示),在4.22和7.43之间(五年级提示),在5.98和7.28之间(七年级提示)。对于罕见疾病,LLM生产的PEM的FKRL范围在9.85和11.45之间(未指定提示),在4.22和7.43之间(五年级提示),在5.98和7.28之间(七年级提示)。在五年级阅读水平,与ChatGPT-3.5相比,GPT-4在常见和罕见条件下都能更好地生产PEM(分别为P=.001和P=.01),DermGPT(分别为P<.001和P=.03),和DocsGPT(分别为P<.001和P=.02)。在七年级的阅读水平,ChatGPT-3.5、GPT-4、DocsGPT、或DermGPT在生产常见条件下的PEM(所有P>.05);然而,对于罕见的情况,ChatGPT-3.5和DocsGPT的表现优于GPT-4(分别为P=.003和P<.001)。意义分析的保留表明,对于共同条件,DermGPT在整体阅读便利性方面排名最高,患者的可理解性,和准确性(14.75/15,98%);对于罕见的情况,GPT-4产生的施舍排名最高(14.5/15,97%)。
    结论:GPT-4的表现似乎优于ChatGPT-3.5,DocsGPT,和DermGPT在五年级FKRL的常见和罕见的情况下,尽管ChatGPT-3.5和DocsGPT在7级FKRL中在罕见情况下的表现均优于GPT-4。LLM生产的PEM可以可靠地满足七级FKRL的选择常见和罕见的皮肤病,并且易于阅读,患者可以理解,而且大多是准确的。LLM可能在提高健康素养和传播无障碍方面发挥作用,在皮肤病学中可以理解的PEM。
    BACKGROUND: Dermatologic patient education materials (PEMs) are often written above the national average seventh- to eighth-grade reading level. ChatGPT-3.5, GPT-4, DermGPT, and DocsGPT are large language models (LLMs) that are responsive to user prompts. Our project assesses their use in generating dermatologic PEMs at specified reading levels.
    OBJECTIVE: This study aims to assess the ability of select LLMs to generate PEMs for common and rare dermatologic conditions at unspecified and specified reading levels. Further, the study aims to assess the preservation of meaning across such LLM-generated PEMs, as assessed by dermatology resident trainees.
    METHODS: The Flesch-Kincaid reading level (FKRL) of current American Academy of Dermatology PEMs was evaluated for 4 common (atopic dermatitis, acne vulgaris, psoriasis, and herpes zoster) and 4 rare (epidermolysis bullosa, bullous pemphigoid, lamellar ichthyosis, and lichen planus) dermatologic conditions. We prompted ChatGPT-3.5, GPT-4, DermGPT, and DocsGPT to \"Create a patient education handout about [condition] at a [FKRL]\" to iteratively generate 10 PEMs per condition at unspecified fifth- and seventh-grade FKRLs, evaluated with Microsoft Word readability statistics. The preservation of meaning across LLMs was assessed by 2 dermatology resident trainees.
    RESULTS: The current American Academy of Dermatology PEMs had an average (SD) FKRL of 9.35 (1.26) and 9.50 (2.3) for common and rare diseases, respectively. For common diseases, the FKRLs of LLM-produced PEMs ranged between 9.8 and 11.21 (unspecified prompt), between 4.22 and 7.43 (fifth-grade prompt), and between 5.98 and 7.28 (seventh-grade prompt). For rare diseases, the FKRLs of LLM-produced PEMs ranged between 9.85 and 11.45 (unspecified prompt), between 4.22 and 7.43 (fifth-grade prompt), and between 5.98 and 7.28 (seventh-grade prompt). At the fifth-grade reading level, GPT-4 was better at producing PEMs for both common and rare conditions than ChatGPT-3.5 (P=.001 and P=.01, respectively), DermGPT (P<.001 and P=.03, respectively), and DocsGPT (P<.001 and P=.02, respectively). At the seventh-grade reading level, no significant difference was found between ChatGPT-3.5, GPT-4, DocsGPT, or DermGPT in producing PEMs for common conditions (all P>.05); however, for rare conditions, ChatGPT-3.5 and DocsGPT outperformed GPT-4 (P=.003 and P<.001, respectively). The preservation of meaning analysis revealed that for common conditions, DermGPT ranked the highest for overall ease of reading, patient understandability, and accuracy (14.75/15, 98%); for rare conditions, handouts generated by GPT-4 ranked the highest (14.5/15, 97%).
    CONCLUSIONS: GPT-4 appeared to outperform ChatGPT-3.5, DocsGPT, and DermGPT at the fifth-grade FKRL for both common and rare conditions, although both ChatGPT-3.5 and DocsGPT performed better than GPT-4 at the seventh-grade FKRL for rare conditions. LLM-produced PEMs may reliably meet seventh-grade FKRLs for select common and rare dermatologic conditions and are easy to read, understandable for patients, and mostly accurate. LLMs may play a role in enhancing health literacy and disseminating accessible, understandable PEMs in dermatology.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在2021年3月至8月之间进行了一项横断面研究;在西部的沙特委员会皮肤科培训计划的所有中心对所有皮肤科居民(n=79)进行了电子调查,东部和南部地区。参与是自愿的,并在研究前获得书面知情同意书.这项调查由70名皮肤科居民完成,88.61%的反应率。所有的参与者都知道研究的目的,并获得了他们的知情同意。研究设计符合赫尔辛基宣言的道德标准,并得到沙特阿拉伯伊玛目穆罕默德·伊本·沙特伊斯兰大学机构审查委员会的批准。共有70名受访者填写了问卷。47.1%的受访者存在高度情绪耗竭(EE),65.7%的人个人成就(PA)较低,在皮肤科居民的所有倦怠维度中,高度人格解体(DP)是最不普遍的(24.3%)。21.4%的皮肤科居民存在整体倦怠。多变量分析表明,男性发生EE的几率明显低于女性[比值比(OR)=0.2,P=0.016],每个诊所就诊的患者数量较高(OR=1.09,P=0.032)与具有较高的EE风险的几率有关。而对工作-生活平衡的满意度较高与EE高风险的几率较低相关(OR=0.47,P=0.005).较高的研究小时数/周与较低的PA几率相关(OR=0.95,P=0.02)。同样,职业满意度越高,PA低的几率越低(OR=0.35,P=0.042).多因素分析显示,只有运动(OR=0.21,P=0.05)和工作生活平衡满意度(OR=0.42,P=0.008)与较低的职业倦怠风险相关。我们的研究增加了沙特阿拉伯皮肤科居民的倦怠率,除了可以预测倦怠的可能风险因素,到文学。这些发现可用于改善培训计划并降低居民的倦怠率。
    A cross-sectional study was conducted between March and August 2021; an electronic survey was administered to all dermatology residents (n=79) in all centers with the Saudi Board Dermatology Training Program in the western, eastern and southern regions. Participation was voluntary, and written informed consent was obtained before the study. The survey was completed by 70 dermatology residents, for an 88.61% response rate. All the participants were aware of the study\'s aims, and their informed consent was obtained. The study design complied with the Declaration of Helsinki ethical standards and was approved by the Institutional Review Board at Imam Muhammad Ibn Saud Islamic University in Saudi Arabia. A total of 70 respondents completed the questionnaire. High emotional exhaustion (EE) was present in 47.1% of respondents, low personal accomplishment (PA) was present in 65.7%, and high depersonalization (DP) was the least prevalent (24.3%) across all burnout dimensions of dermatology residents. Overall burnout was present in 21.4% of the dermatology residents. Multivariate analysis showed that the odds of EE were significantly lower in males than females [odds ratio (OR)=0.2, P=0.016] and the higher number of patients seen per clinic (OR=1.09, P=0.032) was associated with higher odds of having a high risk of EE, while higher satisfaction with work-life balance was associated with lower odds of a high risk of EE (OR=0.47, P=0.005). A higher number of study hours/week was associated with lower odds of low PA (OR=0.95, P=0.02). Similarly, higher satisfaction with career was associated with lower odds of low PA (OR=0.35, P=0.042). Multivariate analysis showed that only exercise (OR=0.21, P=0.05) and satisfaction with work-life balance (OR=0.42, P=0.008) were associated with a lower risk of burnout. Our study adds the burnout rates among dermatology residents in Saudi Arabia, in addition to possible risk factors that can predict burnout, to the literature. These findings can be applied to improve training programs and reduce the burnout rate among residents.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号