创成式 AI generative AI-医云文献数字医云科研云海量医学决策数据服务

generative AI 关注

创成式 AI

文献(119篇)

百科

视频

1 Assessing Laterality Errors in Radiology: Comparing Generative AI and Natural Language Processing.

放射学中的侧向错误评估：比较生成 AI 和自然语言处理。影响指数 : 6.24
发表时间：Jul 2024 1
来源期刊：J Am Coll Radiol PMID：38960083

DOI：10.1016/j.jacr.2024.06.014
文章类型： Journal Article

目标：我们比较了生成AI的性能(G-AI，ATARI）和自然语言处理（NLP）工具，用于识别放射学报告和图像中的侧向错误。
方法：我们使用基于NLP(mPower)的工具来识别在其QA仪表板中标记为侧向错误的放射学报告。NLP模型检测并突出显示放射学报告中的侧向性不匹配。从NLP标记的侧向错误的1124份放射学报告的初始池中，我们选择并评估了898份包含射线照相术的报告，CT,MRI,和超声模式，以确保全面覆盖。放射科医师审查了每个放射学报告，以评估是否存在标记的侧向错误（报告错误-真阳性）或不存在（NLP错误-假阳性）。接下来,我们将ATARI应用于237例连续NLP真阳性(118例)和假阳性(119例)侧向错误的放射学报告和图像.我们估计了NLP和G-AI工具的准确性，以识别整体和模态侧向误差。
结果：在898个NLP标记的侧向错误中，64%（574/898）有NLP错误，36%（324/898）报告错误。文本查询ATARI功能以97.4％的准确率（115/118报告；95％CI=96.5％-98.3％）正确识别不存在侧向性不匹配（NLP假阳性）。组合视觉和文本查询导致98.3%的准确率(116/118报告/图像;95%CI=97.6%-99.0%)单独查询具有98.3%的准确率(116/118图像;95%CI=97.6%-99.0%)。
结论：生成AI授权的ATARI原型优于评估的NLP工具，用于确定放射学报告中的真实和虚假侧向错误，同时实现基于图像的侧向性确定。复杂放射学报告中ATARI文本查询的潜在错误强调了进一步改进技术的必要性。
OBJECTIVE: We compared the performance of generative AI (G-AI, ATARI) and natural language processing (NLP) tools for identifying laterality errors in radiology reports and images.
METHODS: We used an NLP-based (mPower) tool to identify radiology reports flagged for laterality errors in its QA Dashboard. The NLP model detects and highlights laterality mismatches in radiology reports. From an initial pool of 1124 radiology reports flagged by the NLP for laterality errors, we selected and evaluated 898 reports that encompassed radiography, CT, MRI, and ultrasound modalities to ensure comprehensive coverage. A radiologist reviewed each radiology report to assess if the flagged laterality errors were present (reporting error - true positive) or absent (NLP error - false positive). Next, we applied ATARI to 237 radiology reports and images with consecutive NLP true positive (118 reports) and false positive (119 reports) laterality errors. We estimated accuracy of NLP and G-AI tools to identify overall and modality-wise laterality errors.
RESULTS: Among the 898 NLP-flagged laterality errors, 64% (574/898) had NLP errors and 36% (324/898) were reporting errors. The text query ATARI feature correctly identified the absence of laterality mismatch (NLP false positives) with a 97.4% accuracy (115/118 reports; 95% CI = 96.5% - 98.3%). Combined Vision and text query resulted in 98.3% accuracy (116/118 reports/images; 95% CI = 97.6% - 99.0%) query alone had a 98.3% accuracy (116/118 images; 95% CI = 97.6% - 99.0%).
CONCLUSIONS: The generative AI-empowered ATARI prototype outperformed the assessed NLP tool for determining true and false laterality errors in radiology reports while enabling an image-based laterality determination. Underlying errors in ATARI text query in complex radiology reports emphasize the need for further improvement in the technology.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
2 The Role of Humanization and Robustness of Large Language Models in Conversational Artificial Intelligence for Individuals With Depression: A Critical Analysis.

大型语言模型的人性化和鲁棒性在抑郁症个体的会话人工智能中的作用：批判性分析。影响指数 : 6.332
发表时间：Jul 2024 2
来源期刊：JMIR Ment Health PMID：38958218

DOI：10.2196/56569
文章类型： Journal Article

■大型语言模型（LLM）支持的服务由于在许多任务中的出色性能而在各种应用程序中越来越受欢迎，如情绪分析和回答问题。最近,研究一直在探索它们在数字健康环境中的潜在用途，特别是在心理健康领域。然而,实施LLM增强的会话人工智能(CAI)提出了重要的道德，技术,和临床挑战。在这篇观点论文中，我们讨论了2个挑战，这些挑战会影响LLM增强的CAI对于有心理健康问题的个人的使用，专注于抑郁症患者的用例：将LLM增强的CAI人性化的趋势以及他们缺乏情境化的鲁棒性。我们的方法是跨学科的，依靠哲学的考虑，心理学,和计算机科学。我们认为，LLM增强的CAI的人性化取决于对使用LLM模拟“类似人类”特征的含义的反映，以及这些系统在与人类的互动中应该扮演什么角色。Further,确保LLM稳健性的情境化需要考虑抑郁症患者语言产生的特殊性，以及它随时间的演变。最后,我们提供了一系列建议,以促进负责任的设计和部署LLM增强的CAI,为抑郁症患者提供治疗支持.
UNASSIGNED: Large language model (LLM)-powered services are gaining popularity in various applications due to their exceptional performance in many tasks, such as sentiment analysis and answering questions. Recently, research has been exploring their potential use in digital health contexts, particularly in the mental health domain. However, implementing LLM-enhanced conversational artificial intelligence (CAI) presents significant ethical, technical, and clinical challenges. In this viewpoint paper, we discuss 2 challenges that affect the use of LLM-enhanced CAI for individuals with mental health issues, focusing on the use case of patients with depression: the tendency to humanize LLM-enhanced CAI and their lack of contextualized robustness. Our approach is interdisciplinary, relying on considerations from philosophy, psychology, and computer science. We argue that the humanization of LLM-enhanced CAI hinges on the reflection of what it means to simulate \"human-like\" features with LLMs and what role these systems should play in interactions with humans. Further, ensuring the contextualization of the robustness of LLMs requires considering the specificities of language production in individuals with depression, as well as its evolution over time. Finally, we provide a series of recommendations to foster the responsible design and deployment of LLM-enhanced CAI for the therapeutic support of individuals with depression.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
3 Enhancing Early Lung Cancer Diagnosis: Predicting Lung Nodule Progression in Follow-Up Low-Dose CT Scan with Deep Generative Model.

增强早期肺癌诊断：用深度生成模型在后续低剂量 CT 扫描中预测肺结节进展。影响指数 : 6.575
发表时间：Jun 2024 15
来源期刊：Cancers (Basel) PMID：38927934

DOI：10.3390/cancers16122229
文章类型： Journal Article

肺癌的早期诊断可以显着改善患者的预后。我们开发了基于Wasserstein生成对抗网络框架（GP-WGAN）的增长预测模型，以预测后续LDCT扫描中的结节生长模式。GP-WGAN使用包含约1年间隔的1121对结节图像的训练集（N=776）进行训练，并在基线LDCT扫描中部署到450个结节的独立测试集以预测结节图像（GP结节）在他们的1年随访扫描中。最后通过肺癌风险预测（LCRP）模型将450个GP结节分为恶性或良性。达到0.827±0.028的测试AUC，这与通过对真实随访结节图像进行分类的相同LCRP模型获得的0.862±0.028的AUC相当（p=0.071）。净重新分类指数产生了一致的结果(NRI=0.04；p=0.62)。其他基线方法，包括Lung-RADS和Brock模型,取得了显著较低的性能(p<0.05)。结果表明，我们的GP-WGAN模型预测的GP结节在肺癌诊断的真实随访扫描中实现了与结节相当的性能，与目前的等待下一次筛查的方法相比，与加速的临床管理相结合，表明更早发现肺癌的潜力。
Early diagnosis of lung cancer can significantly improve patient outcomes. We developed a Growth Predictive model based on the Wasserstein Generative Adversarial Network framework (GP-WGAN) to predict the nodule growth patterns in the follow-up LDCT scans. The GP-WGAN was trained with a training set (N = 776) containing 1121 pairs of nodule images with about 1-year intervals and deployed to an independent test set of 450 nodules on baseline LDCT scans to predict nodule images (GP-nodules) in their 1-year follow-up scans. The 450 GP-nodules were finally classified as malignant or benign by a lung cancer risk prediction (LCRP) model, achieving a test AUC of 0.827 ± 0.028, which was comparable to the AUC of 0.862 ± 0.028 achieved by the same LCRP model classifying real follow-up nodule images (p = 0.071). The net reclassification index yielded consistent outcomes (NRI = 0.04; p = 0.62). Other baseline methods, including Lung-RADS and the Brock model, achieved significantly lower performance (p < 0.05). The results demonstrated that the GP-nodules predicted by our GP-WGAN model achieved comparable performance with the nodules in the real follow-up scans for lung cancer diagnosis, indicating the potential to detect lung cancer earlier when coupled with accelerated clinical management versus the current approach of waiting until the next screening exam.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
4 ChatGPT in veterinary medicine: a practical guidance of generative artificial intelligence in clinics, education, and research.

兽医学中的 ChatGPT ：诊所生成人工智能的实践指导，教育,和研究。影响指数 : 3.471
发表时间：2024
来源期刊：Front Vet Sci PMID：38911678

DOI：10.3389/fvets.2024.1395934
文章类型： Journal Article

ChatGPT,最易于访问的生成人工智能（AI）工具，为兽医学提供了相当大的潜力，然而，缺乏对其具体应用的专门审查。本文简要地综合了ChatGPT在临床上的最新研究和实际应用,教育,和兽医学的研究领域。它旨在提供具体的指导和可操作的示例，说明如何在没有编程背景的情况下由兽医专业人员直接使用生成AI。对于从业者来说，ChatGPT可以提取患者数据，生成进度注释,并可能有助于诊断复杂病例。兽医教育工作者可以创建自定义GPT，以支持学生，而学生可以利用ChatGPT进行考试准备。ChatGPT可以帮助研究中的学术写作任务，但是兽医出版商已经为作者设定了特定的要求。尽管它具有变革性的潜力，小心使用是必不可少的，以避免像幻觉的陷阱。这篇评论涉及道德考虑，提供学习资源，并提供切实的例子来指导负责任的执行。提供了一份关键要点表，以总结这篇综述。通过强调潜在的好处和局限性，这篇评论装备了兽医，教育工作者,和研究人员有效利用ChatGPT的力量。
ChatGPT, the most accessible generative artificial intelligence (AI) tool, offers considerable potential for veterinary medicine, yet a dedicated review of its specific applications is lacking. This review concisely synthesizes the latest research and practical applications of ChatGPT within the clinical, educational, and research domains of veterinary medicine. It intends to provide specific guidance and actionable examples of how generative AI can be directly utilized by veterinary professionals without a programming background. For practitioners, ChatGPT can extract patient data, generate progress notes, and potentially assist in diagnosing complex cases. Veterinary educators can create custom GPTs for student support, while students can utilize ChatGPT for exam preparation. ChatGPT can aid in academic writing tasks in research, but veterinary publishers have set specific requirements for authors to follow. Despite its transformative potential, careful use is essential to avoid pitfalls like hallucination. This review addresses ethical considerations, provides learning resources, and offers tangible examples to guide responsible implementation. A table of key takeaways was provided to summarize this review. By highlighting potential benefits and limitations, this review equips veterinarians, educators, and researchers to harness the power of ChatGPT effectively.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
5 Generative AI for precision neuroimaging biomarker development in psychiatry.

用于精神病学中精确神经成像生物标志物开发的生成 AI 。影响指数 : 11.225
发表时间：May 2024 20
来源期刊：Psychiatry Res PMID：38909415

DOI：10.1016/j.psychres.2024.115955
文章类型： Journal Article

生成AI的爆发为精神病学中神经成像生物标志物的开发提供了希望，但有效采用人工智能方法需要明确具体的应用和挑战。这些集中在强大训练AI模型所需的数据集大小以及捕获与症状和治疗目标相关的神经信号的特征选择上。在这里，我们讨论了生成AI可以改善健壮和可重复的大脑到症状关联的量化的领域，以告知精确的精神病学应用。特别是在药物发现的背景下。最后,本通讯讨论了生成AI模型需要解决方案的一些挑战，以推进精神病学中的神经影像学生物标志物。
The explosion of generative AI offers promise for neuroimaging biomarker development in psychiatry, but effective adoption of AI methods requires clarity with respect to specific applications and challenges. These center on dataset sizes required to robustly train AI models along with feature selection that capture neural signals relevant to symptom and treatment targets. Here we discuss areas where generative AI could improve quantification of robust and reproducible brain-to-symptom associations to inform precision psychiatry applications, especially in the context of drug discovery. Finally, this communication discusses some challenges that need solutions for generative AI models to advance neuroimaging biomarkers in psychiatry.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
6 Performance of Artificial Intelligence Content Detectors Using Human and Artificial Intelligence-Generated Scientific Writing.

使用人类和人工智能生成的科学写作的人工智能内容检测器的性能。影响指数 : 4.339
发表时间：Jun 2024 22
来源期刊：Ann Surg Oncol PMID：38909113

DOI：10.1245/s10434-024-15549-6
文章类型： Journal Article

背景：很少有研究检查了科学写作中人工智能（AI）内容检测的性能。这项研究评估了公开可用的AI内容检测器在应用于人类撰写和AI生成的科学文章时的性能。
方法：2022年发表在《外科肿瘤学年鉴》（ASO）上的文章，以及使用OpenAI的ChatGPT生成的AI文章，由三个人工智能内容检测器进行分析，以评估人工智能生成内容的概率。对完整的手稿及其各个部分进行了评估。使用ANOVA和线性回归进行组比较和趋势分析。使用曲线下面积(AUC)确定分类性能。
结果：总共449篇原始文章符合纳入标准，并进行了评估以确定AI产生的可能性。每个检测器还使用ASO文章的标题评估了47篇AI生成的文章。人类撰写的文章产生AI的平均概率为9.4％，检测器之间存在显着差异。仅检测到两个（0.4％）人类撰写的手稿，所有三个检测器都有0％的可能性是AI生成的。完全AI生成的文章被评估为具有更高的AI生成的平均概率(43.5%),范围为12.0%至99.9%。
结论：这项研究证明了各种AI含量检测器的性能差异，这些检测器具有将人类撰写的文章标记为AI生成的潜力。实施AI检测器的任何努力都必须包括随着AI模型和检测器的快速发展而进行持续评估和验证的策略。
BACKGROUND: Few studies have examined the performance of artificial intelligence (AI) content detection in scientific writing. This study evaluates the performance of publicly available AI content detectors when applied to both human-written and AI-generated scientific articles.
METHODS: Articles published in Annals of Surgical Oncology (ASO) during the year 2022, as well as AI-generated articles using OpenAI\'s ChatGPT, were analyzed by three AI content detectors to assess the probability of AI-generated content. Full manuscripts and their individual sections were evaluated. Group comparisons and trend analyses were conducted by using ANOVA and linear regression. Classification performance was determined using area under the curve (AUC).
RESULTS: A total of 449 original articles met inclusion criteria and were evaluated to determine the likelihood of being generated by AI. Each detector also evaluated 47 AI-generated articles by using titles from ASO articles. Human-written articles had an average probability of being AI-generated of 9.4% with significant differences between the detectors. Only two (0.4%) human-written manuscripts were detected as having a 0% probability of being AI-generated by all three detectors. Completely AI-generated articles were evaluated to have a higher average probability of being AI-generated (43.5%) with a range from 12.0 to 99.9%.
CONCLUSIONS: This study demonstrates differences in the performance of various AI content detectors with the potential to label human-written articles as AI-generated. Any effort toward implementing AI detectors must include a strategy for continuous evaluation and validation as AI models and detectors rapidly evolve.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
7 Exploring ChatGPT's potential in the clinical stream of neurorehabilitation.

探索 ChatGPT 在临床神经康复中的潜力。影响指数 : 暂无
发表时间：2024
来源期刊：Front Artif Intell PMID：38903157

DOI：10.3389/frai.2024.1407905
文章类型： Journal Article

在几个医学领域，诸如ChatGPT之类的生成AI工具仅通过评估病例的叙述性临床描述，就可以在识别正确诊断方面实现最佳性能。最活跃的应用领域包括肿瘤学和COVID-19相关症状，在精神病学和神经学领域也有初步的相关结果。这篇范围综述旨在介绍ChatGPT在神经康复实践中的应用，这种人工智能驱动的解决方案有可能彻底改变患者护理和援助。首先,对ChatGPT的全面概述，包括它的设计，并提供了在医学上的潜在应用。第二,研究了这些模型的显着自然语言处理技能和局限性，重点是它们在神经康复中的应用。在这种情况下，我们提出了两种情况来评估ChatGPT解决高阶临床推理的能力。总的来说,我们为第一个证据提供支持，证明生成AI可以作为促进者有意义地融入神经康复实践，帮助医生定义越来越有效的诊断和个性化的预后计划。
In several medical fields, generative AI tools such as ChatGPT have achieved optimal performance in identifying correct diagnoses only by evaluating narrative clinical descriptions of cases. The most active fields of application include oncology and COVID-19-related symptoms, with preliminary relevant results also in psychiatric and neurological domains. This scoping review aims to introduce the arrival of ChatGPT applications in neurorehabilitation practice, where such AI-driven solutions have the potential to revolutionize patient care and assistance. First, a comprehensive overview of ChatGPT, including its design, and potential applications in medicine is provided. Second, the remarkable natural language processing skills and limitations of these models are examined with a focus on their use in neurorehabilitation. In this context, we present two case scenarios to evaluate ChatGPT ability to resolve higher-order clinical reasoning. Overall, we provide support to the first evidence that generative AI can meaningfully integrate as a facilitator into neurorehabilitation practice, aiding physicians in defining increasingly efficacious diagnostic and personalized prognostic plans.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
8 Large-scale foundation models and generative AI for BigData neuroscience.

大数据神经科学的大规模基础模型和生成 AI 。影响指数 : 2.904
发表时间：Jun 2024 17
来源期刊：Neurosci Res PMID：38897235

DOI：10.1016/j.neures.2024.06.003
文章类型： Journal Article

机器学习的最新进展导致了计算机游戏的革命性突破，图像和自然语言理解，和科学发现。由于BigData，基础模型和大规模语言模型（LLM）最近实现了类似人类的智能。在自我监督学习（SSL）和迁移学习的帮助下，这些模型可能会重塑神经科学研究的格局，并对未来产生重大影响。在这里，我们对基础模型和生成AI模型的最新进展以及它们在神经科学中的应用进行了简短的回顾。包括自然语言和语音，语义记忆，脑机接口（BMI），和数据增强。我们认为，这种范式转变框架将为许多神经科学研究方向开辟新的途径，并讨论随之而来的挑战和机遇。
Recent advances in machine learning have led to revolutionary breakthroughs in computer games, image and natural language understanding, and scientific discovery. Foundation models and large-scale language models (LLMs) have recently achieved human-like intelligence thanks to BigData. With the help of self-supervised learning (SSL) and transfer learning, these models may potentially reshape the landscapes of neuroscience research and make a significant impact on the future. Here we present a mini-review on recent advances in foundation models and generative AI models as well as their applications in neuroscience, including natural language and speech, semantic memory, brain-machine interfaces (BMIs), and data augmentation. We argue that this paradigm-shift framework will open new avenues for many neuroscience research directions and discuss the accompanying challenges and opportunities.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
9 The Machine Speaks: Conversational AI and the Importance of Effort to Relationships of Meaning.

机器说话：对话 AI 和努力对意义关系的重要性。影响指数 : 6.332
发表时间：Jun 2024 18
来源期刊：JMIR Ment Health PMID：38889401

DOI：10.2196/53203
文章类型： Journal Article

关于对话式人工智能（CAI）的辩论的焦点主要集中在我们与机器交谈时出现的社会和道德问题上，当我们取代人类对话者时，获得了什么，失去了什么。包括我们的人类治疗师，与AI在这个观点中,相反，我们专注于一种独特且不断增长的现象：让机器为我们说话。当我们用CAI代替我们自己在人际交往方面的努力时，什么是危险的？这些技术的目的是，在某种程度上,为了消除努力，但是努力有巨大的价值，在某些情况下,甚至内在价值。在许多领域都是如此，尤其是人际关系。为某人努力，不管这种努力是什么，它本身往往传递着价值和意义。我们详细说明其含义，worth,当我们放弃在人际交往中的努力以及我们可能放弃的自我理解和成长的机会时，可能会失去意义。
The focus of debates about conversational artificial intelligence (CAI) has largely been on social and ethical concerns that arise when we speak to machines-what is gained and what is lost when we replace our human interlocutors, including our human therapists, with AI. In this viewpoint, we focus instead on a distinct and growing phenomenon: letting machines speak for us. What is at stake when we replace our own efforts at interpersonal engagement with CAI? The purpose of these technologies is, in part, to remove effort, but effort has enormous value, and in some cases, even intrinsic value. This is true in many realms, but especially in interpersonal relationships. To make an effort for someone, irrespective of what that effort amounts to, often conveys value and meaning in itself. We elaborate on the meaning, worth, and significance that may be lost when we relinquish effort in our interpersonal engagements as well as on the opportunities for self-understanding and growth that we may forsake.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
10 Applying generative AI with retrieval augmented generation to summarize and extract key clinical information from electronic health records.

应用具有检索增强生成功能的生成 AI ，从电子健康记录中汇总和提取关键临床信息。影响指数 : 8
发表时间：Jun 2024 14
来源期刊：J Biomed Inform PMID：38880236

DOI：10.1016/j.jbi.2024.104662
文章类型： Journal Article

背景：营养不良是老年护理机构（RACF）中普遍存在的问题，导致不良健康结果。从电子健康记录（EHR）的大量数据中有效提取关键临床信息的能力可以提高对问题严重程度的理解并制定有效的干预措施。这项研究旨在测试零射提示工程应用于生成人工智能（AI）模型的有效性，并结合检索增强生成（RAG）。用于在EHR中汇总结构化和非结构化数据并提取重要营养不良信息的自动化任务。
方法：我们使用了带零射提示的Llama213B模型。该数据集包括40个澳大利亚RACF中与营养不良管理相关的非结构化和结构化EHR。我们首先只对模型进行零射学习，然后将其与RAG相结合以完成两项任务：生成有关客户营养状况的结构化摘要，并提取有关营养不良风险因素的关键信息。我们在第一个任务中使用了25个音符，在第二个任务中使用了1,399个音符。我们根据黄金标准数据集手动评估了每个任务的模型输出。
结果：评估结果表明，应用于生成AI模型的零射学习在总结和提取有关RACF客户营养状况的信息方面非常有效。生成的摘要提供了原始数据的简洁和准确的表示，总体准确率为93.25％。RAG的加入改进了总结过程，导致6%的增长，达到99.25%的精度。该模型还证明了其提取风险因素的能力，准确率为90%。然而,添加RAG并没有进一步提高这项任务的准确性.总的来说,当信息在注释中明确说明时，该模型显示出稳健的性能；然而，它可能会遇到幻觉限制，特别是当细节没有明确提供时。
结论：这项研究证明了将零射学习应用于生成AI模型以自动生成EHR数据的结构化摘要并提取关键临床信息的高性能和局限性。RAG方法的加入提高了模型性能并减轻了幻觉问题。
BACKGROUND: Malnutrition is a prevalent issue in aged care facilities (RACFs), leading to adverse health outcomes. The ability to efficiently extract key clinical information from a large volume of data in electronic health records (EHR) can improve understanding about the extent of the problem and developing effective interventions. This research aimed to test the efficacy of zero-shot prompt engineering applied to generative artificial intelligence (AI) models on their own and in combination with retrieval augmented generation (RAG), for the automating tasks of summarizing both structured and unstructured data in EHR and extracting important malnutrition information.
METHODS: We utilized Llama 2 13B model with zero-shot prompting. The dataset comprises unstructured and structured EHRs related to malnutrition management in 40 Australian RACFs. We employed zero-shot learning to the model alone first, then combined it with RAG to accomplish two tasks: generate structured summaries about the nutritional status of a client and extract key information about malnutrition risk factors. We utilized 25 notes in the first task and 1,399 in the second task. We evaluated the model\'s output of each task manually against a gold standard dataset.
RESULTS: The evaluation outcomes indicated that zero-shot learning applied to generative AI model is highly effective in summarizing and extracting information about nutritional status of RACFs\' clients. The generated summaries provided concise and accurate representation of the original data with an overall accuracy of 93.25%. The addition of RAG improved the summarization process, leading to a 6% increase and achieving an accuracy of 99.25%. The model also proved its capability in extracting risk factors with an accuracy of 90%. However, adding RAG did not further improve accuracy in this task. Overall, the model has shown a robust performance when information was explicitly stated in the notes; however, it could encounter hallucination limitations, particularly when details were not explicitly provided.
CONCLUSIONS: This study demonstrates the high performance and limitations of applying zero-shot learning to generative AI models to automatic generation of structured summarization of EHRs data and extracting key clinical information. The inclusion of the RAG approach improved the model performance and mitigated the hallucination problem.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文

generative AI 关注

1 Assessing Laterality Errors in Radiology: Comparing Generative AI and Natural Language Processing.

2 The Role of Humanization and Robustness of Large Language Models in Conversational Artificial Intelligence for Individuals With Depression: A Critical Analysis.

3 Enhancing Early Lung Cancer Diagnosis: Predicting Lung Nodule Progression in Follow-Up Low-Dose CT Scan with Deep Generative Model.

4 ChatGPT in veterinary medicine: a practical guidance of generative artificial intelligence in clinics, education, and research.

5 Generative AI for precision neuroimaging biomarker development in psychiatry.

6 Performance of Artificial Intelligence Content Detectors Using Human and Artificial Intelligence-Generated Scientific Writing.

7 Exploring ChatGPT's potential in the clinical stream of neurorehabilitation.

8 Large-scale foundation models and generative AI for BigData neuroscience.

9 The Machine Speaks: Conversational AI and the Importance of Effort to Relationships of Meaning.

10 Applying generative AI with retrieval augmented generation to summarize and extract key clinical information from electronic health records.