generative language models

生成语言模型
  • 文章类型: Journal Article
    目标:最近,大型语言模型(LLM)在自然语言理解方面展示了卓越的能力。在展示日常对话和问答(QA)情况的熟练程度时,这些模型经常在需要精度的领域中挣扎,如医疗应用,由于他们缺乏特定领域的知识。在这篇文章中,我们描述了建造一个强大的,专为医学应用而设计的开源语言模型,称为PMC-LLaMA。
    方法:我们将通用LLM调整为医学领域,通过整合4.8M生物医学学术论文和30K医学教科书,涉及以数据为中心的知识注入,以及全面的特定领域指令微调,包括医疗QA,推理的理由,和对话对话与202M令牌。
    结果:在评估各种公共医疗QA基准和手动评级时,我们的轻量级PMC-LLaMA,仅由13B个参数组成,表现出优越的性能,甚至超过了ChatGPT.所有型号,代码,和调整指令的数据集将发布给研究界。
    结论:我们的贡献是3倍:(1)我们建立了一个面向医学领域的开源LLM。我们相信提出的PMC-LLaMA模型可以促进医学基础模型的进一步发展,作为医学训练的基本生成语言骨干;(2)我们进行彻底的消融研究,以证明每个建议组件的有效性,展示不同的训练数据和模型尺度如何影响医学LLM;(3)我们贡献了大规模,用于指令调整的综合数据集。
    结论:在本文中,我们系统地研究了建立开源医疗专用LLM的过程,PMC-LLaMA.
    OBJECTIVE: Recently, large language models (LLMs) have showcased remarkable capabilities in natural language understanding. While demonstrating proficiency in everyday conversations and question-answering (QA) situations, these models frequently struggle in domains that require precision, such as medical applications, due to their lack of domain-specific knowledge. In this article, we describe the procedure for building a powerful, open-source language model specifically designed for medicine applications, termed as PMC-LLaMA.
    METHODS: We adapt a general-purpose LLM toward the medical domain, involving data-centric knowledge injection through the integration of 4.8M biomedical academic papers and 30K medical textbooks, as well as comprehensive domain-specific instruction fine-tuning, encompassing medical QA, rationale for reasoning, and conversational dialogues with 202M tokens.
    RESULTS: While evaluating various public medical QA benchmarks and manual rating, our lightweight PMC-LLaMA, which consists of only 13B parameters, exhibits superior performance, even surpassing ChatGPT. All models, codes, and datasets for instruction tuning will be released to the research community.
    CONCLUSIONS: Our contributions are 3-fold: (1) we build up an open-source LLM toward the medical domain. We believe the proposed PMC-LLaMA model can promote further development of foundation models in medicine, serving as a medical trainable basic generative language backbone; (2) we conduct thorough ablation studies to demonstrate the effectiveness of each proposed component, demonstrating how different training data and model scales affect medical LLMs; (3) we contribute a large-scale, comprehensive dataset for instruction tuning.
    CONCLUSIONS: In this article, we systematically investigate the process of building up an open-source medical-specific LLM, PMC-LLaMA.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在线医学教育经常面临与沟通和理解障碍相关的挑战,特别是当教学语言不同于医疗保健提供者和护理人员的母语时。我们的研究解决了儿科医疗保健中的这些挑战,采用生成语言模型来产生一个语言定制,涵盖团队培训主题的多语言课程,外科手术,围手术期护理,病人的旅程,以及医疗保健提供者和护理人员的教育资源。
    一个跨学科小组用英语制定了视频课程,解决儿科医疗保健的微妙挑战。随后,它被翻译成西班牙语,主要强调拉丁美洲的人口统计学,利用OpenAI的GPT-4。视频丰富了母语人士的合成语音配置文件,以维护叙事的一致性。
    我们创建了45个多语言视频模块的集合,每个长度从3到8分钟不等,涵盖团队合作等基本主题,如何改善人际沟通,“我该怎么做”外科手术,以及麻醉领域的焦点话题,重症监护病房,病房护理,从医院过渡到家庭。通过AI驱动的翻译,这个全面的集合确保了全球的可及性,并为医疗保健专业人员和护理人员提供了语言上的包容性资源,以提高全球儿科护理标准.
    多语言教育内容的发展标志着儿科护理朝着全球标准化迈出了一步。通过使用高级语言模型进行翻译,我们确保课程具有包容性和可及性。该计划与世界卫生组织的数字健康指南非常吻合,倡导数字化医疗教育。
    Online medical education often faces challenges related to communication and comprehension barriers, particularly when the instructional language differs from the healthcare providers\' and caregivers\' native languages. Our study addresses these challenges within pediatric healthcare by employing generative language models to produce a linguistically tailored, multilingual curriculum that covers the topics of team training, surgical procedures, perioperative care, patient journeys, and educational resources for healthcare providers and caregivers.
    An interdisciplinary group formulated a video curriculum in English, addressing the nuanced challenges of pediatric healthcare. Subsequently, it was translated into Spanish, primarily emphasizing Latin American demographics, utilizing OpenAI\'s GPT-4. Videos were enriched with synthetic voice profiles of native speakers to uphold the consistency of the narrative.
    We created a collection of 45 multilingual video modules, each ranging from 3 to 8 min in length and covering essential topics such as teamwork, how to improve interpersonal communication, \"How I Do It\" surgical procedures, as well as focused topics in anesthesia, intensive care unit care, ward nursing, and transitions from hospital to home. Through AI-driven translation, this comprehensive collection ensures global accessibility and offers healthcare professionals and caregivers a linguistically inclusive resource for elevating standards of pediatric care worldwide.
    This development of multilingual educational content marks a progressive step toward global standardization of pediatric care. By utilizing advanced language models for translation, we ensure that the curriculum is inclusive and accessible. This initiative aligns well with the World Health Organization\'s Digital Health Guidelines, advocating for digitally enabled healthcare education.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    患者在线记录访问(ORA)正在全球范围内增长。在一些国家,包括美国和瑞典,随着患者快速访问他们在网络上的完整记录,包括实验室和测试结果,处方药清单,疫苗接种,甚至是临床医生写的非常叙述性的报告(后者,通常称为“开放说明”)。在美国,患者的ORA也可以下载形式与其他应用程序一起使用。虽然调查研究表明,一些患者报告说ORA有很多好处,围绕撰写患者现在可以阅读的临床文档的实施仍然存在挑战.有了ORA,记录的功能正在发展;它不再只是医生的备忘录,也是患者的沟通工具。研究表明,临床医生正在改变他们编写文档的方式,引发对准确性和完整性的担忧。其他问题包括工作负担;虽然很少有客观研究检查ORA对工作量的影响,一些研究表明,临床医生花更多的时间写笔记和回答与患者记录相关的问题。旨在解决其中一些问题,已经提出了临床医生和患者教育策略。在这篇观点论文中,我们探索了这些方法,并提出了另一个长期策略:使用生成人工智能(AI)来支持临床医生记录患者更容易理解的叙述性总结.适用于叙述性临床文档,我们建议这种方法可能会大大有助于保持笔记的准确性,加强写作的清晰度和信号的同情和以患者为中心的护理,并作为文件工作负担的缓冲。然而,我们还考虑了当前与现有生成AI相关的风险。我们强调,这项创新要在ORA中发挥关键作用,临床笔记的共同创造将势在必行。我们还警告说,临床医生需要在如何与生成AI一起工作以优化其巨大潜力方面得到支持。
    Patients\' online record access (ORA) is growing worldwide. In some countries, including the United States and Sweden, access is advanced with patients obtaining rapid access to their full records on the web including laboratory and test results, lists of prescribed medications, vaccinations, and even the very narrative reports written by clinicians (the latter, commonly referred to as \"open notes\"). In the United States, patient\'s ORA is also available in a downloadable form for use with other apps. While survey studies have shown that some patients report many benefits from ORA, there remain challenges with implementation around writing clinical documentation that patients may now read. With ORA, the functionality of the record is evolving; it is no longer only an aide memoire for doctors but also a communication tool for patients. Studies suggest that clinicians are changing how they write documentation, inviting worries about accuracy and completeness. Other concerns include work burdens; while few objective studies have examined the impact of ORA on workload, some research suggests that clinicians are spending more time writing notes and answering queries related to patients\' records. Aimed at addressing some of these concerns, clinician and patient education strategies have been proposed. In this viewpoint paper, we explore these approaches and suggest another longer-term strategy: the use of generative artificial intelligence (AI) to support clinicians in documenting narrative summaries that patients will find easier to understand. Applied to narrative clinical documentation, we suggest that such approaches may significantly help preserve the accuracy of notes, strengthen writing clarity and signals of empathy and patient-centered care, and serve as a buffer against documentation work burdens. However, we also consider the current risks associated with existing generative AI. We emphasize that for this innovation to play a key role in ORA, the cocreation of clinical notes will be imperative. We also caution that clinicians will need to be supported in how to work alongside generative AI to optimize its considerable potential.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目标:最近发表在JMIRMedEducJournal上的一项研究探讨了生成预培训(ChatGPT)的潜在影响,生成语言模型,关于医学教育,研究,和实践。在本研究中,对ChatGPT进行了访谈,以确定其在解剖学教育(AE)和解剖学研究(AR)中的应用能力和潜力.
    方法:该研究涉及ChatGPT在获得第4版在线订阅后提出的18个问题。问题是随机选择的,并根据准确性进行评估,相关性,和全面性。
    结果:ChatGPT提供了准确且结构良好的解剖学描述,包括临床相关性和结构之间的关系。聊天机器人还提供了章节的简明摘要和有关解剖学术语的有用建议,即使是复杂的术语。然而,当谈到解剖学变异及其临床意义时,chatbot的答复是不充分的,除非变体被系统地分类到类型。ChatGPT-4生成了多项选择测验和不同难度级别的匹配问题,以及与文本一起呈现的文章摘要。然而,聊天机器人认识到它在准确性方面的局限性,就像当前研究的作者一样。
    结论:ChatGPT-4对于解剖学领域的学生来说是一种有价值的互动教育工具,鼓励参与和进一步的问题。然而,它不能取代教育工作者的关键作用,应该作为补充工具。未来的研究应该为ChatGPT在医学教育中的最佳使用和应用建立指南。
    OBJECTIVE: A recent study published in the JMIR Med Educ Journal explored the potential impact of the Generative Pre-Train (ChatGPT), a generative language model, on medical education, research, and practice. In the present study, an interview with ChatGPT was conducted to determine its capabilities and potential for use in anatomy education (AE) and anatomy research (AR).
    METHODS: The study involved 18 questions asked of ChatGPT after obtaining an online subscription to the 4th edition. The questions were randomly selected and evaluated based on accuracy, relevance, and comprehensiveness.
    RESULTS: The ChatGPT provided accurate and well-structured anatomical descriptions, including clinical relevance and relationships between structures. The chatbot also offered concise summaries of chapters and helpful advice on anatomical terminology, even with complex terms. However, when it came to anatomical variants and their clinical significance, the chatbot\'s replies were inadequate unless variants were systematically classified into types. ChatGPT-4 generated multiple-choice quizzes and matching questions of varying difficulty levels, as well as summaries of articles when presented with text. However, the chatbot recognized its limitations in terms of accuracy, as did the authors of the current study.
    CONCLUSIONS: ChatGPT-4 can be a valuable interactive educational tool for students in the field of anatomy, encouraging engagement and further questions. However, it cannot replace the critical role of educators and should be used as a complementary tool. Future research should establish guidelines for ChatGPT\'s optimal use and application in medical education.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    人工智能(AI)和语言模型,如ChatGPT-4(生成预训练变压器)最近取得了巨大的进步,并正在迅速改变医学的格局。心脏病学是许多利用AI旨在改善患者护理的专业之一。生成AI,使用先进的机器学习算法,有可能诊断心脏病,并推荐适合患者的管理方案。这可以不仅通过推荐最佳治疗计划而且通过提高医师效率来导致改善的患者结果。语言模型可以帮助医生完成管理任务,让他们花更多的时间在病人护理上。然而,在医学领域使用人工智能和语言模型有几个问题。这些技术可能不是最新研究的最新技术,可能会提供过时的信息,这可能导致不良事件。其次,人工智能工具可能很昂贵,导致医疗费用增加,普通人群的可及性降低。随着AI变得越来越主流,人们也担心失去人情味和同理心。医疗保健专业人员需要接受充分的培训才能使用这些工具。虽然人工智能和语言模型有许多有益的特征,所有医疗保健提供者都需要参与并意识到生成AI,以确保其最佳使用并减轻与实施相关的任何潜在风险和挑战。在这次审查中,我们讨论了语言模型在心脏病学领域的各种用途。
    Artificial intelligence (AI) and language models such as ChatGPT-4 (Generative Pretrained Transformer) have made tremendous advances recently and are rapidly transforming the landscape of medicine. Cardiology is among many of the specialties that utilize AI with the intention of improving patient care. Generative AI, with the use of its advanced machine learning algorithms, has the potential to diagnose heart disease and recommend management options suitable for the patient. This may lead to improved patient outcomes not only by recommending the best treatment plan but also by increasing physician efficiency. Language models could assist physicians with administrative tasks, allowing them to spend more time on patient care. However, there are several concerns with the use of AI and language models in the field of medicine. These technologies may not be the most up-to-date with the latest research and could provide outdated information, which may lead to an adverse event. Secondly, AI tools can be expensive, leading to increased healthcare costs and reduced accessibility to the general population. There is also concern about the loss of the human touch and empathy as AI becomes more mainstream. Healthcare professionals would need to be adequately trained to utilize these tools. While AI and language models have many beneficial traits, all healthcare providers need to be involved and aware of generative AI so as to assure its optimal use and mitigate any potential risks and challenges associated with its implementation. In this review, we discuss the various uses of language models in the field of cardiology.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Spoken communication occurs in a \"noisy channel\" characterized by high levels of environmental noise, variability within and between speakers, and lexical and syntactic ambiguity. Given these properties of the received linguistic input, robust spoken word recognition-and language processing more generally-relies heavily on listeners\' prior knowledge to evaluate whether candidate interpretations of that input are more or less likely. Here we compare several broad-coverage probabilistic generative language models in their ability to capture human linguistic expectations. Serial reproduction, an experimental paradigm where spoken utterances are reproduced by successive participants similar to the children\'s game of \"Telephone,\" is used to elicit a sample that reflects the linguistic expectations of English-speaking adults. When we evaluate a suite of probabilistic generative language models against the yielded chains of utterances, we find that those models that make use of abstract representations of preceding linguistic context (i.e., phrase structure) best predict the changes made by people in the course of serial reproduction. A logistic regression model predicting which words in an utterance are most likely to be lost or changed in the course of spoken transmission corroborates this result. We interpret these findings in light of research highlighting the interaction of memory-based constraints and representations in language processing.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号