clinical notes

临床注意事项
  • 文章类型: Systematic Review
    目的:未来的计算机辅助诊断和预后系统应该能够同时处理多模式数据。多模态深度学习(MDL)涉及多个数据源的集成,如图像和文本,有可能彻底改变生物医学数据的分析和解释。然而,它最近才引起研究人员的注意。为此,迫切需要对这一主题进行系统审查,确定当前工作的局限性,探索未来的方向。
    方法:在本范围审查中,我们旨在全面概述该领域的现状,并确定关键概念,研究类型,和研究空白,专注于生物医学图像和文本联合学习,主要是因为这两种是MDL研究中最常见的数据类型。
    结果:本研究回顾了多模态深度学习在五个任务中的当前用途:(1)报告生成,(2)视觉问答,(3)跨模态检索,(4)计算机辅助诊断,(5)语义分割。
    结论:我们的研究结果突出了MDL的多种应用和潜力,并为该领域的未来研究提出了方向。我们希望我们的审查将促进自然语言处理(NLP)和医学成像社区的合作,并支持下一代决策和计算机辅助诊断系统的开发。
    Computer-assisted diagnostic and prognostic systems of the future should be capable of simultaneously processing multimodal data. Multimodal deep learning (MDL), which involves the integration of multiple sources of data, such as images and text, has the potential to revolutionize the analysis and interpretation of biomedical data. However, it only caught researchers\' attention recently. To this end, there is a critical need to conduct a systematic review on this topic, identify the limitations of current work, and explore future directions.
    In this scoping review, we aim to provide a comprehensive overview of the current state of the field and identify key concepts, types of studies, and research gaps with a focus on biomedical images and texts joint learning, mainly because these two were the most commonly available data types in MDL research.
    This study reviewed the current uses of multimodal deep learning on five tasks: (1) Report generation, (2) Visual question answering, (3) Cross-modal retrieval, (4) Computer-aided diagnosis, and (5) Semantic segmentation.
    Our results highlight the diverse applications and potential of MDL and suggest directions for future research in the field. We hope our review will facilitate the collaboration of natural language processing (NLP) and medical imaging communities and support the next generation of decision-making and computer-assisted diagnostic system development.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    临床情绪是一种判断,关于个人健康的观察所促进的思想或态度。情绪分析在医疗保健领域引起了人们的注意,因为二次使用来自临床叙述的数据,具有多种应用,包括预测新出现的精神疾病或临床结果的可能性。目前的研究状况尚未总结。本研究提供了范围审查的结果,旨在提供临床叙述的情感分析概述,以总结现有研究并确定开放的研究差距。范围审查是根据PRISMA-ScR(系统审查的首选报告项目和范围审查的Meta分析扩展)指南进行的。通过搜索4个电子数据库(例如,PubMed,IEEEXplore)除了对所包括的研究进行后向和前向参考列表检查之外。我们提取了用例的信息,应用的方法和工具,使用的数据集和情绪分析方法的性能。在检索到的1200篇引文中,29个独特的研究被纳入审查,为期8年。大多数研究应用通用领域工具(例如TextBlob)和情感词典(例如SentiWordNet)来实现诸如临床结果预测之类的用例;其他人则提出了基于机器学习的新的特定领域情感分析方法。报告的准确度值在71.5-88.2%之间。用于评估和测试的数据通常从MIMIC数据库或i2b2挑战中检索。与人工神经网络相关的最新进展尚未在该领域得到充分考虑。我们得出的结论是,未来的研究应该集中在开发金本位制情绪词典上,适应临床叙事的具体特点。必须努力增强现有的或创建新的高质量的临床叙述标记数据集。最后,应研究最先进的机器学习方法在自然语言处理中的适用性,尤其是基于变形金刚的模型在临床叙事情感分析中的应用.
    A clinical sentiment is a judgment, thought or attitude promoted by an observation with respect to the health of an individual. Sentiment analysis has drawn attention in the healthcare domain for secondary use of data from clinical narratives, with a variety of applications including predicting the likelihood of emerging mental illnesses or clinical outcomes. The current state of research has not yet been summarized. This study presents results from a scoping review aiming at providing an overview of sentiment analysis of clinical narratives in order to summarize existing research and identify open research gaps. The scoping review was carried out in line with the PRISMA-ScR (Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews) guideline. Studies were identified by searching 4 electronic databases (e.g., PubMed, IEEE Xplore) in addition to conducting backward and forward reference list checking of the included studies. We extracted information on use cases, methods and tools applied, used datasets and performance of the sentiment analysis approach. Of 1,200 citations retrieved, 29 unique studies were included in the review covering a period of 8 years. Most studies apply general domain tools (e.g. TextBlob) and sentiment lexicons (e.g. SentiWordNet) for realizing use cases such as prediction of clinical outcomes; others proposed new domain-specific sentiment analysis approaches based on machine learning. Accuracy values between 71.5-88.2% are reported. Data used for evaluation and test are often retrieved from MIMIC databases or i2b2 challenges. Latest developments related to artificial neural networks are not yet fully considered in this domain. We conclude that future research should focus on developing a gold standard sentiment lexicon, adapted to the specific characteristics of clinical narratives. Efforts have to be made to either augment existing or create new high-quality labeled data sets of clinical narratives. Last, the suitability of state-of-the-art machine learning methods for natural language processing and in particular transformer-based models should be investigated for their application for sentiment analysis of clinical narratives.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Temporal relations between clinical events play an important role in clinical assessment and decision making. Extracting such relations from free text data is a challenging task because it lies on between medical natural language processing, temporal representation and temporal reasoning.
    To survey existing methods for extracting temporal relations (TLINKs) between events from clinical free text in English; to establish the state-of-the-art in this field; and to identify outstanding methodological challenges.
    A systematic search in PubMed and the DBLP computer science bibliography was conducted for studies published between January 2006 and December 2018. The relevant studies were identified by examining the titles and abstracts. Then, the full text of selected studies was analyzed in depth and information were collected on TLINK tasks, TLINK types, data sources, features selection, methods used, and reported performance.
    A total of 2834 publications were identified for title and abstract screening. Of these publications, 51 studies were selected. Thirty-two studies used machine learning approaches, 15 studies used a hybrid approaches, and only four studies used a rule-based approach. The majority of studies use publicly available corpora: THYME (28 studies) and the i2b2 corpus (17 studies).
    The performance of TLINK extraction methods ranges widely depending on relation types and events (e.g. from 32% to 87% F-score for identifying relations between clinical events and document creation time). A small set of TLINKs (before, after, overlap and contains) has been widely studied with relatively good performance, whereas other types of TLINK (e.g., started by, finished by, precedes) are rarely studied and remain challenging. Machine learning classifiers (such as Support Vector Machine and Conditional Random Fields) and Deep Neural Networks were among the best performing methods for extracting TLINKs, but nearly all the work has been carried out and tested on two publicly available corpora only. The field would benefit from the availability of more publicly available, high-quality, annotated clinical text corpora.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    背景:在慢性病领域需要补充和超越循证医学的新方法,鉴于这种疾病在全球人口中的发病率越来越高。一个有希望的途径是二次使用电子健康记录(EHR),分析患者数据以进行临床和转化研究。基于机器学习处理EHR的方法提高了对患者临床轨迹和慢性病风险预测的理解,创造一个独特的机会来获得以前未知的临床见解。然而,在自由形式的文本中,大量的临床病史仍然被锁定在临床叙述之后。因此,释放EHR数据的全部潜力取决于自然语言处理(NLP)方法的发展,以自动将临床文本转换为结构化临床数据,可以指导临床决策并可能延迟或预防疾病发作。
    目的:研究的目的是全面概述应用于与慢性病相关的自由文本临床笔记的NLP方法的发展和吸收,包括调查NLP方法在理解临床叙事方面面临的挑战。
    方法:遵循系统审查和荟萃分析(PRISMA)指南的首选报告项目,并使用“临床笔记”在5个数据库中进行搜索,\"\"自然语言处理,\"和\"慢性疾病\"及其变化作为关键词,以最大限度地覆盖文章。
    结果:在所考虑的2652篇文章中,106符合纳入标准。对纳入的论文进行审查,确定了43种慢性病,然后使用国际疾病分类将其进一步分为10种疾病类别,第十次修订。大多数研究集中在循环系统疾病(n=38),而内分泌和代谢疾病最少(n=14)。这是由于与代谢疾病相关的临床记录的结构,通常包含更多的结构化数据,与循环系统疾病的医疗记录相比,它们更多地关注非结构化数据,因此看到了NLP的更多关注。审查表明,与基于规则的方法相比,机器学习方法的使用显着增加;但是,深度学习方法仍然是新兴的(n=3)。因此,大多数作品都集中在疾病表型的分类上,只有少数论文涉及从自由文本中提取合并症或将临床笔记与结构化数据整合。有一个值得注意的使用相对简单的方法,例如浅层分类器(或与基于规则的方法的组合),由于预测的可解释性,对于更复杂的方法来说,这仍然是一个重要的问题。最后,公开可用数据的稀缺也可能导致更先进方法的开发不足,例如从临床笔记中提取词嵌入。
    结论:仍然需要努力改善(1)临床NLP方法从提取到理解的进展;(2)识别实体之间的关系,而不是孤立的实体;(3)了解过去的时间提取,电流,和未来的临床事件;(4)利用临床知识的替代来源;(5)大规模,去识别的临床身体。
    BACKGROUND: Novel approaches that complement and go beyond evidence-based medicine are required in the domain of chronic diseases, given the growing incidence of such conditions on the worldwide population. A promising avenue is the secondary use of electronic health records (EHRs), where patient data are analyzed to conduct clinical and translational research. Methods based on machine learning to process EHRs are resulting in improved understanding of patient clinical trajectories and chronic disease risk prediction, creating a unique opportunity to derive previously unknown clinical insights. However, a wealth of clinical histories remains locked behind clinical narratives in free-form text. Consequently, unlocking the full potential of EHR data is contingent on the development of natural language processing (NLP) methods to automatically transform clinical text into structured clinical data that can guide clinical decisions and potentially delay or prevent disease onset.
    OBJECTIVE: The goal of the research was to provide a comprehensive overview of the development and uptake of NLP methods applied to free-text clinical notes related to chronic diseases, including the investigation of challenges faced by NLP methodologies in understanding clinical narratives.
    METHODS: Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines were followed and searches were conducted in 5 databases using \"clinical notes,\" \"natural language processing,\" and \"chronic disease\" and their variations as keywords to maximize coverage of the articles.
    RESULTS: Of the 2652 articles considered, 106 met the inclusion criteria. Review of the included papers resulted in identification of 43 chronic diseases, which were then further classified into 10 disease categories using the International Classification of Diseases, 10th Revision. The majority of studies focused on diseases of the circulatory system (n=38) while endocrine and metabolic diseases were fewest (n=14). This was due to the structure of clinical records related to metabolic diseases, which typically contain much more structured data, compared with medical records for diseases of the circulatory system, which focus more on unstructured data and consequently have seen a stronger focus of NLP. The review has shown that there is a significant increase in the use of machine learning methods compared to rule-based approaches; however, deep learning methods remain emergent (n=3). Consequently, the majority of works focus on classification of disease phenotype with only a handful of papers addressing extraction of comorbidities from the free text or integration of clinical notes with structured data. There is a notable use of relatively simple methods, such as shallow classifiers (or combination with rule-based methods), due to the interpretability of predictions, which still represents a significant issue for more complex methods. Finally, scarcity of publicly available data may also have contributed to insufficient development of more advanced methods, such as extraction of word embeddings from clinical notes.
    CONCLUSIONS: Efforts are still required to improve (1) progression of clinical NLP methods from extraction toward understanding; (2) recognition of relations among entities rather than entities in isolation; (3) temporal extraction to understand past, current, and future clinical events; (4) exploitation of alternative sources of clinical knowledge; and (5) availability of large-scale, de-identified clinical corpora.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着电子健康记录(EHR)的快速采用,希望从EHR中获取信息和知识,以支持护理点的自动化系统,并使EHR能够用于临床和转化研究。用于促进EHR数据的二次使用的一个关键组件是信息提取(IE)任务,自动从文本中提取和编码临床信息。
    在这篇文献综述中,我们对最近发表的关于临床信息提取(IE)应用的研究进行了综述。
    根据OvidMEDLINE过程中和其他非索引引文,对2009年1月至2016年9月发表的文章进行了文献检索,OvidMEDLINE,OvidEmbase,Scopus,WebofScience,ACM数字图书馆。
    总共确定了1917种出版物用于标题和摘要筛选。在这些出版物中,在这篇综述中,共选择了263篇文章,并就发表地点和数据源进行了讨论,临床IE工具,方法,以及在疾病和药物相关研究领域的应用,和临床工作流程优化。
    临床IE已用于广泛的应用,然而,使用EHR数据的临床研究与使用临床IE的研究之间存在相当大的差距.这项研究使我们能够更具体地了解这一差距,并提供潜在的解决方案来弥合这一差距。
    With the rapid adoption of electronic health records (EHRs), it is desirable to harvest information and knowledge from EHRs to support automated systems at the point of care and to enable secondary use of EHRs for clinical and translational research. One critical component used to facilitate the secondary use of EHR data is the information extraction (IE) task, which automatically extracts and encodes clinical information from text.
    In this literature review, we present a review of recent published research on clinical information extraction (IE) applications.
    A literature search was conducted for articles published from January 2009 to September 2016 based on Ovid MEDLINE In-Process & Other Non-Indexed Citations, Ovid MEDLINE, Ovid EMBASE, Scopus, Web of Science, and ACM Digital Library.
    A total of 1917 publications were identified for title and abstract screening. Of these publications, 263 articles were selected and discussed in this review in terms of publication venues and data sources, clinical IE tools, methods, and applications in the areas of disease- and drug-related studies, and clinical workflow optimizations.
    Clinical IE has been used for a wide range of applications, however, there is a considerable gap between clinical studies using EHR data and studies using clinical IE. This study enabled us to gain a more concrete understanding of the gap and to provide potential solutions to bridge this gap.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号