Topological data analysis

拓扑数据分析
  • 文章类型: Journal Article
    近年来取得的重大技术进步使生物医学数字技术的使用急剧增加-从电子健康记录的广泛使用到改进的医学成像能力以及基因组测序的日益普及,都有助于生物医学研究和临床护理的“数字化”。随着这种向计算机化工具的转变,可用数据量急剧增加,当前能够从这些丰富的信息中提取有意义的知识的数据分析工具尚未赶上。本文旨在概述新兴的数学方法,这些方法有可能提高临床医生和研究人员分析生物医学数据的能力,但由于生命科学研究界缺乏概念可及性和意识,可能会阻碍这样做。特别是,我们专注于拓扑数据分析(TDA),一组基于代数拓扑数学领域的方法,旨在描述和利用与数据的“形状”相关的特征。我们的目标是通过对其理论基础进行概念性讨论,然后对其已发表的科学研究应用进行调查,从而使非数学家更容易使用此类技术。最后,我们讨论了这些方法的局限性,并提出了将数学工具整合到临床护理和生物医学信息学中的未来工作的潜在途径。
    Significant technological advances made in recent years have shepherded a dramatic increase in utilization of digital technologies for biomedicine- everything from the widespread use of electronic health records to improved medical imaging capabilities and the rising ubiquity of genomic sequencing contribute to a \"digitization\" of biomedical research and clinical care. With this shift toward computerized tools comes a dramatic increase in the amount of available data, and current tools for data analysis capable of extracting meaningful knowledge from this wealth of information have yet to catch up. This article seeks to provide an overview of emerging mathematical methods with the potential to improve the abilities of clinicians and researchers to analyze biomedical data, but may be hindered from doing so by a lack of conceptual accessibility and awareness in the life sciences research community. In particular, we focus on topological data analysis (TDA), a set of methods grounded in the mathematical field of algebraic topology that seeks to describe and harness features related to the \"shape\" of data. We aim to make such techniques more approachable to non-mathematicians by providing a conceptual discussion of their theoretical foundations followed by a survey of their published applications to scientific research. Finally, we discuss the limitations of these methods and suggest potential avenues for future work integrating mathematical tools into clinical care and biomedical informatics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    来自传感器的肥胖相关数据的丰富来源,智能手机应用程序,电子医疗健康记录和保险数据可以为理解带来新的见解,预防和治疗肥胖。对于如此大的数据集,机器学习提供了复杂而优雅的工具来描述,对肥胖相关风险和结局进行分类和预测。这里,我们回顾了预测和/或分类的机器学习方法,如线性和逻辑回归,人工神经网络,深度学习和决策树分析。我们还回顾了描述和表征数据的方法,如聚类分析,主成分分析,网络科学和拓扑数据分析。我们以高级概述介绍每种方法,然后是成功应用的示例。然后将算法应用于国家健康和营养检查调查,以证明方法,效用和结果。还评估了每种方法的优势和局限性。机器学习算法的总结提供了专门应用于肥胖的数据分析状态的独特概述。
    Rich sources of obesity-related data arising from sensors, smartphone apps, electronic medical health records and insurance data can bring new insights for understanding, preventing and treating obesity. For such large datasets, machine learning provides sophisticated and elegant tools to describe, classify and predict obesity-related risks and outcomes. Here, we review machine learning methods that predict and/or classify such as linear and logistic regression, artificial neural networks, deep learning and decision tree analysis. We also review methods that describe and characterize data such as cluster analysis, principal component analysis, network science and topological data analysis. We introduce each method with a high-level overview followed by examples of successful applications. The algorithms were then applied to National Health and Nutrition Examination Survey to demonstrate methodology, utility and outcomes. The strengths and limitations of each method were also evaluated. This summary of machine learning algorithms provides a unique overview of the state of data analysis applied specifically to obesity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号