Supervised learning

监督学习
  • 文章类型: Journal Article
    最常见的精神疾病之一是重度抑郁症(MDD),这增加了自杀意念或过早死亡的可能性。在言语流畅性任务(VFT)期间通过功能近红外光谱(fNIRS)检测到的异常额叶血液动力学变化有可能用作评估临床症状的客观指标。然而,针对表现出抑郁症症状的个体的全面定量和客观评估工具仍未开发。从包含289名MDD患者和178名健康对照的大规模数据集中的467个样本中提取。在整个VFT中获得fNIRS测量值。为了识别独特的MDD生物标志物,本研究引入了一种从fNIRS信号中提取时空特征的数据表示方法,随后被用作潜在的预测因子。机器学习分类器(例如,实施梯度增强决策树(GBDT)和多层感知器)以评估预测所选特征的能力。交叉验证的平均值和标准偏差表明,GBDT模型,当与180特征图案组合时,以最有效的方式区分MDD患者与健康对照组。测试集的正确分类的准确性为0.829±0.053,AUC为0.895(95%CI:0.864-0.925),灵敏度为0.914±0.051。使用Shapley加法解释方法识别对MDD识别做出最重要贡献的渠道,位于额极区和背外侧前额叶皮层,以及三角形Broca区。在MDD中VFT期间异常前额叶活动的评估充当客观可测量的生物标志物,其可用于评估认知缺陷并促进MDD的早期筛查。本研究中建议的模型可应用于大规模病例对照fNIRS数据集,以检测MDD的独特特征,并为临床医生提供客观的基于生物标志物的分析仪器,以协助评估可疑病例。
    One of the most prevalent psychiatric disorders is major depressive disorder (MDD), which increases the probability of suicidal ideation or untimely demise. Abnormal frontal hemodynamic changes detected by functional near-infrared spectroscopy (fNIRS) during verbal fluency task (VFT) have the potential to be used as an objective indicator for assessing clinical symptoms. However, comprehensive quantitative and objective assessment instruments for individuals who exhibit symptoms suggestive of depression remain undeveloped. Drawing from a total of 467 samples in a large-scale dataset comprising 289 MDD patients and 178 healthy controls, fNIRS measurements were obtained throughout the VFT. To identify unique MDD biomarkers, this research introduced a data representation approach for extracting spatiotemporal features from fNIRS signals, which were subsequently utilized as potential predictors. Machine learning classifiers (e.g., Gradient Boosted Decision Trees (GBDT) and Multilayer Perceptron) were implemented to assess the ability to predict selected features. The mean and standard deviation of the cross-validation indicated that the GBDT model, when combined with the 180-feature pattern, distinguishes patients with MDD from healthy controls in the most effective manner. The accuracy of correct classification for the test set was 0.829 ± 0.053, with an AUC of 0.895 (95 % CI: 0.864-0.925) and a sensitivity of 0.914 ± 0.051. Channels that made the most important contribution to the identification of MDD were identified using Shapley Additive Explanations method, located in the frontopolar area and the dorsolateral prefrontal cortex, as well as pars triangularis Broca\'s area. Assessment of abnormal prefrontal activity during the VFT in MDD serves as an objectively measurable biomarker that could be utilized to evaluate cognitive deficits and facilitate early screening for MDD. The model suggested in this research could be applied to large-scale case-control fNIRS datasets to detect unique characteristics of MDD and offer clinicians an objective biomarker-based analytical instrument to assist in the evaluation of suspicious cases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项研究旨在开发和验证,通过机器学习,与特定于成年人和住院老年人的处方药物相关的跌倒风险预测模型.在三级医院进行了病例对照研究,2016年住院的9,037名成年人和老年人。使用以下算法对变量进行了分析:逻辑回归,天真的贝叶斯,随机森林和梯度提升。最佳模型在老年人亚组中呈现曲线下面积=0.628,与成人亚组的曲线下面积(AUC)=0.776相比。为此样品开发了一个特定的模型。梯度增强模型在老年人样本中表现最佳(AUC=0.71)。与在研究的总人口中开发的模型相比,基于专门针对老年人的药物开发的预测跌倒风险的模型表现更好。
    The study aimed to develop and validate, through machine learning, a fall risk prediction model related to prescribed medications specific to adults and older adults admitted to hospital. A case-control study was carried out in a tertiary hospital, involving 9,037 adults and older adults admitted to hospital in 2016. The variables were analyzed using the algorithms: logistic regression, naive bayes, random forest and gradient boosting. The best model presented an area under the curve = 0.628 in the older adult subgroup, compared to an area under the curve (AUC) = 0.776 in the adult subgroup. A specific model was developed for this sample. The gradient boosting model presented the best performance in the sample of older adults (AUC = 0.71). Models developed to predict the risk of falls based on medications specifically aimed at older adults presented better performance in relation to models developed in the total population studied.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    近年来,可穿戴传感器和生物电子学的机器学习技术取得了巨大的进步,它在实时传感数据分析中起着至关重要的作用,为个性化医疗提供临床级信息。为此,监督学习和无监督学习算法已经成为强大的工具,允许检测复杂的模式和关系,高维数据集。在这篇评论中,我们的目标是描述可穿戴传感器机器学习的最新进展,专注于算法技术的关键发展,应用程序,以及这种不断发展的景观所固有的挑战。此外,我们强调了机器学习方法提高准确性的潜力,可靠性,和可穿戴传感器数据的可解释性,并讨论这一新兴领域的机会和局限性。最终,我们的工作旨在为这个令人兴奋和快速发展的领域的未来研究工作提供路线图。
    Recent years have witnessed tremendous advances in machine learning techniques for wearable sensors and bioelectronics, which play an essential role in real-time sensing data analysis to provide clinical-grade information for personalized healthcare. To this end, supervised learning and unsupervised learning algorithms have emerged as powerful tools, allowing for the detection of complex patterns and relationships in large, high-dimensional data sets. In this Review, we aim to delineate the latest advancements in machine learning for wearable sensors, focusing on key developments in algorithmic techniques, applications, and the challenges intrinsic to this evolving landscape. Additionally, we highlight the potential of machine-learning approaches to enhance the accuracy, reliability, and interpretability of wearable sensor data and discuss the opportunities and limitations of this emerging field. Ultimately, our work aims to provide a roadmap for future research endeavors in this exciting and rapidly evolving area.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    获得越来越多的科学和临床数据,特别是随着电子健康记录的实施,重新点燃了人们对人工智能及其在健康科学中的应用的热情。在过去的几年中,随着几种基于机器学习和深度学习的医疗技术的发展,这种兴趣达到了高潮。胃肠病学和肝病学对研究和临床实践的影响已经很大,但不久的将来,人工智能和机器学习只能进一步整合到这一领域。人工智能和机器学习背后的概念最初似乎令人生畏,但是随着越来越熟悉,它们将成为每个临床医生工具包中的基本技能。在这次审查中,我们提供了机器学习基础知识的指南,人工智能中的一个集中研究领域,建立在经典统计学的基础上。最常见的机器学习方法,包括那些涉及深度学习的,也有描述。
    The access to increasing volumes of scientific and clinical data, particularly with the implementation of electronic health records, has reignited an enthusiasm for artificial intelligence and its application to the health sciences. This interest has reached a crescendo in the past few years with the development of several machine learning- and deep learning-based medical technologies. The impact on research and clinical practice within gastroenterology and hepatology has already been significant, but the near future promises only further integration of artificial intelligence and machine learning into this field. The concepts underlying artificial intelligence and machine learning initially seem intimidating, but with increasing familiarity, they will become essential skills in every clinician\'s toolkit. In this review, we provide a guide to the fundamentals of machine learning, a concentrated area of study within artificial intelligence that has been built on a foundation of classical statistics. The most common machine learning methodologies, including those involving deep learning, are also described.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    本文提出了一种使用低成本FSR传感器预测地面反作用力(GRF)和压力中心(CoP)的方案。GRF和CoP数据通常从智能鞋垫收集,以分析佩戴者的步态并诊断平衡问题。这种方法可用于改善用户的康复过程,并为特定疾病的患者提供定制的治疗计划,使其成为许多领域的有用技术。然而,用于直接监测GRF和CoP值的常规测量设备,例如F扫描,是昂贵的,对该行业的商业化构成挑战。为了解决这个问题,本文提出了一种技术来预测相关指标只使用低成本的力敏电阻(FSR)传感器,而不是昂贵的设备。在这项研究中,数据是从同时佩戴低成本FSR传感器和F扫描设备的受试者收集的,并使用监督学习技术分析收集的数据集之间的关系。使用所提出的技术,构建了一个人工神经网络,该神经网络可以仅使用来自FSR传感器的数据得出接近实际F扫描值的预测值。在这个过程中,使用六个虚拟力代替整个鞋底的压力值计算GRF和CoP。通过各种模拟验证,与传统预测技术相比,使用所提出的技术可以实现30%以上的改进预测精度。
    This paper proposes a scheme for predicting ground reaction force (GRF) and center of pressure (CoP) using low-cost FSR sensors. GRF and CoP data are commonly collected from smart insoles to analyze the wearer\'s gait and diagnose balance issues. This approach can be utilized to improve a user\'s rehabilitation process and enable customized treatment plans for patients with specific diseases, making it a useful technology in many fields. However, the conventional measuring equipment for directly monitoring GRF and CoP values, such as F-Scan, is expensive, posing a challenge to commercialization in the industry. To solve this problem, this paper proposes a technology to predict relevant indicators using only low-cost Force Sensing Resistor (FSR) sensors instead of expensive equipment. In this study, data were collected from subjects simultaneously wearing a low-cost FSR Sensor and an F-Scan device, and the relationship between the collected data sets was analyzed using supervised learning techniques. Using the proposed technique, an artificial neural network was constructed that can derive a predicted value close to the actual F-Scan values using only the data from the FSR Sensor. In this process, GRF and CoP were calculated using six virtual forces instead of the pressure value of the entire sole. It was verified through various simulations that it is possible to achieve an improved prediction accuracy of more than 30% when using the proposed technique compared to conventional prediction techniques.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:虚弱是一种老年综合征,其特征是个体易损性增加,暴露于外部压力源时依赖性和死亡率均增加。衰弱指数在常规临床实践中的使用受到几个因素的限制,比如病人的认知状态,协商时间,或缺乏患者的先验信息。
    目的:在本研究中,我们提出了一种客观的弱点衡量标准,基于来自手握力(HGS)的信号。
    方法:使用改进的Deyard测功机记录该信号,并使用基于监督学习方法的机器学习策略进行处理,以训练分类器。在一项横向试点研究中,从138名老年人的队列中生成了一个数据库,该研究将经典的老年问卷调查与生理数据相结合。
    方法:参与者是由合作实体提供医疗服务的老年病学家选择的患者。
    结果:为了处理生成的信息,过滤了HGS数据集的20个选定的重要特征,清洁,并提取。基于从最小组生成新样本的合成少数过采样技术(SMOTE)和去除噪声样本的ENN(基于K-最近邻的技术)的组合的技术作为数据的良好平衡分布提供了最佳结果。
    结论:训练随机森林分类器以92.9%的准确度预测脆弱标签,敏感度高于90%。
    BACKGROUND: Frailty is a geriatric syndrome characterized by increased individual vulnerability with an increase in both dependence and mortality when exposed to external stressors. The use of Frailty Indices in routine clinical practice is limited by several factors, such as the cognitive status of the patient, times of consultation, or lack of prior information from the patient.
    OBJECTIVE: In this study, we propose the generation of an objective measure of frailty, based on the signal from hand grip strength (HGS).
    METHODS: This signal was recorded with a modified Deyard dynamometer and processed using machine learning strategies based on supervised learning methods to train classifiers. A database was generated from a cohort of 138 older adults in a transverse pilot study that combined classical geriatric questionnaires with physiological data.
    METHODS: Participants were patients selected by geriatricians of medical services provided by collaborating entities.
    RESULTS: To process the generated information 20 selected significant features of the HGS dataset were filtered, cleaned, and extracted. A technique based on a combination of the Synthetic Minority Oversampling Technique (SMOTE) to generate new samples from the smallest group and ENN (technique based on K-nearest neighbors) to remove noisy samples provided the best results as a well-balanced distribution of data.
    CONCLUSIONS: A Random Forest Classifier was trained to predict the frailty label with 92.9% of accuracy, achieving sensitivities higher than 90%.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    传染病最近已构成全球性威胁,从地方病发展到大流行。早期发现和找到更好的治疗方法是遏制疾病及其传播的方法。机器学习(ML)已被证明是早期疾病诊断的理想方法。这篇评论重点介绍了ML算法在猴痘(MP)中的使用。各种型号,比如CNN,DL,NLP,朴素贝叶斯,GRA-TLA,HMD,阿丽玛,SEL,回归分析,和Twitter帖子是为了从数据集中提取有用的信息而构建的。这些发现表明,检测,分类,预测,和情感分析进行了主要分析。此外,这篇综述将有助于研究人员了解ML在MP中的最新实施情况,以及该领域的进一步进展,以发现有效的治疗方法。
    Infectious diseases have posed a global threat recently, progressing from endemic to pandemic. Early detection and finding a better cure are methods for curbing the disease and its transmission. Machine learning (ML) has demonstrated to be an ideal approach for early disease diagnosis. This review highlights the use of ML algorithms for monkeypox (MP). Various models, such as CNN, DL, NLP, Naïve Bayes, GRA-TLA, HMD, ARIMA, SEL, Regression analysis, and Twitter posts were built to extract useful information from the dataset. These findings show that detection, classification, forecasting, and sentiment analysis are primarily analyzed. Furthermore, this review will assist researchers in understanding the latest implementations of ML in MP and further progress in the field to discover potent therapeutics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    机器学习(ML)通过提高污染的预测准确性和管理策略,正在彻底改变地下水质量研究。这篇全面的综述探讨了ML技术的演变及其与环境科学的整合,评估230篇论文,以了解地下水质量研究的进展和挑战。它揭示了很大一部分研究忽略了关键的预处理步骤,对于模型准确性至关重要,83%的研究忽视了这一阶段。此外,虽然模型优化更常见,在65%的论文中被实施,模型可解释性存在明显差距,只有15%的研究为模型结果提供了解释。ML算法的比较评估和仔细选择评估指标对于确定模型的适用性和可靠性至关重要。审查强调了跨学科合作的必要性,方法的严谨,并不断创新,推动ML在地下水管理方面的发展。通过应对这些挑战并实施解决方案,可以利用ML的全部潜力来解决复杂的环境问题,并确保可持续的地下水管理。这份全面而重要的审查文件可以作为指导框架,为在地下水质量研究中开发ML建立最低标准。
    Machine learning (ML) is revolutionizing groundwater quality research by enhancing predictive accuracy and management strategies for contamination. This comprehensive review explores the evolution of ML technologies and their integration into environmental science, assessing 230 papers to understand the advancements and challenges in groundwater quality research. It reveals that a substantial portion of the research neglects critical preprocessing steps, crucial for model accuracy, with 83 % of the studies overlooking this phase. Furthermore, while model optimization is more commonly addressed, being implemented in 65 % of the papers, there is a noticeable gap in model interpretability, with only 15 % of the research providing explanations for model outcomes. Comparative evaluation of ML algorithms and careful selection of evaluation metrics are deemed essential for determining model fitness and reliability. The review underscores the need for interdisciplinary collaboration, methodological rigor, and continuous innovation to advance ML in groundwater management. By addressing these challenges and implementing solutions, the full potential of ML can be harnessed to tackle complex environmental issues and ensure sustainable groundwater management. This comprehensive and critical review paper can serve as a guiding framework to establish minimum standards for developing ML in groundwater quality studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    由于硬件配置的变化,多变量校准模型在外推校准仪器方面经常遇到挑战。信号处理算法,或环境条件。已经开发了校准传递技术来缓解这个问题。在这项研究中,我们介绍了一种称为监督因子分析转移(SFAT)的新方法,旨在实现稳健和可解释的校准转移。SFAT从概率框架运行,并将响应变量集成到其传输过程中,以有效地将目标仪器的数据与源仪器的数据对齐。在SFAT模型中,来自源仪器的数据,目标仪器,并且响应变量被共同投影到一组共享的潜在变量上。这些潜在变量作为三个不同领域之间信息传递的管道,从而促进有效的光谱转移。此外,SFAT明确建模与每个变量相关的噪声方差,从而最大限度地减少非信息噪声的传输。此外,我们提供了经验证据,展示了SFAT在三个真实世界数据集的有效性,在校准转移方案中展示其卓越的性能。
    Multivariate calibration models often encounter challenges in extrapolating beyond the calibration instruments due to variations in hardware configurations, signal processing algorithms, or environmental conditions. Calibration transfer techniques have been developed to mitigate this issue. In this study, we introduce a novel methodology known as Supervised Factor Analysis Transfer (SFAT) aimed at achieving robust and interpretable calibration transfer. SFAT operates from a probabilistic framework and integrates response variables into its transfer process to effectively align data from the target instrument to that of the source instrument. Within the SFAT model, the data from the source instrument, the target instrument, and the response variables are collectively projected onto a shared set of latent variables. These latent variables serve as the conduit for information transfer between the three distinct domains, thereby facilitating effective spectra transfer. Moreover, SFAT explicitly models the noise variances associated with each variable, thereby minimizing the transfer of non-informative noise. Furthermore, we provide empirical evidence showcasing the efficacy of SFAT across three real-world datasets, demonstrating its superior performance in calibration transfer scenarios.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    胰腺癌是世界上最致命的癌症之一,5年生存率低于5%,所有癌症类型中最低的。胰腺导管腺癌(PDAC)是最常见和侵袭性的胰腺癌,在过去的几十年中已被列为健康紧急情况。PDAC的组织病理学诊断和预后评估是耗时的,辛苦,在当前的临床实践条件下具有挑战性。病理人工智能(AI)的研究最近一直在积极进行。然而,获取医疗数据具有挑战性;开放病理数据量很小,并且缺乏医务人员绘制的开放注释数据,这使得进行病理学AI研究变得困难。这里,我们提供易于获取的高质量注释数据来解决上述障碍。通过使用深度卷积神经网络结构的监督学习来执行数据评估,以分割由医务人员直接从开放WSI数据集中绘制的11个注释的PDAC组织病理学整张幻灯片图像(WSI)。我们可视化了WSI上Dice评分为73%的组织病理学图像的分割结果,包括PDAC区域,从而确定对PDAC诊断重要的区域并证明高数据质量。此外,人工智能辅助病理学家可以显著提高工作效率。我们提出的病理学AI指南在开发PDAC的组织病理学AI方面是有效的,并且在临床领域具有重要意义。
    Pancreatic cancer is one of the most lethal cancers worldwide, with a 5-year survival rate of less than 5%, the lowest of all cancer types. Pancreatic ductal adenocarcinoma (PDAC) is the most common and aggressive pancreatic cancer and has been classified as a health emergency in the past few decades. The histopathological diagnosis and prognosis evaluation of PDAC is time-consuming, laborious, and challenging in current clinical practice conditions. Pathological artificial intelligence (AI) research has been actively conducted lately. However, accessing medical data is challenging; the amount of open pathology data is small, and the absence of open-annotation data drawn by medical staff makes it difficult to conduct pathology AI research. Here, we provide easily accessible high-quality annotation data to address the abovementioned obstacles. Data evaluation is performed by supervised learning using a deep convolutional neural network structure to segment 11 annotated PDAC histopathological whole slide images (WSIs) drawn by medical staff directly from an open WSI dataset. We visualized the segmentation results of the histopathological images with a Dice score of 73% on the WSIs, including PDAC areas, thus identifying areas important for PDAC diagnosis and demonstrating high data quality. Additionally, pathologists assisted by AI can significantly increase their work efficiency. The pathological AI guidelines we propose are effective in developing histopathological AI for PDAC and are significant in the clinical field.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号