Ensemble learning

合奏学习
  • 文章类型: Journal Article
    在目前世界范围内的COVID-19大流行期间,有效筛查COVID-19病例对于减轻和阻止疾病的快速传播变得极其重要。在这篇文章中,我们考虑使用胸部X射线图像进行放射学检查,这是COVID-19病例检测的有效筛查方法之一。鉴于深度学习是图像分析的有效工具和框架,通过使用X射线图像训练深度学习模型来进行COVID-19病例检测的研究很多。尽管其中一些报告了良好的预测结果,他们提出的深度学习模型可能会过度拟合,高方差,和由噪声和有限数量的数据集引起的泛化错误。考虑到集成学习可以通过使用多个模型而不是单个模型进行预测来克服深度学习的缺点,我们提议EDL-COVID,采用深度学习和集成学习的集成深度学习模型。EDL-COVID模型是通过组合COVID-Net的多个快照模型生成的,它开创了一种开源的COVID-19病例检测方法,该方法具有深度神经网络处理的胸部X射线图像,通过采用提出的加权平均集成方法,该方法可以意识到深度学习模型在不同类别类型上的不同敏感性。实验结果表明,EDL-COVID为COVID-19病例检测提供了有希望的结果,准确率为95%,优于COVID-Net的93.3%。
    Effective screening of COVID-19 cases has been becoming extremely important to mitigate and stop the quick spread of the disease during the current period of COVID-19 pandemic worldwide. In this article, we consider radiology examination of using chest X-ray images, which is among the effective screening approaches for COVID-19 case detection. Given deep learning is an effective tool and framework for image analysis, there have been lots of studies for COVID-19 case detection by training deep learning models with X-ray images. Although some of them report good prediction results, their proposed deep learning models might suffer from overfitting, high variance, and generalization errors caused by noise and a limited number of datasets. Considering ensemble learning can overcome the shortcomings of deep learning by making predictions with multiple models instead of a single model, we propose EDL-COVID, an ensemble deep learning model employing deep learning and ensemble learning. The EDL-COVID model is generated by combining multiple snapshot models of COVID-Net, which has pioneered in an open-sourced COVID-19 case detection method with deep neural network processed chest X-ray images, by employing a proposed weighted averaging ensembling method that is aware of different sensitivities of deep learning models on different classes types. Experimental results show that EDL-COVID offers promising results for COVID-19 case detection with an accuracy of 95%, better than COVID-Net of 93.3%.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:准确估计手术时间是一项重要的手术室效率指标。脊柱手术中的当前预测技术包括不太复杂的方法,例如经典的多变量统计模型。机器学习方法已用于预测结果,例如停留时间和恢复正常工作的时间。但没有集中在案件的持续时间。
    目标:这4年的主要目标,单一学术中心,回顾性研究是使用集成学习方法,该方法可能会提高脊柱手术预定病例持续时间的准确性。主要结果指标是病例持续时间。
    方法:我们将使用手术和患者特征的机器学习模型与我们的机构方法进行了比较,根据需要使用历史平均值和外科医生调整。我们实施了多元线性回归,随机森林,装袋,和XGBoost(极限梯度提升),并计算平均R2,均方根误差(RMSE),解释方差,和使用k折交叉验证的平均绝对误差(MAE)。然后,我们使用SHAP(Shapley加法解释)解释器模型来确定特征重要性。
    结果:共纳入3189例接受脊柱手术的患者。机构当前预测病例次数的方法与实际次数的确定系数非常差(R2=0.213)。在k折交叉验证中,线性回归模型的解释方差得分为0.345,R2为0.34,RMSE为162.84分钟,MAE为127.22分钟。在所有型号中,XGBoost回归函数表现最好,解释方差分数为0.778,R2为0.770,RMSE为92.95分钟,MAE为44.31分钟。基于XGBoost回归的SHAP分析,身体质量指数,脊柱融合,外科手术,涉及的脊柱水平数量是对模型影响最大的特征。
    结论:使用基于集成学习的预测模型,特别是XGBoost回归,可以提高脊柱手术次数估计的准确性。
    BACKGROUND: Estimating surgical case duration accurately is an important operating room efficiency metric. Current predictive techniques in spine surgery include less sophisticated approaches such as classical multivariable statistical models. Machine learning approaches have been used to predict outcomes such as length of stay and time returning to normal work, but have not been focused on case duration.
    OBJECTIVE: The primary objective of this 4-year, single-academic-center, retrospective study was to use an ensemble learning approach that may improve the accuracy of scheduled case duration for spine surgery. The primary outcome measure was case duration.
    METHODS: We compared machine learning models using surgical and patient features to our institutional method, which used historic averages and surgeon adjustments as needed. We implemented multivariable linear regression, random forest, bagging, and XGBoost (Extreme Gradient Boosting) and calculated the average R2, root-mean-square error (RMSE), explained variance, and mean absolute error (MAE) using k-fold cross-validation. We then used the SHAP (Shapley Additive Explanations) explainer model to determine feature importance.
    RESULTS: A total of 3189 patients who underwent spine surgery were included. The institution\'s current method of predicting case times has a very poor coefficient of determination with actual times (R2=0.213). On k-fold cross-validation, the linear regression model had an explained variance score of 0.345, an R2 of 0.34, an RMSE of 162.84 minutes, and an MAE of 127.22 minutes. Among all models, the XGBoost regressor performed the best with an explained variance score of 0.778, an R2 of 0.770, an RMSE of 92.95 minutes, and an MAE of 44.31 minutes. Based on SHAP analysis of the XGBoost regression, body mass index, spinal fusions, surgical procedure, and number of spine levels involved were the features with the most impact on the model.
    CONCLUSIONS: Using ensemble learning-based predictive models, specifically XGBoost regression, can improve the accuracy of the estimation of spine surgery times.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    社交媒体平台每天都会产生大量的数据。数百万用户参与这些平台上发布的帖子。尽管这些平台实施了社会法规和协议,很难限制一些带有仇恨内容的令人反感的帖子。社交媒体平台上的自动仇恨言论检测是一项基本任务,尽管各种研究人员进行了多次尝试,但仍未得到有效解决。这是一项具有挑战性的任务,涉及从社交媒体帖子中识别仇恨内容。这些帖子可能会暴露仇恨,或者他们可能是主观的用户或社区。依靠人工检查延迟了流程,和可恶的内容可能会保持在线很长一段时间。当前最先进的处理仇恨言论的方法在同一数据集上进行测试时表现良好,但在交叉数据集上失败了。因此,我们提出了一种基于集成学习的自适应模型,用于自动仇恨语音检测,改进跨数据集泛化。所提出的用于仇恨言论检测的专家模型致力于克服可用注释数据集中存在的强烈用户偏见。我们在各种实验设置下进行实验,并证明所提出的模型在COVID-19和美国总统选举等最新问题上的有效性。特别是,在所有模型中,在跨数据集评估下观察到的性能损失最小。此外,在限制每个用户的最大推文数量的同时,我们的业绩不会下降。
    Social media platforms generate an enormous amount of data every day. Millions of users engage themselves with the posts circulated on these platforms. Despite the social regulations and protocols imposed by these platforms, it is difficult to restrict some objectionable posts carrying hateful content. Automatic hate speech detection on social media platforms is an essential task that has not been solved efficiently despite multiple attempts by various researchers. It is a challenging task that involves identifying hateful content from social media posts. These posts may reveal hate outrageously, or they may be subjective to the user or a community. Relying on manual inspection delays the process, and the hateful content may remain available online for a long time. The current state-of-the-art methods for tackling hate speech perform well when tested on the same dataset but fail miserably on cross-datasets. Therefore, we propose an ensemble learning-based adaptive model for automatic hate speech detection, improving the cross-dataset generalization. The proposed expert model for hate speech detection works towards overcoming the strong user-bias present in the available annotated datasets. We conduct our experiments under various experimental setups and demonstrate the proposed model\'s efficacy on the latest issues such as COVID-19 and US presidential elections. In particular, the loss in performance observed under cross-dataset evaluation is the least among all the models. Also, while restricting the maximum number of tweets per user, we incur no drop in performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    机器学习结合卫星图像时间序列可以快速、并可靠地实施,以绘制粮食安全所需的作物分布和生长监测图。然而,获取大量的实地调查样本进行分类器训练通常是耗时且昂贵的,这导致作物分布图的制作非常缓慢。为了克服这一挑战,我们从现有的历史作物数据层(CDL)中提出了一种集成学习方法,以根据时空样本选择规则自动创建多个样本。对2017-2019年吉林省农作物分布图Sentinel-2月合成图像进行马赛克分类。比较了四种机器学习算法对单个月和多个月时间序列的分类精度。结果表明,深度神经网络(DNN)表现最好,其次是随机森林(RF),然后决策树(DT),和支持向量机(SVM)最少。与其他月份相比,7月和8月有较高的分类精度,Kappa系数分别为0.78和0.79。与单相相比,随着时间序列的增长,Kappa系数逐渐增大,最早在8月达到0.94,然后增加并不明显,在整个生长周期中最高的是0.95。在映射过程中,不同长度的时间序列产生不同的分类结果。湿地类型被错误分类为水稻。在这种情况下,作者结合两种长度的时间序列来纠正错误分类的水稻类型。通过与现有产品和现场点的比较,大米的稠度最高,其次是玉米,而大豆的稠度最低。这表明本研究中生成的样本数据集和训练模型可以满足作物作图精度,同时降低了实地调查的成本。为了进一步研究,应考虑更多的年份和作物类型进行绘图和验证。
    Machine learning combined with satellite image time series can quickly, and reliably be implemented to map crop distribution and growth monitoring necessary for food security. However, obtaining a large number of field survey samples for classifier training is often time-consuming and costly, which results in the very slow production of crop distribution maps. To overcome this challenge, we propose an ensemble learning approach from the existing historical crop data layer (CDL) to automatically create multitudes of samples according to the rules of spatiotemporal sample selection. Sentinel-2 monthly composite images from 2017 to 2019 for crop distribution mapping in Jilin Province were mosaicked and classified. Classification accuracies of four machine learning algorithms for a single-month and multi-month time series were compared. The results show that deep neural network (DNN) performed the best, followed by random forest (RF), then decision tree (DT), and support vector machine (SVM) the least. Compared with other months, July and August have higher classification accuracy, and the kappa coefficients of 0.78 and 0.79, respectively. Compared with a single phase, the kappa coefficient gradually increases with the growth of the time series, reaching 0.94 in August at the earliest, and then the increase is not obvious, and the highest in the whole growth cycle is 0.95. During the mapping process, time series of different lengths produced different classification results. Wetland types were misclassified as rice. In such cases, authors combined time series of two lengths to correct the misclassified rice types. By comparing with existing products and field points, rice has the highest consistency, followed by corn, whereas soybeans have the least consistency. This shows that the generated sample data set and trained model in this research can meet the crop mapping accuracy and simultaneously reduce the cost of field surveys. For further research, more years and types of crops should be considered for mapping and validation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    骨折是急诊(ER)的主要原因之一;检测骨折的主要方法是使用X射线图像。X射线图像需要经验丰富的放射科医师对其进行分类;然而,有经验的放射科医生并不总是在急诊室。ER中的准确自动X射线图像分类器可以通过向急诊医生提供即时第二意见来帮助降低错误率。深度学习是人工智能的新兴趋势,其中可以训练自动分类器来对肌肉骨骼图像进行分类。图像增强技术已经证明了它们在提高深度学习模型性能方面的有用性。通常,在图像分类领域,增强技术是在训练网络期间使用的,而不是在测试阶段。测试时间增强(TTA)可以通过提供,计算成本可以忽略不计,对同一图像进行多次转换。在本文中,我们研究了TTA对MURA数据集图像分类性能的影响。与没有TTA的预测相比,评估了九种不同的增强技术以确定其性能。还评估了两种集成技术,多数投票和平均投票。根据我们的结果,TTA显著提高了分类性能,特别是对于分数低的模型。
    Bone fractures are one of the main causes to visit the emergency room (ER); the primary method to detect bone fractures is using X-Ray images. X-Ray images require an experienced radiologist to classify them; however, an experienced radiologist is not always available in the ER. An accurate automatic X-Ray image classifier in the ER can help reduce error rates by providing an instant second opinion to the emergency doctor. Deep learning is an emerging trend in artificial intelligence, where an automatic classifier can be trained to classify musculoskeletal images. Image augmentations techniques have proven their usefulness in increasing the deep learning model\'s performance. Usually, in the image classification domain, the augmentation techniques are used during training the network and not during the testing phase. Test time augmentation (TTA) can increase the model prediction by providing, with a negligible computational cost, several transformations for the same image. In this paper, we investigated the effect of TTA on image classification performance on the MURA dataset. Nine different augmentation techniques were evaluated to determine their performance compared to predictions without TTA. Two ensemble techniques were assessed as well, the majority vote and the average vote. Based on our results, TTA increased classification performance significantly, especially for models with a low score.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    风险预测模型被广泛用于为循证临床决策提供信息。然而,在存在不同预后的人群水平上(如SARS-CoV-2[严重急性呼吸道综合征冠状病毒2]大流行),很少有由单一队列建立的模型能够持续良好地表现.本研究旨在通过使用集成学习来协同文献中的预测模型来应对这一挑战。
    在这项研究中,我们选择并重新实施了7种来自不同队列的COVID-19(2019年冠状病毒病)预测模型,并使用了不同的实施技术.提出了一种新颖的集成学习框架来协同它们,以实现对个体患者的个性化预测。使用四个不同的国际队列(2个来自英国,2个来自中国;N=5394)来验证所有8个歧视模型,校准,和临床有用性。
    结果表明,个体预测模型在某些队列中表现良好,而在其他队列中表现不佳。相反,集成模型在所有量化歧视的指标上始终如一地实现了最佳性能,校准,和临床有用性。在来自两个国家的队列中观察到了绩效差异:所有模型在中国队列中都取得了更好的绩效。
    当单个模型从互补队列中学习时,协同模型有可能获得比任何单个模型更好的性能。结果表明,早期收集时,血液参数和生理测量值可能具有更好的预测能力,这还有待进一步研究证实。
    结合一组不同的个体预测模型,集成方法可以通过为个体患者选择最有能力的模型来协同稳健和性能良好的模型。
    Risk prediction models are widely used to inform evidence-based clinical decision making. However, few models developed from single cohorts can perform consistently well at population level where diverse prognoses exist (such as the SARS-CoV-2 [severe acute respiratory syndrome coronavirus 2] pandemic). This study aims at tackling this challenge by synergizing prediction models from the literature using ensemble learning.
    In this study, we selected and reimplemented 7 prediction models for COVID-19 (coronavirus disease 2019) that were derived from diverse cohorts and used different implementation techniques. A novel ensemble learning framework was proposed to synergize them for realizing personalized predictions for individual patients. Four diverse international cohorts (2 from the United Kingdom and 2 from China; N = 5394) were used to validate all 8 models on discrimination, calibration, and clinical usefulness.
    Results showed that individual prediction models could perform well on some cohorts while poorly on others. Conversely, the ensemble model achieved the best performances consistently on all metrics quantifying discrimination, calibration, and clinical usefulness. Performance disparities were observed in cohorts from the 2 countries: all models achieved better performances on the China cohorts.
    When individual models were learned from complementary cohorts, the synergized model had the potential to achieve better performances than any individual model. Results indicate that blood parameters and physiological measurements might have better predictive powers when collected early, which remains to be confirmed by further studies.
    Combining a diverse set of individual prediction models, the ensemble method can synergize a robust and well-performing model by choosing the most competent ones for individual patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    近年来,人工智能(AI)在医疗决策支持中的应用取得了重大进展。然而,许多基于人工智能的系统通常只向医生提供最终预测,而没有解释其潜在的决策过程。在有关致命疾病的情况下,比如乳腺癌,采用辅助预测的医生冒着很大的风险,因为一个糟糕的决定会对病人产生非常有害的后果。我们提出了一种辅助决策支持系统,该系统将集成学习与基于案例的推理相结合,以帮助医生提高乳腺癌复发预测的准确性。该系统提供了对其预测的基于案例的解释,这对医生来说更容易理解,帮助他们评估系统预测的可靠性,并做出相应的决策。我们在针对乳腺癌复发预测的案例研究中的应用和评估表明,所提出的系统不仅提供了合理准确的预测,而且还受到肿瘤学家的好评。
    Significant progress has been achieved in recent years in the application of artificial intelligence (AI) for medical decision support. However, many AI-based systems often only provide a final prediction to the doctor without an explanation of its underlying decision-making process. In scenarios concerning deadly diseases, such as breast cancer, a doctor adopting an auxiliary prediction is taking big risks, as a bad decision can have very harmful consequences for the patient. We propose an auxiliary decision support system that combines ensemble learning with case-based reasoning to help doctors improve the accuracy of breast cancer recurrence prediction. The system provides a case-based interpretation of its prediction, which is easier for doctors to understand, helping them assess the reliability of the system\'s prediction and make their decisions accordingly. Our application and evaluation in a case study focusing on breast cancer recurrence prediction shows that the proposed system not only provides reasonably accurate predictions but is also well-received by oncologists.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号