machine learning model

机器学习模型
  • 文章类型: Journal Article
    背景:术前评估很重要,我们的研究探索了机器学习方法在麻醉风险分类和评估各种因素贡献中的应用。为了在模型训练期间最小化混杂变量的影响,我们使用了生理状态和年龄相似的同质组,他们接受了类似的盆腔器官相关手术,但不涉及恶性肿瘤.
    目的:2017年1月1日至2021年12月31日期间进行妊娠或妇科手术的育龄妇女(年龄=20-50岁)的数据来自国立台湾大学医院综合医学数据库。
    方法:我们首先进行了探索性分析并选择了关键特征。然后,我们进行了数据预处理,以获取与术前检查相关的特征。为了进一步提高预测性能,我们采用对数似然比算法生成合并症模式。最后,我们将处理后的特征输入到光梯度增强机(LightGBM)模型中进行训练和后续预测。
    结果:共纳入10,892例患者。在这个数据集中,9893名患者被归类为低麻醉风险(美国麻醉医师协会身体状况评分1-2),999例患者被归类为麻醉风险高(美国麻醉医师协会身体状况评分>2)。LightGBM模型的接收器工作特性曲线下的面积为90.25。
    结论:通过结合合并症信息和临床实验室数据,我们基于LightGBM模型的方法为麻醉风险分类提供了更准确的预测.
    背景:本研究已在国立台湾大学医院研究伦理委员会注册,试验编号为202204010RINB。
    BACKGROUND: Preoperative evaluation is important, and this study explored the application of machine learning methods for anesthetic risk classification and the evaluation of the contributions of various factors. To minimize the effects of confounding variables during model training, we used a homogenous group with similar physiological states and ages undergoing similar pelvic organ-related procedures not involving malignancies.
    OBJECTIVE: Data on women of reproductive age (age 20-50 years) who underwent gestational or gynecological surgery between January 1, 2017, and December 31, 2021, were obtained from the National Taiwan University Hospital Integrated Medical Database.
    METHODS: We first performed an exploratory analysis and selected key features. We then performed data preprocessing to acquire relevant features related to preoperative examination. To further enhance predictive performance, we used the log-likelihood ratio algorithm to generate comorbidity patterns. Finally, we input the processed features into the light gradient boosting machine (LightGBM) model for training and subsequent prediction.
    RESULTS: A total of 10,892 patients were included. Within this data set, 9893 patients were classified as having low anesthetic risk (American Society of Anesthesiologists physical status score of 1-2), and 999 patients were classified as having high anesthetic risk (American Society of Anesthesiologists physical status score of >2). The area under the receiver operating characteristic curve of the proposed model was 0.6831.
    CONCLUSIONS: By combining comorbidity information and clinical laboratory data, our methodology based on the LightGBM model provides more accurate predictions for anesthetic risk classification.
    BACKGROUND: Research Ethics Committee of the National Taiwan University Hospital 202204010RINB; https://www.ntuh.gov.tw/RECO/Index.action.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:术后谵妄是一种神经心理学综合征,通常发生在手术患者中。它的发作可导致住院时间延长以及发病率和死亡率增加。因此,重要的是要及时识别其迹象。本研究旨在使用广泛的人群数据开发和验证术后谵妄的机器学习预测模型。
    方法:回顾性观察研究。
    方法:日本诊断程序组合住院数据。数据用于内部(2016.4-2018.12)和时间验证(2019.01-2019.10)。
    方法:年龄≥65岁的患者接受手术全身麻醉。
    方法:主要结果是术后谵妄,这被定义为在手术日期后需要新处方的抗精神病药物或分配相应的保险索赔代码的情况。我们使用所选择的接收器工作特征曲线(AUC)值下的最佳面积通过10倍交叉验证来训练和调整最佳机器学习模型。在时间验证中,我们测量了模型的性能。
    结果:分析包括557,990例患者。光梯度增强机器模型显示比其他模型更高的AUC值(0.826[95%置信区间(CI):0.822-0.829])。关于性能,模型的召回值为0.124(95%CI:0.119-0.129),精确度值为0.659(95%CI:0.641-0.677]).这种表现在时间验证中得到了维持(AUC,0.815[95%CI:0.811-0.818])。在0.80的灵敏度下,该模型的特异性为0.672(95%CI:0.670-0.674]),阴性预测值为0.975(95%CI:0.974-0.975),阳性预测值为0.176(95%CI:0.176-0.179)。
    结论:使用广泛的诊断程序组合数据,我们成功创建并验证了预测术后谵妄的机器学习模型.该模型可能有助于预测术后谵妄。
    OBJECTIVE: Postoperative delirium is a neuropsychological syndrome that typically occurs in surgical patients. Its onset can lead to prolonged hospitalization as well as increased morbidity and mortality. Therefore, it is important to promptly identify its signs. This study aimed to develop and validate a machine learning predictive model for postoperative delirium using extensive population data.
    METHODS: Retrospective observational study.
    METHODS: Japanese Diagnosis Procedure Combination inpatient data. Data were used for internal (2016.4-2018.12) and temporal validation (2019.01-2019.10).
    METHODS: Patients aged ≥65 years who underwent general anesthesia for surgical procedure.
    METHODS: The primary outcome was postoperative delirium, which was defined as a condition requiring newly prescribed antipsychotic drugs or assignment of the corresponding insurance claim code after the date of surgery. We trained and tuned the optimal machine-learning model through 10-fold cross-validation using the selected optimal area under the receiver operating characteristic curve (AUC) value. In the temporal validation, we measured the performance of our model.
    RESULTS: The analysis included 557,990 patients. The light-gradient boosting machine models showed a higher AUC value (0.826 [95% confidence interval (CI): 0.822-0.829]) than the other models. Regarding performance, the model had a recall value of 0.124 (95% CI: 0.119-0.129) and precision value of 0.659 (95% CI: 0.641-0.677]). This performance was sustained in the temporal validation (AUC, 0.815 [95% CI: 0.811-0.818]). At a sensitivity of 0.80, the model achieved a specificity of 0.672 (95% CI: 0.670-0.674]), a negative predictive value of 0.975 (95% CI: 0.974-0.975), and a positive predictive value of 0.176 (95% CI: 0.176-0.179).
    CONCLUSIONS: Using extensive Diagnostic Procedure Combination data, we successfully created and validated a machine learning model for predicting postoperative delirium. This model may facilitate prediction of postoperative delirium.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:呼吸道合胞病毒(RSV)影响儿童,导致严重感染,尤其是高危人群。鉴于RSV的季节性和快速隔离感染者的重要性,迫切需要更有效的诊断方法来加快这一进程。
    目的:本研究旨在研究机器学习模型的性能,该模型利用症状发作的时间多样性来检测RSV感染并阐明其辨别能力。
    方法:本研究在日本的儿科和急诊门诊进行。我们开发了一种检测模型,该模型基于使用结构化电子模板获得的患者报告的症状信息来远程确认RSV感染,该模板结合了熟练儿科医生的差异点。使用接受RSV快速抗原测试的4174名年龄≤24个月的患者的数据开发了基于极端梯度增强的机器学习模型。这些患者于2009年1月1日至2015年12月31日期间访问了横滨市立医院的儿科或急诊科。主要结果是机器学习模型对RSV感染的诊断准确性。通过快速抗原测试确定,使用接收器工作特性曲线下的面积进行测量。通过计算自首次症状发作以来经过的天数和基于合理敏感性和特异性阈值的排除率来评估临床疗效。
    结果:我们的模型显示受试者工作特征曲线下面积为0.811(95%CI0.784-0.833),校准良好,患者发病3天内为0.746(95%CI0.694-0.794)。它准确地捕获了症状的时间演变;基于与快速抗原测试相当的调整阈值,我们的模型预测,整个队列中6.9%(95%CI5.4%-8.5%)的患者为阳性,68.7%(95%CI65.4%-71.9%)为阴性.我们的模型可以消除对大约四分之三的所有患者进行额外测试的需要。
    结论:我们的模型可能有助于在门诊环境中立即检测RSV感染,潜在的,在家庭环境中。这种方法可以简化诊断过程,减少儿童侵入性测试引起的不适,并允许在家中快速实施适当的治疗和隔离。这些发现强调了机器学习在早期检测RSV感染中增强临床决策的潜力。
    BACKGROUND: Respiratory syncytial virus (RSV) affects children, causing serious infections, particularly in high-risk groups. Given the seasonality of RSV and the importance of rapid isolation of infected individuals, there is an urgent need for more efficient diagnostic methods to expedite this process.
    OBJECTIVE: This study aimed to investigate the performance of a machine learning model that leverages the temporal diversity of symptom onset for detecting RSV infections and elucidate its discriminatory ability.
    METHODS: The study was conducted in pediatric and emergency outpatient settings in Japan. We developed a detection model that remotely confirms RSV infection based on patient-reported symptom information obtained using a structured electronic template incorporating the differential points of skilled pediatricians. An extreme gradient boosting-based machine learning model was developed using the data of 4174 patients aged ≤24 months who underwent RSV rapid antigen testing. These patients visited either the pediatric or emergency department of Yokohama City Municipal Hospital between January 1, 2009, and December 31, 2015. The primary outcome was the diagnostic accuracy of the machine learning model for RSV infection, as determined by rapid antigen testing, measured using the area under the receiver operating characteristic curve. The clinical efficacy was evaluated by calculating the discriminative performance based on the number of days elapsed since the onset of the first symptom and exclusion rates based on thresholds of reasonable sensitivity and specificity.
    RESULTS: Our model demonstrated an area under the receiver operating characteristic curve of 0.811 (95% CI 0.784-0.833) with good calibration and 0.746 (95% CI 0.694-0.794) for patients within 3 days of onset. It accurately captured the temporal evolution of symptoms; based on adjusted thresholds equivalent to those of a rapid antigen test, our model predicted that 6.9% (95% CI 5.4%-8.5%) of patients in the entire cohort would be positive and 68.7% (95% CI 65.4%-71.9%) would be negative. Our model could eliminate the need for additional testing in approximately three-quarters of all patients.
    CONCLUSIONS: Our model may facilitate the immediate detection of RSV infection in outpatient settings and, potentially, in home environments. This approach could streamline the diagnostic process, reduce discomfort caused by invasive tests in children, and allow rapid implementation of appropriate treatments and isolation at home. The findings underscore the potential of machine learning in augmenting clinical decision-making in the early detection of RSV infection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:对心力衰竭(HF)患者的护理会给医疗保健系统带来巨大的负担,其中一个突出的挑战是初次出院后30天内再入院率的升高。临床专业人员在最佳出院时机的决策过程中面临高度的不确定性和主观性。不必要的住院会产生成本并给患者带来压力,并可能对护理结果产生影响。最近的研究旨在通过开发和测试风险评估工具和预测模型来确定有再次入院风险的患者,从而减轻不确定性。经常使用新的方法,如机器学习(ML)。
    目的:本研究旨在探讨开发的临床决策支持(CDS)工具如何在心力衰竭患者出院的特定背景下改变医疗保健专业人员的决策过程。如果是这样,在哪些方面。此外,这样做的目的是捕捉卫生保健从业人员参与系统输出的经验,以分析可用性方面,并获得与未来实施相关的见解。
    方法:将与卫生保健专业人员一起在瑞典南部地区对HF患者的情况进行随机交叉评估的准实验设计。总的来说,12名医生和护士将被随机分为对照组和测试组。应向小组提供20种有目的采样的患者情况。临床医生将被要求对患者的下一步行动做出决定。测试组将提供10种方案,其中包含来自电子健康记录的患者数据以及基于ML的CDS模型对同一患者再次入院的风险水平的结果。对照组将有10个其他场景,没有CDS模型输出,并且仅包含来自电子病历的患者数据。这些组将在接下来的10种情况下切换角色。本研究将通过访谈和观察收集数据。关键成果衡量标准是决策一致性,决策质量,工作效率,使用CDS模型的感知效益,可靠性,有效性,以及对CDS模型结果的信心,常规工作流程中的可积性,易用性,并打算使用。这项研究将与CambioHealthcareSystems合作进行。
    结果:该项目是应用智能系统研究中心健康研究概况的一部分,由知识基金会资助(2021-2028)。本研究的伦理批准由瑞典伦理审查机构(2022-07287-02)批准。临床医生的招募过程和患者方案选择将于2023年9月开始,持续到2024年3月。
    结论:该研究方案将有助于未来形成性评估研究的发展,以与临床专业人员一起测试ML模型。
    PRR1-10.2196/52744。
    BACKGROUND: Care for patients with heart failure (HF) causes a substantial load on health care systems where a prominent challenge is the elevated rate of readmissions within 30 days following initial discharge. Clinical professionals face high levels of uncertainty and subjectivity in the decision-making process on the optimal timing of discharge. Unwanted hospital stays generate costs and cause stress to patients and potentially have an impact on care outcomes. Recent studies have aimed to mitigate the uncertainty by developing and testing risk assessment tools and predictive models to identify patients at risk of readmission, often using novel methods such as machine learning (ML).
    OBJECTIVE: This study aims to investigate how a developed clinical decision support (CDS) tool alters the decision-making processes of health care professionals in the specific context of discharging patients with HF, and if so, in which ways. Additionally, the aim is to capture the experiences of health care practitioners as they engage with the system\'s outputs to analyze usability aspects and obtain insights related to future implementation.
    METHODS: A quasi-experimental design with randomized crossover assessment will be conducted with health care professionals on HF patients\' scenarios in a region located in the South of Sweden. In total, 12 physicians and nurses will be randomized into control and test groups. The groups shall be provided with 20 scenarios of purposefully sampled patients. The clinicians will be asked to take decisions on the next action regarding a patient. The test group will be provided with the 10 scenarios containing patient data from electronic health records and an outcome from an ML-based CDS model on the risk level for readmission of the same patients. The control group will have 10 other scenarios without the CDS model output and containing only the patients\' data from electronic medical records. The groups will switch roles for the next 10 scenarios. This study will collect data through interviews and observations. The key outcome measures are decision consistency, decision quality, work efficiency, perceived benefits of using the CDS model, reliability, validity, and confidence in the CDS model outcome, integrability in the routine workflow, ease of use, and intention to use. This study will be carried out in collaboration with Cambio Healthcare Systems.
    RESULTS: The project is part of the Center for Applied Intelligent Systems Research Health research profile, funded by the Knowledge Foundation (2021-2028). Ethical approval for this study was granted by the Swedish ethical review authority (2022-07287-02). The recruitment process of the clinicians and the patient scenario selection will start in September 2023 and last till March 2024.
    CONCLUSIONS: This study protocol will contribute to the development of future formative evaluation studies to test ML models with clinical professionals.
    UNASSIGNED: PRR1-10.2196/52744.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    不断上升的二氧化碳(CO2)排放水平是全球变暖的主要驱动因素,解决这些问题至关重要。及时准确的预测,以及有效控制二氧化碳排放,是指导缓解措施的关键。本文旨在选择中国近实时每日CO2排放量的最佳预测模型。预测模型基于1月1日的单变量每日时间序列数据,2020年9月30日,2022年。提出了六种模型,包括三个统计模型:灰色预测(GM(1,1)),自回归积分移动平均(ARIMA),和具有外生因素的季节性自回归综合移动平均(SARIMAX),和三种机器学习模型:人工神经网络(ANN),随机森林(RF),和长短期记忆(LSTM)。这六个模型的性能使用五个标准进行评估:均方误差(MSE),均方根误差(RMSE),平均绝对误差(MAE),平均绝对百分比误差(MAPE),和决定系数(R2)。我们的发现表明,在所有五个标准中,这三个机器学习模型的表现始终优于三个统计模型。其中,LSTM模型在每日二氧化碳排放预测方面表现出众,拥有令人印象深刻的低MSE值3.5179e-04,0.0187的RMSE值,0.0140的MAE值,14.8291%的MAPE值,和0.9844的高R2值。这强调了LSTM模型在捕获和预测复杂排放模式方面的鲁棒性,根据提供的每日时间序列数据,将其定位为最适合近实时每日CO2排放预测的选项。此外,我们的研究结果为排放预测提供了有价值的见解,为决策者和利益相关者提供数据驱动的决策。LSTM模型提供的准确和及时的预测可以帮助制定有效的策略来减少碳排放,为更可持续的未来做出贡献。此外,这项研究的结果可以增强我们对二氧化碳排放动态的理解,导致旨在减少碳排放的更知情的环境政策和行动。
    The escalating levels of carbon dioxide (CO2) emissions represent the primary driver of global warming, and addressing them is of paramount importance. Timely and accurate prediction, as well as effective control of CO2 emissions, are pivotal for guiding mitigation measures. This paper aims to select the best prediction model for near-real-time daily CO2 emissions in China. The prediction models are based on univariate daily time-series data spanning January 1st, 2020, to September 30st, 2022. Six models are proposed, including three statistical models: grey prediction (GM(1,1)), autoregressive integrated moving average (ARIMA), and seasonal autoregressive integrated moving average with exogenous factors (SARIMAX), and three machine learning models: artificial neural network (ANN), random forest (RF), and long short-term memory (LSTM). The performance of these six models is evaluated using five criteria: mean squared error (MSE), root-mean squared error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and coefficient of determination (R2). Our findings reveal that the three machine learning models consistently outperform the three statistical models across all five criteria. Among them, the LSTM model demonstrates exceptional performance for daily CO2 emission prediction, boasting an impressively low MSE value of 3.5179e-04, an RMSE value of 0.0187, an MAE value of 0.0140, an MAPE value of 14.8291%, and a high R2 value of 0.9844. This underscores the robustness of the LSTM model in capturing and predicting complex emission patterns, positioning it as the most suitable option for near-real-time daily CO2 emission prediction based on the provided daily time series data. Moreover, our study\'s results provide valuable insights into emissions forecasting, enabling data-driven decision-making for policymakers and stakeholders. The accurate and timely predictions offered by the LSTM model can aid in the formulation of effective strategies to mitigate carbon emissions, contributing to a more sustainable future. Furthermore, the findings of this study can enhance our understanding of the dynamics of CO2 emissions, leading to more informed environmental policies and actions aimed at reducing carbon emissions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    未经证实:结直肠癌(CRC)是一组异质性的恶性肿瘤,具有不同的临床特征。这些特征与静脉血栓栓塞症(VTE)的关联尚待阐明。机器学习(ML)模型非常适合改善CRC中的VTE预测,因为它们能够接收大量特征的特征并理解数据集以获得隐含的相关性。
    UNASSIGNED:从2019年8月至2022年8月的4,914例结直肠癌患者中提取了数据,并纳入了1,191例原发肿瘤部位接受手术治疗的患者。分析的变量包括患者水平因素,癌症水平的因素,和实验室测试结果。使用十倍交叉验证方法对数据集的30%进行模型训练,并使用总数据集进行模型验证。主要结果是术后30天发生VTE。六种ML算法,包括逻辑回归(LR),随机森林(RF),极端梯度提升(XGBoost),加权支持向量机(SVM),多层感知(MLP)网络,和长短期记忆(LSTM)网络,用于模型拟合。模型评价基于六个指标,包括受试者工作特征曲线-曲线下面积(ROC-AUC),灵敏度(SEN),特异性(SPE),阳性预测值(PPV),负预测值(NPV),和Brier得分.使用两个先前的VTE模型(Caprini和Khorana)作为基准。
    UNASSIGNED:术后VTE发生率为10.8%。前十大重要预测因素包括淋巴结转移,C反应蛋白,肿瘤分级,贫血,原发肿瘤位置,性别,年龄,D-二聚体水平,凝血酶时间,和肿瘤分期。在我们的结果中,XGBoost模型表现出最好的性能,ROC-AUC为0.990,SEN为96.9%,训练数据集中的SPE为96.1%,ROC-AUC为0.908,SEN为77.5%,验证数据集中的SPE为93.7%。所有ML模型都优于以前开发的模型(Caprini和Khorana)。
    UNASSIGNED:本研究使用六种ML算法开发了术后VTE预测模型。XGBoostVTE模型可能为临床VTE预防决策提供补充工具,所提出的风险因素可以为CRC患者的VTE风险分层提供一些启示。
    UNASSIGNED: Colorectal cancer (CRC) is a heterogeneous group of malignancies distinguished by distinct clinical features. The association of these features with venous thromboembolism (VTE) is yet to be clarified. Machine learning (ML) models are well suited to improve VTE prediction in CRC due to their ability to receive the characteristics of a large number of features and understand the dataset to obtain implicit correlations.
    UNASSIGNED: Data were extracted from 4,914 patients with colorectal cancer between August 2019 and August 2022, and 1,191 patients who underwent surgery on the primary tumor site with curative intent were included. The variables analyzed included patient-level factors, cancer-level factors, and laboratory test results. Model training was conducted on 30% of the dataset using a ten-fold cross-validation method and model validation was performed using the total dataset. The primary outcome was VTE occurrence in postoperative 30 days. Six ML algorithms, including logistic regression (LR), random forest (RF), extreme gradient boosting (XGBoost), weighted support vector machine (SVM), a multilayer perception (MLP) network, and a long short-term memory (LSTM) network, were applied for model fitting. The model evaluation was based on six indicators, including receiver operating characteristic curve-area under the curve (ROC-AUC), sensitivity (SEN), specificity (SPE), positive predictive value (PPV), negative predictive value (NPV), and Brier score. Two previous VTE models (Caprini and Khorana) were used as the benchmarks.
    UNASSIGNED: The incidence of postoperative VTE was 10.8%. The top ten significant predictors included lymph node metastasis, C-reactive protein, tumor grade, anemia, primary tumor location, sex, age, D-dimer level, thrombin time, and tumor stage. In our results, the XGBoost model showed the best performance, with a ROC-AUC of 0.990, a SEN of 96.9%, a SPE of 96.1% in training dataset and a ROC-AUC of 0.908, a SEN of 77.5%, a SPE of 93.7% in validation dataset. All ML models outperformed the previously developed models (Caprini and Khorana).
    UNASSIGNED: This study developed postoperative VTE predictive models using six ML algorithms. The XGBoost VTE model might supply a complementary tool for clinical VTE prophylaxis decision-making and the proposed risk factors could shed some light on VTE risk stratification in CRC patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    滑坡是世界上最具破坏性的自然灾害之一。滑坡灾害的准确建模和预测已被用作滑坡灾害防治的重要工具。目的探讨耦合模型在滑坡敏感性评价中的应用。本文以威信县为研究对象。首先,根据构建的滑坡目录数据库,研究区域有345次滑坡。选择了12个环境因素,包括地形(海拔,斜坡,斜坡方向,平面曲率,和轮廓曲率),地质结构(地层岩性和与断裂带的距离),气象水文学(年平均降雨量和与河流的距离),和土地覆盖(NDVI,土地利用,和到道路的距离)。然后,单一模型(逻辑回归,支持向量机,和随机森林)和耦合模型(IV-LR,IV-SVM,IV-RF,FR-LR,FR-SVM,和FR-RF)基于信息量和频率比构建,并对模型的准确性和可靠性进行了比较分析。最后,讨论了最优模型下环境因素对滑坡敏感性的影响。结果表明,9种模型的预测精度在75.2%(LR模型)到94.9%(FR-RF模型)之间,耦合精度普遍高于单一模型。因此,耦合模型在一定程度上提高了模型的预测精度。FR-RF耦合模型的精度最高。在最优模型FR-RF下,距离道路,NDVI,土地利用是最重要的三个环境因素,占20.15%,13.37%,和9.69%,分别。因此,威信县有必要加强对道路附近山区和植被稀疏地区的监测,以防止人类活动和降雨造成滑坡。
    A landslide is one of the most destructive natural disasters in the world. The accurate modeling and prediction of landslide hazards have been used as some of the vital tools for landslide disaster prevention and control. The purpose of this study was to explore the application of coupling models in landslide susceptibility evaluation. This paper used Weixin County as the research object. First, according to the landslide catalog database constructed, there were 345 landslides in the study area. Twelve environmental factors were selected, including terrain (elevation, slope, slope direction, plane curvature, and profile curvature), geological structure (stratigraphic lithology and distance from fault zone), meteorological hydrology (average annual rainfall and distance to rivers), and land cover (NDVI, land use, and distance to roads). Then, a single model (logistic regression, support vector machine, and random forest) and a coupled model (IV-LR, IV-SVM, IV-RF, FR-LR, FR-SVM, and FR-RF) based on information volume and frequency ratio were constructed, and the accuracy and reliability of the models were compared and analyzed. Finally, the influence of environmental factors on landslide susceptibility under the optimal model was discussed. The results showed that the prediction accuracy of the nine models ranged from 75.2% (LR model) to 94.9% (FR-RF model), and the coupling accuracy was generally higher than that of the single model. Therefore, the coupling model could improve the prediction accuracy of the model to a certain extent. The FR-RF coupling model had the highest accuracy. Under the optimal model FR-RF, distance from the road, NDVI, and land use were the three most important environmental factors, ac-counting for 20.15%, 13.37%, and 9.69%, respectively. Therefore, it was necessary for Weixin County to strengthen the monitoring of mountains near roads and areas with sparse vegetation to prevent landslides caused by human activities and rainfall.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    红树林是重要的碳汇,因为它可以通过隔离大气中的二氧化碳来实现气候调节。然而,在中国,50%的红树林物种面临灭绝的威胁,植被中的碳储量也下降了53.1%。这里,我们将遥感数据与随机森林相结合,支持向量机,和XGBoost分析了1986年至2019年中国大陆的红树林。我们发现水产养殖对红树林和预测误差有至关重要的影响。未来预测表明,不同城市红树林的变化范围为-5.09E06m2至2.30E06m2,土壤碳(C)储量为\“-1.90E05Mg〜8.57E04Mg\”。为了保护红树林,迫切需要探索水产养殖与红树林之间的平衡,并关注水产养殖的可持续转型。这样,红树林可以充分发挥固碳作用,为我国实现双碳目标做出贡献。
    Mangrove is an important carbon sink, as it can achieve climate regulation by sequestering carbon dioxide in the atmosphere. However, 50 % of mangrove species are threatened with extinction in China, and the carbon stocks in vegetation has also dropped by 53.1 %. Here, we couple remote sensing data with Random Forests, Support Vector Machines, and XGBoost to analyse mangroves in mainland China from 1986 to 2019. We find that aquaculture has crucial impacts on mangroves and prediction error. Future predictions indicate that the changes of mangroves in different cities range from -5.09E+06 m2 to 2.30E+06 m2, and soil carbon(C) stocks is \"-1.90E+05 Mg ~ 8.57E+04 Mg\". To protect mangroves, exploring the balance between aquaculture and mangroves and paying attention to the sustainable transformation of aquaculture are urgently required. In this way, mangroves can fully play the role of carbon sequestration and contribute to China\'s dual carbon goals.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    脊索瘤和软骨肉瘤具有共同的影像学特征,但在临床上却截然不同。术前区分这些肿瘤的放射学机器学习模型将有助于计划手术。MR图像来自2012年9月至2020年2月在东京大学医院治疗的57例连续脊索瘤(N=32)或软骨肉瘤(N=25)患者。分析了术前T1加权图像和GdT1增强(GdT1)和T2加权图像。来自前47个案例的数据集用于模型创建,随后的10例病例用于验证。特征提取是半自动执行的,每个图像序列获得2438个特征。创建了具有逻辑回归和支持向量机的机器学习模型。具有最高准确度的模型在逻辑回归中结合了从GdT1提取的七个特征。在验证数据集中,曲线下平均面积为0.93±0.06,准确度为0.90(9/10)。由20名董事会认证的神经外科医生评估了相同的验证数据集。诊断准确性范围为0.50至0.80(中位数0.60,95%置信区间0.60±0.06%),低于机器学习模型(p=0.03),虽然有一些局限性,例如过度拟合的风险和缺乏真正独立的最终验证的校外队列。总之,我们创建了一种新的基于MRI的机器学习模型,以区分颅底脊索瘤和软骨肉瘤和多参数特征.
    Chordoma and chondrosarcoma share common radiographic characteristics yet are distinct clinically. A radiomic machine learning model differentiating these tumors preoperatively would help plan surgery. MR images were acquired from 57 consecutive patients with chordoma (N = 32) or chondrosarcoma (N = 25) treated at the University of Tokyo Hospital between September 2012 and February 2020. Preoperative T1-weighted images with gadolinium enhancement (GdT1) and T2-weighted images were analyzed. Datasets from the first 47 cases were used for model creation, and those from the subsequent 10 cases were used for validation. Feature extraction was performed semi-automatically, and 2438 features were obtained per image sequence. Machine learning models with logistic regression and a support vector machine were created. The model with the highest accuracy incorporated seven features extracted from GdT1 in the logistic regression. The average area under the curve was 0.93 ± 0.06, and accuracy was 0.90 (9/10) in the validation dataset. The same validation dataset was assessed by 20 board-certified neurosurgeons. Diagnostic accuracy ranged from 0.50 to 0.80 (median 0.60, 95% confidence interval 0.60 ± 0.06%), which was inferior to that of the machine learning model (p = 0.03), although there are some limitations, such as the risk of overfitting and the lack of an extramural cohort for truly independent final validation. In summary, we created a novel MRI-based machine learning model to differentiate skull base chordoma and chondrosarcoma from multiparametric signatures.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    对早期胃癌(EGC)内镜切除术(ER)后非治愈性切除术患者淋巴结转移(LNM)的风险进行分层对于确定其他治疗策略和防止不必要的手术至关重要。因此,我们开发了一个机器学习(ML)模型,并验证了其在EGC患者LNM风险分层中的性能.我们招募了在2005年5月至2021年3月期间接受初次手术或ER后接受EGC额外手术的患者。此外,纳入了在2005年5月至2016年3月期间因EGC接受ER治疗且随访至少5年的患者.ML模型是基于开发集(70%)使用逻辑回归建立的,随机森林(RF),和支持向量机(SVM)在验证集中进行分析和评估(30%)。在验证集中,在4428例患者中的337例(7.6%)中发现了LNM。在所有患者中,在逻辑回归中,预测LNM风险的接受者工作特征下面积(AUROC)为0.86,0.85inRF,在SVM分析中,为0.86;在患有初始ER的患者中,Logistic回归预测LNM风险的AUROC为0.90,0.88inRF,在SVM分析中,为0.89。ML模型可以将LNM风险分层为非常低(<1%),低(<3%),中间(<7%),和高风险类别(≥7%),与实际的LNM率相当。我们证明了ML模型可用于识别LNM风险。然而,该工具需要在ER后非治愈性切除的EGC患者中进一步验证才能实际应用.
    Stratification of the risk of lymph node metastasis (LNM) in patients with non-curative resection after endoscopic resection (ER) for early gastric cancer (EGC) is crucial in determining additional treatment strategies and preventing unnecessary surgery. Hence, we developed a machine learning (ML) model and validated its performance for the stratification of LNM risk in patients with EGC. We enrolled patients who underwent primary surgery or additional surgery after ER for EGC between May 2005 and March 2021. Additionally, patients who underwent ER alone for EGC between May 2005 and March 2016 and were followed up for at least 5 years were included. The ML model was built based on a development set (70%) using logistic regression, random forest (RF), and support vector machine (SVM) analyses and assessed in a validation set (30%). In the validation set, LNM was found in 337 of 4428 patients (7.6%). Among the total patients, the area under the receiver operating characteristic (AUROC) for predicting LNM risk was 0.86 in the logistic regression, 0.85 in RF, and 0.86 in SVM analyses; in patients with initial ER, AUROC for predicting LNM risk was 0.90 in the logistic regression, 0.88 in RF, and 0.89 in SVM analyses. The ML model could stratify the LNM risk into very low (<1%), low (<3%), intermediate (<7%), and high (≥7%) risk categories, which was comparable with actual LNM rates. We demonstrate that the ML model can be used to identify LNM risk. However, this tool requires further validation in EGC patients with non-curative resection after ER for actual application.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号