目的:使用可解释的机器学习方法,基于Caprini量表确定泌尿外科住院患者静脉血栓栓塞症(VTE)的关键危险因素。
方法:根据病例医院Caprini量表获得泌尿科住院患者的VTE风险数据。根据数据,使用Boruta方法从Caprini量表的37个变量中进一步选择关键变量。此外,使用粗糙集(RS)方法生成与每个风险级别相对应的决策规则。最后,随机森林(RF),支持向量机(SVM),和反向传播人工神经网络(BPANN)验证了数据的准确性,并与RS方法进行了比较。
结果:筛选后,泌尿外科静脉血栓栓塞的关键危险因素是“(C1)年龄,\“\”(C2)计划的小手术,\“\”(C3)肥胖(BMI>25),\"\"(C8)静脉曲张,\“\”(C9)脓毒症(<1个月),“(C10)”严重肺部疾病,包括。肺炎(<1个月)“(C11)COPD,\“\”(C16)其他风险,\“\”(C18)大手术(>45分钟),\“\”(C19)腹腔镜手术(>45分钟),\“\”(C20)患者卧床(>72小时),\“\”(C18)恶性肿瘤(现在或以前),\"\"(C23)中心静脉通路,“”(C31)DVT/PE的历史,\“\”(C32)其他先天性或获得性血栓形成倾向,“和”(C34)中风(<1个月。“根据RS方法得到的不同风险等级的决策规则,“(C1)年龄,\"\"(C18)大手术(>45分钟),“和”(C21)恶性肿瘤(现在或以前)“是影响中高风险水平的主要因素,并根据这三个因素提出了一些预防VTE的建议。RS的平均准确度,射频,SVM,BPANN模型为79.5%,87.9%,92.6%,97.2%,分别。此外,BPANN的准确度最高,召回,F1分数,和精度。
结论:与其他三种常见的机器学习模型相比,RS模型的准确性较差。然而,RS模型提供了很强的可解释性,并允许识别影响泌尿外科VTE高风险评估的高危因素和决策规则.这种透明度对于临床医生在风险评估过程中非常重要。
OBJECTIVE: To identify the key risk factors for venous thromboembolism (VTE) in urological inpatients based on the Caprini scale using an interpretable machine learning method.
METHODS: VTE risk data of urological inpatients were obtained based on the Caprini scale in the case hospital. Based on the data, the
Boruta method was used to further select the key variables from the 37 variables in the Caprini scale. Furthermore, decision rules corresponding to each risk level were generated using the rough set (RS) method. Finally, random forest (RF), support vector machine (SVM), and backpropagation artificial neural network (BPANN) were used to verify the data accuracy and were compared with the RS method.
RESULTS: Following the screening, the key risk factors for VTE in urology were \"(C1) Age,\" \"(C2) Minor Surgery planned,\" \"(C3) Obesity (BMI > 25),\" \"(C8) Varicose veins,\" \"(C9) Sepsis (< 1 month),\" (C10) \"Serious lung disease incl. pneumonia (< 1month) \" (C11) COPD,\" \"(C16) Other risk,\" \"(C18) Major surgery (> 45 min),\" \"(C19) Laparoscopic surgery (> 45 min),\" \"(C20) Patient confined to bed (> 72 h),\" \"(C18) Malignancy (present or previous),\" \"(C23) Central venous access,\" \"(C31) History of DVT/PE,\" \"(C32) Other congenital or acquired thrombophilia,\" and \"(C34) Stroke (< 1 month.\" According to the decision rules of different risk levels obtained using the RS method, \"(C1) Age,\" \"(C18) Major surgery (> 45 minutes),\" and \"(C21) Malignancy (present or previous)\" were the main factors influencing mid- and high-risk levels, and some suggestions on VTE prevention were indicated based on these three factors. The average accuracies of the RS, RF, SVM, and BPANN models were 79.5%, 87.9%, 92.6%, and 97.2%, respectively. In addition, BPANN had the highest accuracy, recall, F1-score, and precision.
CONCLUSIONS: The RS model achieved poorer accuracy than the other three common machine learning models. However, the RS model provides strong interpretability and allows for the identification of high-risk factors and decision rules influencing high-risk assessments of VTE in urology. This transparency is very important for clinicians in the risk assessment process.