RESULTS: Model performance varied across antibiotics. The Support Vector Machine excelled in predicting ciprofloxacin resistance (87% accuracy, F1 Score: 0.57), Light Gradient Boosting Machine for cefotaxime (92% accuracy, F1 Score: 0.42), and Gradient Boosting for ampicillin (58% accuracy, F1 Score: 0.66). In validation with data from Africa, Logistic Regression showed high accuracy for ampicillin (94%, F1 Score: 0.97), while Random Forest and Light Gradient Boosting Machine were effective for ciprofloxacin (50% accuracy, F1 Score: 0.56) and cefotaxime (45% accuracy, F1 Score:0.54), respectively. Key mutations associated with AMR were identified for these antibiotics.
CONCLUSIONS: As the threat of AMR continues to rise, the successful application of these models, particularly on genomic datasets from LMICs, signals a promising avenue for improving AMR prediction to support large AMR surveillance programs. This work thus not only expands our current understanding of the genetic underpinnings of AMR but also provides a robust methodological framework that can guide future research and applications in the fight against AMR.
结果:模型性能因抗生素而异。支持向量机在预测环丙沙星耐药性方面表现出色(准确率为87%,F1得分:0.57),头孢噻肟光梯度升压机(92%精度,F1得分:0.42),和氨苄青霉素的梯度提升(58%的准确率,F1得分:0.66)。用非洲的数据验证,Logistic回归显示氨苄青霉素的准确性高(94%,F1得分:0.97),而随机森林和光梯度升压机对环丙沙星有效(50%的准确度,F1评分:0.56)和头孢噻肟(准确率为45%,F1得分:0.54),分别。鉴定了这些抗生素的与AMR相关的关键突变。
结论:随着AMR的威胁不断增加,这些模型的成功应用,特别是来自LMIC的基因组数据集,这标志着改善AMR预测以支持大型AMR监测计划的有希望的途径。因此,这项工作不仅扩展了我们目前对AMR遗传基础的理解,而且提供了一个强大的方法论框架,可以指导未来在对抗AMR方面的研究和应用。