RESULTS: Based on the known microbe-disease associations derived from the Human Microbe-Disease Association Database (HMDAD), the proposed model shows reliable performance with high values of the area under ROC curve (AUC) of 0.9456 and 0.8866 in leave-one-out cross validations and five-fold cross validations, respectively. In case studies of colorectal carcinoma, 80% out of the top-20 predicted microbes have been experimentally confirmed via published literatures.
CONCLUSIONS: Based on the assumption that functionally similar microbes tend to share the similar interaction patterns with human diseases, we here propose a group based computational model of Bayesian disease-oriented ranking to prioritize the most potential microbes associating with various human diseases. Based on the sequence information of genes, two computational approaches (BLAST+ and MEGA 7) are leveraged to measure the microbe-microbe similarity from different perspectives. The disease-disease similarity is calculated by capturing the hierarchy information from the Medical Subject Headings (MeSH) data. The experimental results illustrate the accuracy and effectiveness of the proposed model. This work is expected to facilitate the characterization and identification of promising microbial biomarkers.
结果:基于来自人类微生物-疾病关联数据库(HMDAD)的已知微生物-疾病关联,所提出的模型在留一交叉验证和五倍交叉验证中显示出可靠的性能,ROC曲线下面积(AUC)的高值为0.9456和0.8866,分别。在大肠癌的案例研究中,在预测的前20种微生物中,有80%已通过已发表的文献进行了实验证实。
结论:基于功能相似的微生物倾向于与人类疾病共享相似的相互作用模式的假设,我们在这里提出了一个基于贝叶斯疾病导向排名的计算模型,以优先考虑与各种人类疾病相关的最有潜力的微生物。根据基因的序列信息,利用两种计算方法(BLAST+和MEGA7)从不同角度测量微生物-微生物相似性。通过从医学主题标题(MeSH)数据捕获层级信息来计算疾病-疾病相似性。实验结果验证了该模型的准确性和有效性。这项工作有望促进有前途的微生物生物标志物的表征和鉴定。