■越来越多的证据表明,人类的健康和疾病与人体内的微生物密切相关。
■在这份手稿中,一种基于图注意力网络和稀疏自编码器的新计算模型,叫做GCANCAE,被提议用于推断可能的微生物-疾病关联。在GCANCAE,我们首先通过组合已知的微生物-疾病关系构建了一个异构网络,疾病相似性,微生物的相似性。然后,我们采用改进的GCN和CSAE来提取邻接矩阵中的邻居关系和异构网络中的新特征表示。之后,为了估计与疾病相关的潜在微生物的可能性,我们整合了这两种类型的表示来创建疾病和微生物的独特特征矩阵,分别,并通过计算这两种类型的特征矩阵的内积获得潜在微生物-疾病关联的预测分数。
■基于HMDAD和Disbiome等基线数据库,进行了深入的实验来评估GCANCAE的预测能力,实验结果表明,在2倍和5倍CV的框架下,GCANCAE比最先进的竞争方法获得了更好的性能。此外,三类常见疾病的案例研究,比如哮喘,肠易激综合征(IBS),和2型糖尿病(T2D),证实了GCANCAE的效率。
UNASSIGNED: Accumulating evidence shows that human health and disease are closely related to the microbes in the human body.
UNASSIGNED: In this manuscript, a new computational model based on graph attention networks and sparse autoencoders, called GCANCAE, was proposed for inferring possible microbe-disease associations. In GCANCAE, we first constructed a heterogeneous network by combining known microbe-disease relationships, disease similarity, and microbial similarity. Then, we adopted the improved GCN and the CSAE to extract neighbor relations in the adjacency matrix and novel feature representations in heterogeneous networks. After that, in order to estimate the likelihood of a potential microbe associated with a disease, we integrated these two types of representations to create unique eigenmatrices for diseases and microbes, respectively, and obtained predicted scores for potential microbe-disease associations by calculating the inner product of these two types of eigenmatrices.
UNASSIGNED: Based on the baseline databases such as the HMDAD and the Disbiome, intensive experiments were conducted to evaluate the prediction ability of GCANCAE, and the experimental results demonstrated that GCANCAE achieved better performance than state-of-the-art competitive methods under the frameworks of both 2-fold and 5-fold CV. Furthermore, case studies of three categories of common diseases, such as asthma, irritable bowel syndrome (IBS), and type 2 diabetes (T2D), confirmed the efficiency of GCANCAE.