graph convolution neural network

图卷积神经网络
  • 文章类型: Journal Article
    背景:准确预测血浆蛋白结合率(PPBR)和口服生物利用度(OBA)有助于更好地揭示药物在人体内的吸收和分布以及随后的药物设计。尽管机器学习模型在预测精度方面取得了良好的效果,在处理具有不规则拓扑结构的数据时,它们经常遭受精度不足的困扰。
    方法:鉴于此,本研究提出了一种基于图卷积网络(GCN)的药代动力学参数预测框架,预测了小分子药物的PPBR和OBA。在框架中,GCN首先用于提取药物分子拓扑结构的空间特征信息,从而更好地了解节点特征和节点间的关联信息。然后,基于药物相似性的原则,这项研究计算了小分子药物之间的相似性,选择不同的阈值来构建数据集,并建立了以GCN算法为中心的预测模型。实验结果表明,与传统的机器学习预测模型相比,基于GCN方法构建的预测模型在分子间相似性阈值为0.25的PPBR和OBA数据集上表现最好,MAE分别为0.155和0.167。此外,为了进一步提高预测模型的准确性,GCN与其他算法相结合。与使用单一GCN方法相比,组合模型得到的预测值分布与真实值高度吻合。总之,这项工作为今后提高药物早期筛查率提供了新的方法。
    BACKGROUND: Accurately predicting plasma protein binding rate (PPBR) and oral bioavailability (OBA) helps to better reveal the absorption and distribution of drugs in the human body and subsequent drug design. Although machine learning models have achieved good results in prediction accuracy, they often suffer from insufficient accuracy when dealing with data with irregular topological structures.
    METHODS: In view of this, this study proposes a pharmacokinetic parameter prediction framework based on graph convolutional networks (GCN), which predicts the PPBR and OBA of small molecule drugs. In the framework, GCN is first used to extract spatial feature information on the topological structure of drug molecules, in order to better learn node features and association information between nodes. Then, based on the principle of drug similarity, this study calculates the similarity between small molecule drugs, selects different thresholds to construct datasets, and establishes a prediction model centered on the GCN algorithm. The experimental results show that compared with traditional machine learning prediction models, the prediction model constructed based on the GCN method performs best on PPBR and OBA datasets with an inter-molecular similarity threshold of 0.25, with MAE of 0.155 and 0.167, respectively. In addition, in order to further improve the accuracy of the prediction model, GCN is combined with other algorithms. Compared to using a single GCN method, the distribution of the predicted values obtained by the combined model is highly consistent with the true values. In summary, this work provides a new method for improving the rate of early drug screening in the future.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    大脑变异是发育障碍的原因,包括自闭症谱系障碍(ASD)。EEG信号通过揭示有关脑功能异常的关键信息来有效地检测神经系统状况。
    这项研究旨在利用从自闭症和典型发育儿童收集的脑电图数据,研究图卷积神经网络(GCNN)在基于通过脑电图信号揭示的神经系统异常预测ASD中的潜力。
    在这项研究中,脑电图数据来自中央精神病学研究所的8名自闭症儿童和8名使用儿童自闭症评定量表诊断的典型发育儿童,兰契.脑电图记录是使用具有257个通道的HydroCelGSN完成的,并使用了71个具有10-10个国际等值的频道。电极被分成12个脑区。为ASD预测引入了GCNN,之前是自回归和谱特征提取。
    前额叶脑区,对于情感等认知功能至关重要,记忆,和社会互动,被证明对ASD最具预测性,达到87.07%的准确率。这强调了GCNN方法用于基于EEG的ASD检测的适用性。
    收集的详细数据集增强了对ASD神经基础的理解,有利于参与ASD诊断的医疗保健从业人员。
    UNASSIGNED: Brain variations are responsible for developmental impairments, including autism spectrum disorder (ASD). EEG signals efficiently detect neurological conditions by revealing crucial information about brain function abnormalities.
    UNASSIGNED: This study aims to utilize EEG data collected from both autistic and typically developing children to investigate the potential of a Graph Convolutional Neural Network (GCNN) in predicting ASD based on neurological abnormalities revealed through EEG signals.
    UNASSIGNED: In this study, EEG data were gathered from eight autistic children and eight typically developing children diagnosed using the Childhood Autism Rating Scale at the Central Institute of Psychiatry, Ranchi. EEG recording was done using a HydroCel GSN with 257 channels, and 71 channels with 10-10 international equivalents were utilized. Electrodes were divided into 12 brain regions. A GCNN was introduced for ASD prediction, preceded by autoregressive and spectral feature extraction.
    UNASSIGNED: The anterior-frontal brain region, crucial for cognitive functions like emotion, memory, and social interaction, proved most predictive of ASD, achieving 87.07% accuracy. This underscores the suitability of the GCNN method for EEG-based ASD detection.
    UNASSIGNED: The detailed dataset collected enhances understanding of the neurological basis of ASD, benefiting healthcare practitioners involved in ASD diagnosis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    全世界最常见的死亡原因之一是心脏病,包括心律失常.今天,人工智能和医学统计学等科学正在寻找正确和自动诊断心律失常的方法和模型。为了追求提高自动化方法的准确性,已经进行了许多研究。然而,在以前的文章中,模型中未包括心脏导线之间的关系和结构。看来,ECG数据的结构可以帮助提高心律失常检测的准确性。因此,在这项研究中,介绍了一种新的心电图(ECG)数据结构,和图卷积网络(GCN),它有可能学习结构,用于提高心律失常诊断的准确性。考虑到基于不同ECG极点的心脏导联和簇之间的关系,引入了一种新的结构。在这个结构中,使用互信息(MI)指数来评估线索之间的关系,权重是根据导线的极点给出的。加权互信息(WMI)矩阵(新结构)由R软件形成。最后,通过该结构对15层GCN网络进行了调整,并对人的心律失常进行了检测和分类。为了评估拟议的新网络的性能,灵敏度,精度,特异性,准确度,并使用了混淆矩阵指数。此外,通过三种不同的结构比较了GCN网络的精度,包括WMI,MI,和身份。本研究使用Chapman的12导联心电图数据集。结果表明,灵敏度值,精度,特异性,具有15个中间层的GCN-WMI网络的精度等于98.74%,99.08%,99.97%&99.82%,分别。这种新提出的网络比图卷积网络互信息(GCN-MI)更准确,精度等于99.71%,GCN-Id精度等于92.68%。因此,利用这个网络,对心律失常的类型进行识别和分类.此外,图卷积网络加权互信息(GCN-WMI)提出的新网络比对同一数据集(Chapman)进行的其他研究更准确。根据获得的结果,本研究提出的结构提高了Chapman数据集心律失常诊断和分类的准确性.实现心律失常诊断的这种准确性是临床科学中的一项重大成就。
    One of the most common causes of death worldwide is heart disease, including arrhythmia. Today, sciences such as artificial intelligence and medical statistics are looking for methods and models for correct and automatic diagnosis of cardiac arrhythmia. In pursuit of increasing the accuracy of automated methods, many studies have been conducted. However, in none of the previous articles, the relationship and structure between the heart leads have not been included in the model. It seems that the structure of ECG data can help develop the accuracy of arrhythmia detection. Therefore, in this study, a new structure of Electrocardiogram (ECG) data was introduced, and the Graph Convolution Network (GCN), which has the possibility of learning the structure, was used to develop the accuracy of cardiac arrhythmia diagnosis. Considering the relationship between the heart leads and clusters based on different ECG poles, a new structure was introduced. In this structure, the Mutual Information(MI) index was used to evaluate the relationship between the leads, and weight was given based on the poles of the leads. Weighted Mutual Information (WMI) matrices (new structure) were formed by R software. Finally, the 15-layer GCN network was adjusted by this structure and the arrhythmia of people was detected and classified by it. To evaluate the performance of the proposed new network, sensitivity, precision, specificity, accuracy, and confusion matrix indices were used. Also, the accuracy of GCN networks was compared by three different structures, including WMI, MI, and Identity. Chapman\'s 12-lead ECG Dataset was used in this study. The results showed that the values of sensitivity, precision, specificity, and accuracy of the GCN-WMI network with 15 intermediate layers were equal to 98.74%, 99.08%, 99.97% & 99.82%, respectively. This new proposed network was more accurate than the Graph Convolution Network-Mutual Information (GCN-MI) with an accuracy equal to 99.71% and GCN-Id with an accuracy equal to 92.68%. Therefore, utilizing this network, the types of arrhythmia were recognized and classified. Also, the new network proposed by the Graph Convolution Network-Weighted Mutual Information (GCN-WMI) was more accurate than those conducted in other studies on the same data set (Chapman). Based on the obtained results, the structure proposed in this study increased the accuracy of cardiac arrhythmia diagnosis and classification on the Chapman data set. Achieving such accuracy for arrhythmia diagnosis is a great achievement in clinical sciences.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    脑电图(EEG)功能连接(FC)网络是个性化的,并且在基于EEG的人员识别中起着重要作用。传统的FC网络是通过脑电通道之间的统计依赖性和相关性来构建的,而不考虑通道之间的空间关系。基于传统FC网络的个体识别算法对信道完整性敏感,对信号预处理具有重要的依赖性,找到FC网络的新表示可能有助于提高识别算法的性能。由于体积传导效应,EEG信号在空间上是平滑的。考虑信道之间的这种空间关系可以提供FC网络的更准确表示。在这项研究中,我们提出了一个具有虚拟节点的EEGFC网络,该网络结合了通道的空间关系和功能连通性。个体识别的比较结果表明,新的EEG网络更加个性化,在没有预处理的情况下,数据的准确率达到98.64%。此外,我们的算法在减少通道数量方面更加健壮,即使在去除大面积通道时也能很好地执行。
    An electroencephalogram (EEG) functional connectivity (FC) network is individualized and plays a significant role in EEG-based person identification. Traditional FC networks are constructed by statistical dependence and correlation between EEG channels, without considering the spatial relationships between the channels. The individual identification algorithm based on traditional FC networks is sensitive to the integrity of channels and crucially relies on signal preprocessing; therefore, finding a new presentation for FC networks may help increase the performance of the identification algorithms. EEG signals are smooth across space owing to the volume conduction effect. Considering such spatial relationships among channels can provide a more accurate representation of FC networks. In this study, we propose an EEG FC network with virtual nodes that combines the spatial relationships and functional connectivity of channels. The comparison results for individual identification show that the novel EEG network is more individualized and achieves an accuracy of 98.64% for data without preprocessing. Furthermore, our algorithm is more robust in reducing the number of channels and can perform well even when a large area of channels is removed.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目标:尽管人工智能最近有所发展,通过OGS预测上颌骨和下颌骨的手术运动可能比通过正畸治疗预测牙齿运动更困难。使用正颌手术(OGS)患者的术前(T0)和术后(T1)侧位脑图(lat-ceph)和双嵌入模块图卷积神经网络(DEM-GCNN)模型评估手术运动的预测准确性。
    方法:使用来自3个机构的599对作为培训,内部验证,内部测试集和其他6个机构的201对被用作外部测试集。DEM-GCNN模型(IEM,学习lat-ceph图像;LTEM,学习标志)被开发用于预测ANS和PNS在上颌骨以及下颌骨的B点和Md1冠的手术运动的数量和方向。比较了由OGS(地面实况)实际移动并由DEM-GCNN模型预测的T1地标坐标与预先存在的基于CNN的Model-C(学习lat-ceph图像)之间的距离。
    结果:在内部和外部测试中,DEM-GCNN在所有地标中都没有表现出与地面真实的显著差异(ANS,PNS,B点,Md1crown,所有P>0.05)。当比较每个地标的累积成功检测率时,DEM-GCNN在内部和外部测试中均显示出比Model-C更高的值。在显示预测结果误差分布的小提琴图中,内部和外部测试表明,DEM-GCNN在PNS中具有显著的性能改进,ANS,B点,Md1crown比Model-C.DEM-GCNN显示出明显低于Model-C的预测误差值(单颌手术,B点,Md1crown,所有P<0.005;双颌手术,PNS,ANS,所有P<0.05;B点,Md1crown,所有P<0.005)。
    结论:我们开发了一个稳健的OGS规划模型,该模型具有最大化的泛化性,尽管来自9个机构的lat-cephs具有不同的质量。
    OBJECTIVE: Despite recent development of AI, prediction of the surgical movement in the maxilla and mandible by OGS might be more difficult than that of tooth movement by orthodontic treatment. To evaluate the prediction accuracy of the surgical movement using pairs of pre-(T0) and post-surgical (T1) lateral cephalograms (lat-ceph) of orthognathic surgery (OGS) patients and dual embedding module-graph convolution neural network (DEM-GCNN) model.
    METHODS: 599 pairs from 3 institutions were used as training, internal validation, and internal test sets and 201 pairs from other 6 institutions were used as external test set. DEM-GCNN model (IEM, learning the lat-ceph images; LTEM, learning the landmarks) was developed to predict the amount and direction of surgical movement of ANS and PNS in the maxilla and B-point and Md1crown in the mandible. The distance between T1 landmark coordinates actually moved by OGS (ground truth) and predicted by DEM-GCNN model and pre-existed CNN-based Model-C (learning the lat-ceph images) was compared.
    RESULTS: In both internal and external tests, DEM-GCNN did not exhibit significant difference from ground truth in all landmarks (ANS, PNS, B-point, Md1crown, all P > 0.05). When the accumulated successful detection rate for each landmark was compared, DEM-GCNN showed higher values than Model-C in both the internal and external tests. In violin plots exhibiting the error distribution of the prediction results, both internal and external tests showed that DEM-GCNN had significant performance improvement in PNS, ANS, B-point, Md1crown than Model-C. DEM-GCNN showed significantly lower prediction error values than Model-C (one-jaw surgery, B-point, Md1crown, all P < 0.005; two-jaw surgery, PNS, ANS, all P < 0.05; B point, Md1crown, all P < 0.005).
    CONCLUSIONS: We developed a robust OGS planning model with maximized generalizability despite diverse qualities of lat-cephs from 9 institutions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Researches have demonstrated that microorganisms are indispensable for the nutrition transportation, growth and development of human bodies, and disorder and imbalance of microbiota may lead to the occurrence of diseases. Therefore, it is crucial to study relationships between microbes and diseases. In this manuscript, we proposed a novel prediction model named MADGAN to infer potential microbe-disease associations by combining biological information of microbes and diseases with the generative adversarial networks. To our knowledge, it is the first attempt to use the generative adversarial network to complete this important task. In MADGAN, we firstly constructed different features for microbes and diseases based on multiple similarity metrics. And then, we further adopted graph convolution neural network (GCN) to derive different features for microbes and diseases automatically. Finally, we trained MADGAN to identify latent microbe-disease associations by games between the generation network and the decision network. Especially, in order to prevent over-smoothing during the model training process, we introduced the cross-level weight distribution structure to enhance the depth of the network based on the idea of residual network. Moreover, in order to validate the performance of MADGAN, we conducted comprehensive experiments and case studies based on databases of HMDAD and Disbiome respectively, and experimental results demonstrated that MADGAN not only achieved satisfactory prediction performances, but also outperformed existing state-of-the-art prediction models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    微生物关联网络的研究对于了解微生物的致病机理和促进微生物在精准医疗中的应用具有重要意义。在本文中,研究了基于多数据生物网络和图神经网络算法的微生物-疾病关联预测。HMDAD数据库提供了一个包括39种疾病的数据集,292种微生物,和450个已知的微生物-疾病关联。我们根据微生物相似网络提出了一个微生物-疾病异构网络,疾病相似性网络,和已知的微生物-疾病关联。此外,我们将网络集成到图卷积神经网络算法中,并开发了GCNN4Micro-Dis模型来预测微生物与疾病的关联.最后,通过5倍交叉验证评估GCNN4Micro-Dis模型的性能.我们将所有已知的微生物-疾病关联数据随机分为五组。结果表明,AUC平均值和标准偏差为0.8954±0.0030。我们的模型具有良好的预测能力,可以帮助识别新的微生物-疾病关联。此外,我们将GCNN4Micro-Dis与三种预测微生物-疾病关联的先进方法进行了比较,KATZHMDA,BiRWHMDA,和LRLSHMDA。结果表明,该方法比其他三种方法具有更好的预测性能。此外,我们选择乳腺癌作为一个案例研究,从患者的肠道菌群中发现了与乳腺癌相关的前12种微生物,这进一步验证了模型的准确性。
    The research on microbe association networks is greatly significant for understanding the pathogenic mechanism of microbes and promoting the application of microbes in precision medicine. In this paper, we studied the prediction of microbe-disease associations based on multi-data biological network and graph neural network algorithm. The HMDAD database provided a dataset that included 39 diseases, 292 microbes, and 450 known microbe-disease associations. We proposed a Microbe-Disease Heterogeneous Network according to the microbe similarity network, disease similarity network, and known microbe-disease associations. Furthermore, we integrated the network into the graph convolutional neural network algorithm and developed the GCNN4Micro-Dis model to predict microbe-disease associations. Finally, the performance of the GCNN4Micro-Dis model was evaluated via 5-fold cross-validation. We randomly divided all known microbe-disease association data into five groups. The results showed that the average AUC value and standard deviation were 0.8954 ± 0.0030. Our model had good predictive power and can help identify new microbe-disease associations. In addition, we compared GCNN4Micro-Dis with three advanced methods to predict microbe-disease associations, KATZHMDA, BiRWHMDA, and LRLSHMDA. The results showed that our method had better prediction performance than the other three methods. Furthermore, we selected breast cancer as a case study and found the top 12 microbes related to breast cancer from the intestinal flora of patients, which further verified the model\'s accuracy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    乳酸菌抗菌肽(LABAMPs)是乳酸菌代谢过程中产生的一类活性多肽,能抑制或杀死食品中的致病菌或腐败菌。LABAMP在与人类密切相关的重要实践领域有着广泛的应用,比如食品生产,高效农业种植,等等。然而,生物实验研究人员筛选抗菌肽费时费力。因此,迫切需要开发一种预测LABAMP的模型。在这项工作中,我们设计了一个用于识别LABAMP的图卷积神经网络框架。我们基于氨基酸构建异构图,三肽及其关系,并学习图卷积网络(GCN)的权重。我们的GCN在输入序列标签的监督下,迭代地完成了图中嵌入词和序列权重的学习。我们将10倍交叉验证实验应用于两个训练数据集,分别获得0.9163和0.9379的准确性。它们比其他机器学习和GNN算法更高。在独立的测试数据集中,两个数据集的准确性分别为0.9130和0.9291,比其他在线网络服务器的最佳方法高1.08%和1.57%。
    Lactic acid bacteria antimicrobial peptides (LABAMPs) are a class of active polypeptide produced during the metabolic process of lactic acid bacteria, which can inhibit or kill pathogenic bacteria or spoilage bacteria in food. LABAMPs have broad application in important practical fields closely related to human beings, such as food production, efficient agricultural planting, and so on. However, screening for antimicrobial peptides by biological experiment researchers is time-consuming and laborious. Therefore, it is urgent to develop a model to predict LABAMPs. In this work, we design a graph convolutional neural network framework for identifying of LABAMPs. We build heterogeneous graph based on amino acids, tripeptide and their relationships and learn weights of a graph convolutional network (GCN). Our GCN iteratively completes the learning of embedded words and sequence weights in the graph under the supervision of inputting sequence labels. We applied 10-fold cross-validation experiment to two training datasets and acquired accuracy of 0.9163 and 0.9379 respectively. They are higher that of other machine learning and GNN algorithms. In an independent test dataset, accuracy of two datasets is 0.9130 and 0.9291, which are 1.08% and 1.57% higher than the best methods of other online webservers.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    环状RNA(circRNAs)和microRNAs(miRNAs)之间的相互作用已被证明可以改变基因表达并调节疾病的基因。由于传统的实验方法耗时耗力,大多数circRNA-miRNA相互作用在很大程度上仍然未知。开发计算方法来大规模探索circRNAs和miRNAs之间的相互作用可以帮助弥合这一差距。在本文中,我们提出了一种基于图卷积神经网络的方法GCNCMI来预测circRNAs和miRNAs之间的潜在相互作用。GCNCMI首先挖掘图卷积神经网络中相邻节点的潜在交互,然后在图卷积层上递归传播交互信息。最后,它将每个层生成的嵌入式表示统一起来,以进行最终预测。在五次交叉验证中,GCNCMI达到0.9312的最高AUC和0.9412的最高AUPR。此外,两个miRNA的案例研究,hsa-miR-622和hsa-miR-149-5p,表明我们的模型对预测circRNA-miRNA相互作用有很好的效果。代码和数据可在https://github.com/csuhjhjhj/GCNCMI获得。
    The interactions between circular RNAs (circRNAs) and microRNAs (miRNAs) have been shown to alter gene expression and regulate genes on diseases. Since traditional experimental methods are time-consuming and labor-intensive, most circRNA-miRNA interactions remain largely unknown. Developing computational approaches to large-scale explore the interactions between circRNAs and miRNAs can help bridge this gap. In this paper, we proposed a graph convolutional neural network-based approach named GCNCMI to predict the potential interactions between circRNAs and miRNAs. GCNCMI first mines the potential interactions of adjacent nodes in the graph convolutional neural network and then recursively propagates interaction information on the graph convolutional layers. Finally, it unites the embedded representations generated by each layer to make the final prediction. In the five-fold cross-validation, GCNCMI achieved the highest AUC of 0.9312 and the highest AUPR of 0.9412. In addition, the case studies of two miRNAs, hsa-miR-622 and hsa-miR-149-5p, showed that our model has a good effect on predicting circRNA-miRNA interactions. The code and data are available at https://github.com/csuhjhjhj/GCNCMI.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:提出一种使用时空卷积神经网络在动态蛋白质网络中挖掘复合物的新方法。
    方法:边缘强度,定义了动态蛋白质网络的节点强度和边存在概率。根据图上的时间序列信息和结构信息,利用希尔伯特-黄变换设计了两个卷积算子,注意机制和残差连接技术来表示和学习网络中蛋白质的特征,构建了动态蛋白质网络特征图谱。最后,光谱聚类用于鉴定蛋白质复合物。
    结果:在几个公共生物数据集上的仿真结果表明,所提出的算法的F值在DIP数据集和MIPS数据集上超过90%。与其他4种识别算法(DPCMNE,GE-CFI,VGAE和NOCD),该算法将识别效率提高了34.5%,28.7%,25.4%和17.6%,分别。
    结论:深度学习技术的应用可以提高动态蛋白质网络分析的效率。
    OBJECTIVE: To propose a new method for mining complexes in dynamic protein network using spatiotemporal convolution neural network.
    METHODS: The edge strength, node strength and edge existence probability are defined for modeling of the dynamic protein network. Based on the time series information and structure information on the graph, two convolution operators were designed using Hilbert-Huang transform, attention mechanism and residual connection technology to represent and learn the characteristics of the proteins in the network, and the dynamic protein network characteristic map was constructed. Finally, spectral clustering was used to identify the protein complexes.
    RESULTS: The simulation results on several public biological datasets showed that the F value of the proposed algorithm exceeded 90% on DIP dataset and MIPS dataset. Compared with 4 other recognition algorithms (DPCMNE, GE-CFI, VGAE and NOCD), the proposed algorithm improved the recognition efficiency by 34.5%, 28.7%, 25.4% and 17.6%, respectively.
    CONCLUSIONS: The application of deep learning technology can improve the efficiency in analysis of dynamic protein networks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号