graph convolutional networks

图卷积网络
  • 文章类型: Journal Article
    背景:预测个人因COVID-19死亡的风险对于计划和优化资源至关重要。然而,由于现实世界的死亡率相对较低,特别是在香港这样的地方,由于数据集的不平衡特性,这使得建立准确的预测模型变得困难。这项研究介绍了图形卷积网络(GCN)的创新应用,以使用高度不平衡的数据集预测COVID-19患者的生存。与传统模式不同,GCN利用数据内的结构关系,增强预测准确性和鲁棒性。通过将人口统计和实验室数据集成到GCN框架中,我们的方法解决了类不平衡,并证明了预测准确性的显著提高。
    方法:该队列包括2020年1月23日至12月31日在香港42家公立医院收治的符合研究标准的所有连续阳性COVID-19患者(n=7,606)。我们提出了基于人群的图卷积神经网络(GCN)模型,年龄和性别作为预测生存结果的输入。此外,我们将我们提出的模型与Cox比例风险(CPH)模型进行了比较,传统的机器学习模型,和过采样机器学习模型。此外,对测试集进行了子组分析,以便更深入地了解每个患者节点与其邻居之间的关系,揭示不准确预测的可能根本原因。
    结果:GCN模型是表现最好的模型,AUC为0.944,显著优于所有其他模型(p<0.05),包括过采样CPH模型(0.708),线性回归(0.877),线性判别分析(0.860),K-最近邻(0.834),高斯预测因子(0.745)和支持向量机(0.847)。根据Kaplan-Meier的估计,GCN模型在低风险和高风险个体之间表现出良好的可判性(p<0.0001)。基于使用加权得分的子分析,尽管GCN模型能够很好地区分不同的预测组,假阴性(FN)和真阴性(TN)组之间的分离不充分。
    结论:GCN模型大大优于所有其他机器学习方法和基准CPH模型。因此,当应用于这个不平衡的COVID生存数据集时,采用人口图表示可能是实现良好预测的一种方法。
    BACKGROUND: Predicting an individual\'s risk of death from COVID-19 is essential for planning and optimising resources. However, since the real-world mortality rate is relatively low, particularly in places like Hong Kong, this makes building an accurate prediction model difficult due to the imbalanced nature of the dataset. This study introduces an innovative application of graph convolutional networks (GCNs) to predict COVID-19 patient survival using a highly imbalanced dataset. Unlike traditional models, GCNs leverage structural relationships within the data, enhancing predictive accuracy and robustness. By integrating demographic and laboratory data into a GCN framework, our approach addresses class imbalance and demonstrates significant improvements in prediction accuracy.
    METHODS: The cohort included all consecutive positive COVID-19 patients fulfilling study criteria admitted to 42 public hospitals in Hong Kong between January 23 and December 31, 2020 (n = 7,606). We proposed the population-based graph convolutional neural network (GCN) model which took blood test results, age and sex as inputs to predict the survival outcomes. Furthermore, we compared our proposed model to the Cox Proportional Hazard (CPH) model, conventional machine learning models, and oversampling machine learning models. Additionally, a subgroup analysis was performed on the test set in order to acquire a deeper understanding of the relationship between each patient node and its neighbours, revealing possible underlying causes of the inaccurate predictions.
    RESULTS: The GCN model was the top-performing model, with an AUC of 0.944, considerably outperforming all other models (p < 0.05), including the oversampled CPH model (0.708), linear regression (0.877), Linear Discriminant Analysis (0.860), K-nearest neighbours (0.834), Gaussian predictor (0.745) and support vector machine (0.847). With Kaplan-Meier estimates, the GCN model demonstrated good discriminability between low- and high-risk individuals (p < 0.0001). Based on subanalysis using the weighted-in score, although the GCN model was able to discriminate well between different predicted groups, the separation was inadequate between false negative (FN) and true negative (TN) groups.
    CONCLUSIONS: The GCN model considerably outperformed all other machine learning methods and baseline CPH models. Thus, when applied to this imbalanced COVID survival dataset, adopting a population graph representation may be an approach to achieving good prediction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    空间分辨转录组学将高通量转录组测量与保留的空间细胞组织信息集成在一起。然而,许多技术无法达到单细胞分辨率。我们介绍STdGCN,利用单细胞RNA测序(scRNA-seq)作为空间转录组(ST)数据中细胞类型去卷积的参考的图形模型。STdGCN结合了来自scRNA-seq的表达谱和来自ST数据的空间定位以进行去卷积。对多个数据集的广泛基准测试表明,STdGCN优于17个最先进的模型。在人类乳腺癌Visium数据集中,STdGCN描绘基质,淋巴细胞,和癌细胞,辅助肿瘤微环境分析。在人类心脏ST数据中,STdGCN识别组织发育过程中内皮-心肌细胞通讯的变化。
    Spatially resolved transcriptomics integrates high-throughput transcriptome measurements with preserved spatial cellular organization information. However, many technologies cannot reach single-cell resolution. We present STdGCN, a graph model leveraging single-cell RNA sequencing (scRNA-seq) as reference for cell-type deconvolution in spatial transcriptomic (ST) data. STdGCN incorporates expression profiles from scRNA-seq and spatial localization from ST data for deconvolution. Extensive benchmarking on multiple datasets demonstrates that STdGCN outperforms 17 state-of-the-art models. In a human breast cancer Visium dataset, STdGCN delineates stroma, lymphocytes, and cancer cells, aiding tumor microenvironment analysis. In human heart ST data, STdGCN identifies changes in endothelial-cardiomyocyte communications during tissue development.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基于运动的康复计划已被证明可有效提高生活质量,降低死亡率和再住院率。人工智能驱动的虚拟康复,让病人在家里独立完成锻炼,利用人工智能算法分析锻炼数据,向患者提供反馈并更新临床医生的进展。这些项目通常规定了各种锻炼类型,导致康复运动评估数据集面临明显的挑战:虽然在整体训练样本中丰富,这些数据集通常对每种运动类型的样本数量有限。这种差异阻碍了现有方法在每种锻炼类型的样本量如此小的情况下训练可概括模型的能力。解决这个问题,本文介绍了一种新的监督对比学习框架,该框架具有硬和软负样本,有效地利用整个数据集来训练适用于所有运动类型的单个模型。这个模型,具有时空图卷积网络(ST-GCN)架构,证明了在练习中的泛化能力增强,整体复杂性降低。通过对三个公开的康复运动评估数据集进行广泛的实验,UI-PRMD,IRDS,KIMORE,我们的方法已经被证明超越了现有的方法,在康复运动质量评估中树立新的基准。
    Exercise-based rehabilitation programs have proven to be effective in enhancing the quality of life and reducing mortality and rehospitalization rates. AI-driven virtual rehabilitation, which allows patients to independently complete exercises at home, utilizes AI algorithms to analyze exercise data, providing feedback to patients and updating clinicians on their progress. These programs commonly prescribe a variety of exercise types, leading to a distinct challenge in rehabilitation exercise assessment datasets: while abundant in overall training samples, these datasets often have a limited number of samples for each individual exercise type. This disparity hampers the ability of existing approaches to train generalizable models with such a small sample size per exercise type. Addressing this issue, this paper introduces a novel supervised contrastive learning framework with hard and soft negative samples that effectively utilizes the entire dataset to train a single model applicable to all exercise types. This model, with a Spatial-Temporal Graph Convolutional Network (ST-GCN) architecture, demonstrated enhanced generalizability across exercises and a decrease in overall complexity. Through extensive experiments on three publicly available rehabilitation exercise assessment datasets, UI-PRMD, IRDS, and KIMORE, our method has proven to surpass existing methods, setting a new benchmark in rehabilitation exercise quality assessment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    自闭症谱系障碍(ASD)影响了美国的大量儿童和成人,和全世界。ASD的早期和快速诊断可以显着改善患者及其家人的生活质量。先前的研究提供了强有力的证据,即从ASD个体收集的结构和功能磁共振成像(MRI)数据表现出在局部和全局上不同的特征。大脑的空间和时间神经模式-因此可用于各种精神障碍的诊断目的。然而,来自MRI的数据是高维的,需要先进的方法来理解这些数据集。在本文中,我们提出了一种基于图卷积网络(GCN)的新模型,该模型可以利用静息状态fMRI(rs-fMRI)数据将ASD受试者与健康对照(HC)分类。除了使用传统相关矩阵的图,我们提出的GCN模型将graphlet拓扑计数作为训练特征之一。我们的结果表明,graphlet可以保留从fMRI数据获得的图形的拓扑信息。结合我们的GCN,图形保留了足够的拓扑信息来区分ASD和HC。我们提出的模型在整个ABIDE-I数据集(1035名受试者)上的平均准确率为64.27%,最高的特定地点准确率为75.9%。这与其他最先进的方法相当-同时可能更容易解释。
    Autism spectrum disorder (ASD) affects large number of children and adults in the US, and worldwide. Early and quick diagnosis of ASD can improve the quality of life significantly both for patients and their families. Prior research provides strong evidence that structural and functional magnetic resonance imaging (MRI) data collected from individuals with ASD exhibit distinguishing characteristics that differ in local and global, spatial and temporal neural patterns of the brain - and therefore can be used for diagnostic purposes for various mental disorders. However, the data from MRI are high-dimensional and advanced methods are needed to make sense out of these datasets. In this paper, we present a novel model based on graph convolutional network (GCN) that can utilize resting state fMRI (rs-fMRI) data to classify ASD subjects from health controls (HC). In addition to using the graph from traditional correlation matrices, our proposed GCN model incorporates graphlet topological counting as one of the training features. Our results show that graphlets can preserve the topological information of the graphs obtained from fMRI data. Combined with our GCN, the graphlets retain enough topological information to differentiate between the ASD and HC. Our proposed model gives an average accuracy of 64.27% on the whole ABIDE-I data sets (1035 subjects) and highest site-specific accuracy of 75.9%, which is comparable to other state-of-the-art methods - while potentially open to being more interpretable.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    下一个兴趣点(POI)建议旨在从用户的历史活动中预测用户的下一个POI。现有方法通常依赖于位置级POI签入轨迹来探索用户顺序过渡模式,受到严重的签入数据稀疏性问题的困扰。然而,考虑到区域级别和类别级别的POI序列可以帮助解决这个问题。此外,不同粒度的POI序列之间的协作信息没有得到很好的利用,这可以促进相互增强,有利于增强用户偏好学习。为了应对这些挑战,我们提出了多粒度对比学习(MGCL)用于下一个POI推荐,它利用多粒度表示和对比学习来提高下一个POI推荐性能。具体来说,位置级POI图,类别级别,首先构建区域水平的序列。然后,我们在POI图上使用图卷积网络来提取跨用户的顺序过渡模式。此外,自我注意网络用于学习每个粒度级别的单个用户顺序过渡模式。为了捕获多粒度之间的协作信号,我们采用对比学习方法。最后,我们共同训练推荐和对比学习任务。大量实验证明MGCL比现有技术方法更有效。
    Next Point-of-Interest (POI) recommendation aims to predict the next POI for users from their historical activities. Existing methods typically rely on location-level POI check-in trajectories to explore user sequential transition patterns, which suffer from the severe check-in data sparsity issue. However, taking into account region-level and category-level POI sequences can help address this issue. Moreover, collaborative information between different granularities of POI sequences is not well utilized, which can facilitate mutual enhancement and benefit to augment user preference learning. To address these challenges, we propose multi-granularity contrastive learning (MGCL) for next POI recommendation, which utilizes multi-granularity representation and contrastive learning to improve the next POI recommendation performance. Specifically, location-level POI graph, category-level, and region-level sequences are first constructed. Then, we use graph convolutional networks on POI graph to extract cross-user sequential transition patterns. Furthermore, self-attention networks are used to learn individual user sequential transition patterns for each granularity level. To capture the collaborative signals between multi-granularity, we apply the contrastive learning approach. Finally, we jointly train the recommendation and contrastive learning tasks. Extensive experiments demonstrate that MGCL is more effective than state-of-the-art methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    计算机断层扫描(CT)扫描最近已成为通过图像分类技术快速诊断肺部疾病的主要技术。在这项研究中,我们提出了一种诊断COVID-19疾病的方法,该方法通过利用不同层结构和核大小的图卷积网络(GCN)从CT扫描图像中提取特征,从而提高了诊断的准确性.我们应用U-Net模型来帮助分割和特征提取。与以前从卷积滤波器和池化层中检索深层特征的研究相比,没有充分考虑节点的空间连通性,我们使用GCN进行分类和预测,以捕获空间连通性模式,这提供了显著的关联利益。我们处理提取的深层特征以形成包含图结构的邻接矩阵,并将其与原始图像图和最大内核图一起传递给GCN。我们将这些图组合在一起,形成图输入的一个块,然后将其通过具有额外的dropout层的GCN,以避免过拟合。我们的研究结果表明,建议的框架,称为特征提取图卷积网络(FGCN),与最近提出的不基于图形表示的深度学习架构相比,在识别肺部疾病方面表现更好。所提出的模型还优于通常用于医疗诊断任务的各种迁移学习模型,突出了图表示相对于传统方法的抽象潜力。
    Computed tomography (CT) scans have recently emerged as a major technique for the fast diagnosis of lung diseases via image classification techniques. In this study, we propose a method for the diagnosis of COVID-19 disease with improved accuracy by utilizing graph convolutional networks (GCN) at various layer formations and kernel sizes to extract features from CT scan images. We apply a U-Net model to aid in segmentation and feature extraction. In contrast with previous research retrieving deep features from convolutional filters and pooling layers, which fail to fully consider the spatial connectivity of the nodes, we employ GCNs for classification and prediction to capture spatial connectivity patterns, which provides a significant association benefit. We handle the extracted deep features to form an adjacency matrix that contains a graph structure and pass it to a GCN along with the original image graph and the largest kernel graph. We combine these graphs to form one block of the graph input and then pass it through a GCN with an additional dropout layer to avoid overfitting. Our findings show that the suggested framework, called the feature-extracted graph convolutional network (FGCN), performs better in identifying lung diseases compared to recently proposed deep learning architectures that are not based on graph representations. The proposed model also outperforms a variety of transfer learning models commonly used for medical diagnosis tasks, highlighting the abstraction potential of the graph representation over traditional methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    轴承故障诊断对于保证大型机械设备的安全稳定运行具有重要意义。然而,不一致的操作环境会导致源域和目标域之间的数据分布差异。因此,仅在源域数据上训练的模型在应用于目标域时可能表现不佳,尤其是当目标域数据未标记时。现有方法侧重于改进领域自适应方法以进行有效的迁移学习,但忽略了提取综合特征信息的重要性。为了应对这一挑战,我们提出了一种使用双路径卷积神经网络(CNN)和多并行图卷积网络(GCN)的轴承故障诊断方法,叫做DPC-MGCN,这可以应用于可变的工作条件。要获得完整的特征信息,DPC-MGCN利用双路径CNN从源和目标域的振动信号中提取局部和全局特征。注意力机制随后被应用于识别关键特征,将其转换为邻接矩阵。然后采用多并行GCN来进一步探索这些特征之间的结构信息。为了最小化两个域之间的分布差异,我们引入了多核最大均值差异(MK-MMD)域自适应方法。通过应用DPC-MGCN方法诊断不同工况下的轴承故障,并与其他方法进行比较,我们在各种数据集上展示了其卓越的性能。
    Bearing fault diagnosis is significant in ensuring large machinery and equipment\'s safe and stable operation. However, inconsistent operating environments can lead to data distribution differences between source and target domains. As a result, models trained solely on source-domain data may not perform well when applied to the target domain, especially when the target-domain data is unlabeled. Existing approaches focus on improving domain adaptive methods for effective transfer learning but neglect the importance of extracting comprehensive feature information. To tackle this challenge, we present a bearing fault diagnosis approach using dual-path convolutional neural networks (CNNs) and multi-parallel graph convolutional networks (GCNs), called DPC-MGCN, which can be applied to variable working conditions. To obtain complete feature information, DPC-MGCN leverages dual-path CNNs to extract local and global features from vibration signals in both the source and target domains. The attention mechanism is subsequently applied to identify crucial features, which are converted into adjacency matrices. Multi-parallel GCNs are then employed to further explore the structural information among these features. To minimize the distribution differences between the two domains, we incorporate the multi-kernel maximum mean discrepancy (MK-MMD) domain adaptation method. By applying the DPC-MGCN approach for diagnosing bearing faults under diverse working conditions and comparing it with other methods, we demonstrate its superior performance on various datasets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项研究提出了一种新颖的基于长短期记忆(LSTM)的模型,用于基于分子动力学(MD)模拟的部分数据预测未来的物理性质。它使用图卷积网络(GCN)从MD模拟的原子坐标中提取潜在向量,利用LSTM学习潜在向量的时间趋势,并通过完全连接的层对物理属性进行一步预测。用Ni固液体系的MD模拟进行验证,该模型使用剩余连接实现了对凝固和熔化过程中势能时间变化的准确一步预测。递归使用预测值可以仅从MD模拟的前20个快照进行长期预测。该预测捕捉到了低温下势能弯曲的特征,这代表凝固的完成,尽管MD数据在短时间内不具有这样的弯曲特性。值得注意的是,对于超过900ps的长期预测,计算时间减少到相同持续时间的完整MD模拟的1/700。通过有效地利用MD模拟的数据,该方法已显示出显着降低物理属性预测的计算成本的潜力。 .
    This study proposes a novel long short-term memory (LSTM)-based model for predicting future physical properties based on partial data of molecular dynamics (MD) simulation. It extracts latent vectors from atomic coordinates of MD simulations using graph convolutional network, utilizes LSTM to learn temporal trends in latent vectors and make one-step-ahead predictions of physical properties through fully connected layers. Validating with MD simulations of Ni solid-liquid systems, the model achieved accurate one-step-ahead prediction for time variation of the potential energy during solidification and melting processes using residual connections. Recursive use of predicted values enabled long-term prediction from just the first 20 snapshots of the MD simulation. The prediction has captured the feature of potential energy bending at low temperatures, which represents completion of solidification, despite that the MD data in short time do not have such a bending characteristic. Remarkably, for long-time prediction over 900 ps, the computation time was reduced to 1/700th of a full MD simulation of the same duration. This approach has shown the potential to significantly reduce computational cost for prediction of physical properties by efficiently utilizing the data of MD simulation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    准确识别必需蛋白质对于药物研究和疾病诊断至关重要。传统的中心性方法和机器学习方法在准确识别必需蛋白质方面经常面临挑战。主要依靠来自蛋白质-蛋白质相互作用(PPI)网络的信息。尽管一些研究人员尝试整合生物数据和PPI网络来预测必需蛋白质,设计有效的集成方法仍然是一个挑战。为了应对这些挑战,本文介绍了ACDMBI模型,专门设计来克服上述问题。ACDMBI由两个关键模块组成:特征提取和分类。在捕获相关信息方面,我们从三个不同的数据源中获得见解。最初,通过群落划分从PPI网络中提取蛋白质的结构特征。随后,这些功能使用图卷积网络(GCN)和图注意网络(GAT)进一步优化。往前走,利用双向长短期记忆网络(BiLSTM)和多头自我注意机制从基因表达数据中提取蛋白质特征。最后,蛋白质特征是通过将亚细胞定位数据映射到一维向量并通过完全连接的层进行处理而得出的。在分类阶段,我们集成了从三个不同数据源中提取的特征,构建用于蛋白质分类预测的多层深度神经网络(DNN)。酿酒酵母数据的实验结果展示了ACDMBI模型的优越性能,AUC达到0.9533,AUPR达到0.9153。消融实验进一步表明,来自不同生物信息的特征的有效整合显着提高了模型的性能。
    Accurately identifying essential proteins is vital for drug research and disease diagnosis. Traditional centrality methods and machine learning approaches often face challenges in accurately discerning essential proteins, primarily relying on information derived from protein-protein interaction (PPI) networks. Despite attempts by some researchers to integrate biological data and PPI networks for predicting essential proteins, designing effective integration methods remains a challenge. In response to these challenges, this paper presents the ACDMBI model, specifically designed to overcome the aforementioned issues. ACDMBI is comprised of two key modules: feature extraction and classification. In terms of capturing relevant information, we draw insights from three distinct data sources. Initially, structural features of proteins are extracted from the PPI network through community division. Subsequently, these features are further optimized using Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT). Moving forward, protein features are extracted from gene expression data utilizing Bidirectional Long Short-Term Memory networks (BiLSTM) and a multi-head self-attention mechanism. Finally, protein features are derived by mapping subcellular localization data to a one-dimensional vector and processing it through fully connected layers. In the classification phase, we integrate features extracted from three different data sources, crafting a multi-layer deep neural network (DNN) for protein classification prediction. Experimental results on brewing yeast data showcase the ACDMBI model\'s superior performance, with AUC reaching 0.9533 and AUPR reaching 0.9153. Ablation experiments further reveal that the effective integration of features from diverse biological information significantly boosts the model\'s performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    人体运动捕捉技术,它利用传感器来跟踪关键骨架点的运动轨迹,近年来逐步从工业应用向更广泛的民用应用过渡。它广泛用于游戏开发等领域,数字人体建模,和体育科学。然而,这些传感器的可负担性通常会损害运动数据的准确性。低成本运动捕捉方法通常导致所捕捉的运动数据中的错误。我们介绍了一种使用基于时空注意力的图卷积网络(ST-ATGCN)进行人体运动重建和增强的新方法,它可以有效地学习人体骨骼结构和运动逻辑,而无需事先了解人体运动学知识。该方法能够实现无监督的运动数据恢复,并且显著降低与获得精确的运动捕获数据相关联的成本。我们的实验,在两个广泛的运动数据集上进行,并使用真实的运动捕捉传感器,如索尼(东京,日本)莫科皮,证明了该方法在提高低精度运动捕获数据质量方面的有效性。实验表明ST-ATGCN有可能提高运动捕捉技术的可访问性和准确性。
    Human motion capture technology, which leverages sensors to track the movement trajectories of key skeleton points, has been progressively transitioning from industrial applications to broader civilian applications in recent years. It finds extensive use in fields such as game development, digital human modeling, and sport science. However, the affordability of these sensors often compromises the accuracy of motion data. Low-cost motion capture methods often lead to errors in the captured motion data. We introduce a novel approach for human motion reconstruction and enhancement using spatio-temporal attention-based graph convolutional networks (ST-ATGCNs), which efficiently learn the human skeleton structure and the motion logic without requiring prior human kinematic knowledge. This method enables unsupervised motion data restoration and significantly reduces the costs associated with obtaining precise motion capture data. Our experiments, conducted on two extensive motion datasets and with real motion capture sensors such as the SONY (Tokyo, Japan) mocopi, demonstrate the method\'s effectiveness in enhancing the quality of low-precision motion capture data. The experiments indicate the ST-ATGCN\'s potential to improve both the accessibility and accuracy of motion capture technology.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号