Gene co-expression network

基因共表达网络
  • 文章类型: Journal Article
    单细胞转录组的基因共表达分析,旨在定义基因之间的功能关系,由于过多的dropout值而具有挑战性。这里,我们开发了一种单细胞图形高斯模型(SingleCellGGM)算法来进行单细胞基因共表达网络分析。当应用于小鼠单细胞数据集时,SingleCellGGM构建了网络,从中鉴定了具有高度显着功能富集的基因共表达模块。我们将这些模块视为基因表达程序(GEP)。这些GEP可以直接对单个细胞进行细胞类型注释,而无需细胞聚类,它们富含相应细胞功能所需的基因,有时水平超过10倍。GEP在数据集之间是保守的,并且能够在不同的研究之间进行通用的细胞类型标签转移。我们还提出了一种通过GEP平均进行单细胞分析的降维方法,提高结果的可解释性。因此,SingleCellGGM提供基于GEP的独特视角来分析单细胞转录组,并揭示不同单细胞数据集共享的生物学见解。
    Gene co-expression analysis of single-cell transcriptomes, aiming to define functional relationships between genes, is challenging due to excessive dropout values. Here, we developed a single-cell graphical Gaussian model (SingleCellGGM) algorithm to conduct single-cell gene co-expression network analysis. When applied to mouse single-cell datasets, SingleCellGGM constructed networks from which gene co-expression modules with highly significant functional enrichment were identified. We considered the modules as gene expression programs (GEPs). These GEPs enable direct cell-type annotation of individual cells without cell clustering, and they are enriched with genes required for the functions of the corresponding cells, sometimes at levels greater than 10-fold. The GEPs are conserved across datasets and enable universal cell-type label transfer across different studies. We also proposed a dimension-reduction method through averaging by GEPs for single-cell analysis, enhancing the interpretability of results. Thus, SingleCellGGM offers a unique GEP-based perspective to analyze single-cell transcriptomes and reveals biological insights shared by different single-cell datasets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:射血分数保留的心力衰竭(HFpEF)是一种常见综合征,具有高发病率和高死亡率,但没有可用的循证疗法。研究临床前HFpEF动物模型中基因表达谱的变化至关重要。目的是寻找新的治疗靶点。
    方法:野生型雄性C57BL/6J小鼠给予高脂饮食(HFD)和使用N-硝基-l-精氨酸甲酯(l-NAME)抑制组成型一氧化氮合酶的组合,持续5周和7周。进行RNA测序以检测基因表达谱,并进行生物信息学分析以鉴定核心基因,通路,和涉及的生物过程。
    结果:在干预后第5周和第7周,共有1,347个基因在心脏中差异表达。基因本体论富集分析表明,这些发生较大改变的基因主要参与细胞粘附,中性粒细胞趋化性,细胞通讯,和其他功能。使用层次聚类分析,这些差异表达的基因分为16个。其中,最终确定了三个重要的概况.基因共表达网络分析表明,肌钙蛋白T1型(Tnnt1)直接调节31个相邻基因,被认为是相关基因网络的核心。
    结论:RNA测序的联合应用,层次聚类分析,和基因网络分析确定Tnnt1是HFpEF发展中最重要的基因。
    BACKGROUND: Heart failure with preserved ejection fraction (HFpEF) is a common syndrome with high morbidity and mortality but without available evidence-based therapies. It is essential to investigate changes in gene expression profiles in preclinical HFpEF animal models, with the aim of searching for novel therapeutic targets.
    METHODS: Wild-type male C57BL/6J mice were administrated with a combination of high-fat diet (HFD) and inhibition of constitutive nitric oxide synthase using N-nitro-l-arginine methyl ester (l-NAME) for 5 and 7 weeks. RNA sequencing was conducted to detect gene expression profiles, and bioinformatic analysis was performed to identify the core genes, pathways, and biological processes involved.
    RESULTS: A total of 1,347 genes were differentially expressed in the heart at week 5 and 7 post-intervention. Gene Ontology enrichment analysis indicated that these greatly changed genes were involved mainly in cell adhesion, neutrophil chemotaxis, cell communication, and other functions. Using hierarchical cluster analysis, these differentially expressed genes were classified into 16 profiles. Of these, three significant profiles were ultimately identified. Gene co-expression network analysis suggested troponin T type 1 (Tnnt1) directly regulated 31 neighboring genes and was considered to be at the core of the associated gene network.
    CONCLUSIONS: The combined application of RNA sequencing, hierarchical cluster analysis, and gene network analysis identified Tnnt1 as the most important gene in the development of HFpEF.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:由于当前的气候变化而导致的温度升高极大地影响了农作物的种植,导致产量损失和果实品质改变。番茄是种植和消费最广泛的园艺产品之一,尽管它可以承受各种气候条件,热胁迫会影响植物的生长发育,特别是在生殖阶段,严重影响最终产量。在目前的工作中,通过探索一种耐热基因型(E42)的调节基因网络,研究了其热应激反应机制。这是通过启动子分析来实现的,该分析基于对启动子中的热应力元件(HSE)作图的识别,结合基因共表达网络分析,旨在识别与热相关的基因之间的相互作用。
    结果:结果突出显示了82个在启动子中呈现HSE的基因,属于通过GCN分析获得的52个基因网络之一;其中61个也与热休克因子(Hsfs)相互作用。最后,13个候选基因的列表,包括两个Hsfs,9个热休克蛋白(Hsps)和2个GDSL酯酶/脂肪酶(GELPs)通过关注那些在启动子中表现出HSE的E42基因,与Hsfs交互并显示变体,与亨氏参考基因组相比,对翻译蛋白具有高和/或中等影响。其中,基因本体论注释分析表明,只有LeHsp100(Solyc02g088610)属于特定参与热应激反应的网络。
    结论:作为一个整体,对番茄的基因组和基因组数据进行的生物信息学分析相结合,与在耐热性E42的HS相关基因中检测到的多态性一起,可以确定参与番茄HS反应的候选基因的子集。这项研究为研究非生物应激反应机制提供了一种新方法,并将进行进一步的研究以验证突出基因的作用。
    BACKGROUND: The increase in temperatures due to the current climate change dramatically affects crop cultivation, resulting in yield losses and altered fruit quality. Tomato is one of the most extensively grown and consumed horticultural products, and although it can withstand a wide range of climatic conditions, heat stress can affect plant growth and development specially on the reproductive stage, severely influencing the final yield. In the present work, the heat stress response mechanisms of one thermotolerant genotype (E42) were investigated by exploring its regulatory gene network. This was achieved through a promoter analysis based on the identification of the heat stress elements (HSEs) mapping in the promoters, combined with a gene co-expression network analysis aimed at identifying interactions among heat-related genes.
    RESULTS: Results highlighted 82 genes presenting HSEs in the promoter and belonging to one of the 52 gene networks obtained by the GCN analysis; 61 of these also interact with heat shock factors (Hsfs). Finally, a list of 13 candidate genes including two Hsfs, nine heat shock proteins (Hsps) and two GDSL esterase/lipase (GELPs) were retrieved by focusing on those E42 genes exhibiting HSEs in the promoters, interacting with Hsfs and showing variants, compared to Heinz reference genome, with HIGH and/or MODERATE impact on the translated protein. Among these, the Gene Ontology annotation analysis evidenced that only LeHsp100 (Solyc02g088610) belongs to a network specifically involved in the response to heat stress.
    CONCLUSIONS: As a whole, the combination of bioinformatic analyses carried out on genomic and trascriptomic data available for tomato, together with polymorphisms detected in HS-related genes of the thermotolerant E42 allowed to determine a subset of candidate genes involved in the HS response in tomato. This study provides a novel approach in the investigation of abiotic stress response mechanisms and further studies will be conducted to validate the role of the highlighted genes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    最近的技术,如空间转录组学,能够在单细胞水平测量基因表达以及这些细胞在组织中的空间位置。细胞的空间聚类为理解组织的功能组织提供了有价值的见解。然而,大多数这样的聚类方法涉及一些维度减少,导致组织中任何空间位置的基因之间固有的依赖结构的损失。除了可能影响空间聚类性能外,这还破坏了对基因共表达模式的宝贵见解。在空间转录组学中,矩阵变量基因表达数据,以及单个细胞的空间坐标,通过其行和列协方差提供有关基因表达依赖性和细胞空间依赖性的信息。在这项工作中,我们提出了一种联合贝叶斯方法来同时估计这些基因和空间细胞的相关性。这些估计为下游分析提供数据摘要。我们通过对几个真实的空间转录组数据集的模拟和分析来说明我们的方法。我们的工作阐明了基因共表达网络以及细胞的清晰空间聚类模式。此外,我们的分析表明,下游空间差异分析可能有助于从已知的标记基因中发现未知的细胞类型.
    Recent technologies such as spatial transcriptomics, enable the measurement of gene expressions at the single-cell level along with the spatial locations of these cells in the tissue. Spatial clustering of the cells provides valuable insights into the understanding of the functional organization of the tissue. However, most such clustering methods involve some dimension reduction that leads to a loss of the inherent dependency structure among genes at any spatial location in the tissue. This destroys valuable insights of gene co-expression patterns apart from possibly impacting spatial clustering performance. In spatial transcriptomics, the matrix-variate gene expression data, along with spatial coordinates of the single cells, provides information on both gene expression dependencies and cell spatial dependencies through its row and column covariances. In this work, we propose a joint Bayesian approach to simultaneously estimate these gene and spatial cell correlations. These estimates provide data summaries for downstream analyses. We illustrate our method with simulations and analysis of several real spatial transcriptomic datasets. Our work elucidates gene co-expression networks as well as clear spatial clustering patterns of the cells. Furthermore, our analysis reveals that downstream spatial-differential analysis may aid in the discovery of unknown cell types from known marker genes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Preprint
    研究基因型-表型关联的潜在机制在遗传学中至关重要。基因表达研究加深了我们对基因型→表达→表型机制的理解。然而,传统的表达数量性状位点(eQTL)方法往往忽视了基因共表达网络在将基因型转化为表型中的关键作用。这种差距凸显了需要更强大的统计方法来分析基因型→网络→表型机制。这里,我们开发了一种基于网络的方法,叫做snQTL,绘制影响基因共表达网络的数量性状基因座。我们的方法通过基于张量的光谱统计来测试基因型与基因共表达的联合差异网络之间的关联。从而克服了现有方法中普遍存在的多种测试挑战。我们证明了snQTL在分析三刺棘鱼(Gasterosteusaculeatus)数据中的有效性。与传统方法相比,我们的方法snQTL揭示了影响基因共表达网络的染色体区域,包括传统eQTL分析会遗漏的一个强候选基因。我们的框架提出了当前方法的局限性,并为功能性基因座发现提供了强大的基于网络的工具。
    这项工作解决了在理解基因型-表型关联的机制基础方面的关键差距。虽然现有的表达数量性状基因座(eQTL)方法鉴定了影响基因表达变异的候选基因座,他们常常忽视基因共表达网络的关键作用。这里,我们开发了一个基于网络的QTL框架来绘制影响基因共表达网络的遗传位点。利用基于张量的谱方法,我们的snQTL方法估计了差异共表达模式,并有效地识别了相关的遗传基因座。将snQTL应用于三刺棘鱼,揭示了标准方法错过的候选基因座。这项工作表明了当前方法的局限性,并强调了基于网络的功能基因座发现的潜力。
    Studying the mechanisms underlying the genotype-phenotype association is crucial in genetics. Gene expression studies have deepened our understanding of the genotype → expression → phenotype mechanisms. However, traditional expression quantitative trait loci (eQTL) methods often overlook the critical role of gene co-expression networks in translating genotype into phenotype. This gap highlights the need for more powerful statistical methods to analyze genotype → network → phenotype mechanism. Here, we develop a network-based method, called snQTL, to map quantitative trait loci affecting gene co-expression networks. Our approach tests the association between genotypes and joint differential networks of gene co-expression via a tensor-based spectral statistics, thereby overcoming the ubiquitous multiple testing challenges in existing methods. We demonstrate the effectiveness of snQTL in the analysis of three-spined stickleback (Gasterosteus aculeatus) data. Compared to conventional methods, our method snQTL uncovers chromosomal regions affecting gene co-expression networks, including one strong candidate gene that would have been missed by traditional eQTL analyses. Our framework suggests the limitation of current approaches and offers a powerful network-based tool for functional loci discoveries.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着越来越多的可用基因组数据被发表,已经开发了几个数据库来破译早期哺乳动物胚胎发生;然而,奶牛胚胎发育早期天然免疫基因表达调控的研究较少。为此,我们在全基因组水平上探索了先天免疫基因的调控机制。基于比较基因组学,通过收集人类先天免疫基因的最新报告和更新的牛基因组数据进行比较,获得了1473个牛先天免疫基因,初步建立了牛先天免疫基因数据库。为了确定奶牛早期胚胎天然免疫基因的调控机制,对奶牛早期胚胎不同发育阶段的先天免疫基因进行加权共表达网络分析。结果表明,特定模块相关基因在MAPK信号通路中显著富集。蛋白质-蛋白质相互作用(PPI)分析显示每个特定模块中的基因相互作用,和10个最高连接基因被选为潜在的枢纽基因。最后,结合差异表达基因(DEGs)的结果,ATF3,IL6,CD8A,CD69,CD86,HCK,ERBB3,LCK,ITGB2,LYN,和ERBB2被鉴定为奶牛早期胚胎先天免疫的关键基因。总之,通过对牛和人类全基因组的比较分析,确定了牛先天免疫基因集,构建了奶牛胚胎早期先天免疫基因共表达网络。本研究结果为探索先天免疫基因在奶牛早期胚胎发育中的参与和调控提供了依据。
    As more and more of the available genomic data have been published, several databases have been developed for deciphering early mammalian embryogenesis; however, less research has been conducted on the regulation of the expression of natural immunity genes during early embryonic development in dairy cows. To this end, we explored the regulatory mechanism of innate immunity genes at the whole-genome level. Based on comparative genomics, 1473 innate immunity genes in cattle were obtained by collecting the latest reports on human innate immunity genes and updated bovine genome data for comparison, and a preliminary database of bovine innate immunity genes was constructed. In order to determine the regulatory mechanism of innate immune genes in dairy cattle early embryos, we conducted weighted co-expression network analysis of the innate immune genes at different developmental stages of dairy cattle early embryos. The results showed that specific module-related genes were significantly enriched in the MAPK signaling pathway. Protein-protein interaction (PPI) analysis showed gene interactions in each specific module, and 10 of the highest connectivity genes were chosen as potential hub genes. Finally, combined with the results for differential expressed genes (DEGs), ATF3, IL6, CD8A, CD69, CD86, HCK, ERBB3, LCK, ITGB2, LYN, and ERBB2 were identified as the key genes of innate immunity in dairy cattle early embryos. In conclusion, the bovine innate immunity gene set was determined and the co-expression network of innate immunity genes in the early embryonic stage of dairy cattle was constructed by comparing and analyzing the whole genome of bovines and humans. The findings in this study provide the basis for exploring the involvement and regulation of innate immune genes in the early embryonic development of dairy cattle.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    我们考虑在多层随机块模型中估计常见社区结构的问题,其中每个单层可能没有足够的信号强度来恢复完整的社区结构。为了有效地聚合不同层的信号,我们认为,即使各个层非常稀疏,平方和邻接矩阵也包含足够的信号。我们的方法使用了偏置去除步骤,当平方噪声矩阵可能在非常稀疏的状态下淹没信号时,这是必要的。我们方法的分析依赖于具有矩阵值系数和矩阵值二次形式的矩阵线性组合的几个新的尾部概率界限,这可能是独立的利益。在合成数据和有关基因共表达网络的微阵列分析中证明了我们方法的性能和消除偏见的必要性。
    We consider the problem of estimating common community structures in multi-layer stochastic block models, where each single layer may not have sufficient signal strength to recover the full community structure. In order to efficiently aggregate signal across different layers, we argue that the sum-of-squared adjacency matrices contain sufficient signal even when individual layers are very sparse. Our method uses a bias-removal step that is necessary when the squared noise matrices may overwhelm the signal in the very sparse regime. The analysis of our method relies on several novel tail probability bounds for matrix linear combinations with matrix-valued coefficients and matrix-valued quadratic forms, which may be of independent interest. The performance of our method and the necessity of bias removal is demonstrated in synthetic data and in microarray analysis about gene co-expression networks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    寻找复杂疾病的生物标志物基因由于其在临床中的应用而受到持续关注。在本文中,我们提出了一种基于网络的方法来获得一组生物标记基因。关键思想是在敏感基因之间构建基因共表达网络,并将基因簇为不同的模块。对于每个模块,我们可以确定它的代表,即,与模块内其他基因具有最大连通性和最小平均最短路径长度的基因。我们相信这些代表性基因可以作为一组新的潜在的疾病生物标志物。作为一个典型的例子,我们调查了老年痴呆症,获得总共16个潜在的代表基因,其中三个属于非转录组。从不同的角度和方法在文献中发现了这些基因中的11个。使用机器学习算法将初期组分为两个不同的亚型。我们对这两种亚型进行了基因本体论分析和京都基因百科全书,并对健康组和中等组进行了基因组分析,分别。这两个亚型参与了两个不同的生物过程,证明了这种方法的有效性。这种方法是疾病特异性和独立的;因此,它可以扩展到对其他类型的复杂疾病进行分类。
    Finding biomarker genes for complex diseases attracts persistent attention due to its application in clinics. In this paper, we propose a network-based method to obtain a set of biomarker genes. The key idea is to construct a gene co-expression network among sensitive genes and cluster the genes into different modules. For each module, we can identify its representative, i.e., the gene with the largest connectivity and the smallest average shortest path length to other genes within the module. We believe these representative genes could serve as a new set of potential biomarkers for diseases. As a typical example, we investigated Alzheimer\'s disease, obtaining a total of 16 potential representative genes, three of which belong to the non-transcriptome. A total of 11 out of these genes are found in literature from different perspectives and methods. The incipient groups were classified into two different subtypes using machine learning algorithms. We subjected the two subtypes to Gene Ontology analysis and Kyoto Encyclopedia of Genes and Genomes analysis with healthy groups and moderate groups, respectively. The two sub-type groups were involved in two different biological processes, demonstrating the validity of this approach. This method is disease-specific and independent; hence, it can be extended to classify other kinds of complex diseases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    多年生木本植物八仙花由于其独特的花卉发育机制而具有重要的研究价值。导致装饰花,可以享受相对较长的时间。然而,目前尚不完全了解拟南芥“Annabelle”当年花卉发育的调节机制。在这项研究中,我们进行了相关分析,以通过结合物候观察来探索H.arborescens\'Annabelle''的核心调节网络,生理测定,和七个花卉发育阶段的转录组比较。通过这种分析,我们构建了一个基于最高倒数排序(HRR)的基因共表达网络(GCN),使用从七个开花相关途径中鉴定的509个差异表达基因(DEGs),以及八种开花相关植物激素的生物合成和转录组学分析中的信号转导。根据GCN的分析,我们确定了14个具有最高功能连接的关键基因,这些基因在特定发育阶段起关键作用.我们证实了135个转录因子(AP2/ERF,bHLH,CO-like,GRAS,MIKC,SBP,WRKY)与14个关键基因高度共表达,表明它们与当年花卉的发展密切相关。我们进一步提出了全花发育的基因调控网络的假设模型。这个模型表明光周期,老化,和赤霉素途径,随着植物激素脱落酸(ABA),赤霉素(GA),油菜素类固醇(BR),和茉莉酸(JA),协同工作,促进花卉过渡。此外,生长素,GA,JA,ABA,水杨酸(SA)通过参与生物钟来调节开花过程。细胞分裂素(CTK),乙烯(ETH),SA是影响花衰老的关键调节剂。此外,几个花卉集成商(HaLFY,HaSOC1-2,HaAP1,HaFULL,HaAGL24,HaFLC,等。)是树枝花发育的主要贡献者。总的来说,这项研究提供了一个全面的理解的动态机制,潜在的整个过程中的年度花卉发展,从而为进一步研究拟南芥“Annabelle”的花发育提供有价值的见解。
    The perennial woody plant Hydrangea arborescens \'Annabelle\' is of great research value due to its unique mechanism of flower development that occurs in the current year, resulting in decorative flowers that can be enjoyed for a relatively long period of time. However, the mechanisms underlying the regulation of current-year flower development in H. arborescens \'Annabelle\' are still not fully understood. In this study, we conducted an associated analysis to explore the core regulating network in H. arborescens \'Annabelle\' by combining phenological observations, physiological assays, and transcriptome comparisons across seven flower developmental stages. Through this analysis, we constructed a gene co-expression network (GCN) based on the highest reciprocal rank (HRR), using 509 differentially expressed genes (DEGs) identified from seven flowering-related pathways, as well as the biosynthesis of eight flowering-related phytohormones and signal transduction in the transcriptomic analysis. According to the analysis of the GCN, we identified 14 key genes with the highest functional connectivity that played critical roles in specific development stages. We confirmed that 135 transcription factors (AP2/ERF, bHLH, CO-like, GRAS, MIKC, SBP, WRKY) were highly co-expressed with the 14 key genes, indicating their close associations with the development of current-year flowers. We further proposed a hypothetical model of a gene regulatory network for the development of the whole flower. This model suggested that the photoperiod, aging, and gibberellin pathways, along with the phytohormones abscisic acid (ABA), gibberellin (GA), brassinosteroid (BR), and jasmonic acid (JA), work synergistically to promote the floral transition. Additionally, auxin, GA, JA, ABA, and salicylic acid (SA) regulated the blooming process by involving the circadian clock. Cytokinin (CTK), ethylene (ETH), and SA were key regulators that affected flower senescence. Additionally, several floral integrators (HaLFY, HaSOC1-2, HaAP1, HaFULL, HaAGL24, HaFLC, etc.) were dominant contributors to the development of H. arborescens flowers. Overall, this research provides a comprehensive understanding of the dynamic mechanism underlying the entire process of current-year flower development, thereby offering valuable insights for further studies on the flower development of H. arborescens \'Annabelle\'.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:通量平衡分析(FBA)是一种关键的代谢建模方法,用于模拟稳态条件下的细胞代谢。它的简单性和多功能性导致了将转录组和蛋白质组数据整合到FBA中的各种策略,成功预测通量分布和表型结果。然而,尽管取得了这些进步,尚未开发的潜力在于利用基因相关的连接,如共表达模式,以获得有价值的见解。
    结果:为了填补这一空白,我们介绍ICON-GEM,一种创新的基于约束的模型,将基因共表达网络纳入FBA模型,有助于更精确地确定通量分布和功能路径。在这项研究中,来自大肠杆菌和酿酒酵母的转录组数据被整合到它们各自的基因组尺度代谢模型中。构建了一个全面的基因共表达网络,作为细胞代谢机制的全局视图。通过利用二次规划,我们最大化了反应通量对之间的比对及其在共表达网络中相应基因的相关性。结果表明,ICON-GEM在预测准确性方面优于现有方法。子系统和功能模块上的通量变化也证明了有希望的结果。此外,涉及不同类型的生物网络的比较,包括蛋白质-蛋白质相互作用和随机网络,揭示了共表达网络在基因组规模代谢工程中的应用。
    结论:ICON-GEMs引入了一种创新的约束模型,能够同时整合基因共表达网络,准备在不同的转录组数据集和多个生物体的板应用。它是免费的开源在https://github.com/ThummaratPaklao/ICOM-GEMs.git。
    BACKGROUND: Flux Balance Analysis (FBA) is a key metabolic modeling method used to simulate cellular metabolism under steady-state conditions. Its simplicity and versatility have led to various strategies incorporating transcriptomic and proteomic data into FBA, successfully predicting flux distribution and phenotypic results. However, despite these advances, the untapped potential lies in leveraging gene-related connections like co-expression patterns for valuable insights.
    RESULTS: To fill this gap, we introduce ICON-GEMs, an innovative constraint-based model to incorporate gene co-expression network into the FBA model, facilitating more precise determination of flux distributions and functional pathways. In this study, transcriptomic data from both Escherichia coli and Saccharomyces cerevisiae were integrated into their respective genome-scale metabolic models. A comprehensive gene co-expression network was constructed as a global view of metabolic mechanism of the cell. By leveraging quadratic programming, we maximized the alignment between pairs of reaction fluxes and the correlation of their corresponding genes in the co-expression network. The outcomes notably demonstrated that ICON-GEMs outperformed existing methodologies in predictive accuracy. Flux variabilities over subsystems and functional modules also demonstrate promising results. Furthermore, a comparison involving different types of biological networks, including protein-protein interactions and random networks, reveals insights into the utilization of the co-expression network in genome-scale metabolic engineering.
    CONCLUSIONS: ICON-GEMs introduce an innovative constrained model capable of simultaneous integration of gene co-expression networks, ready for board application across diverse transcriptomic data sets and multiple organisms. It is freely available as open-source at https://github.com/ThummaratPaklao/ICOM-GEMs.git .
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号