gene ontology

基因本体
  • 文章类型: Journal Article
    2019年冠状病毒病(COVID-19)大流行在全球产生了重大影响,导致更高的死亡人数和幸存者持续的健康问题,特别是那些有预先存在的医疗条件。许多研究表明,灾难性的COVID-19结果与糖尿病之间存在很强的相关性。为了获得更深入的见解,我们分析了COVID-19和糖尿病周围神经病患者的转录组数据集.使用R编程语言,差异表达基因(DEGs)进行鉴定和分类的基础上,向上和向下的规定。然后在这些组之间探索DEG的重叠。使用基因本体论(GO)对这些常见DEG进行功能注释,京都基因和基因组百科全书(KEGG),生物星球,Reactome,和Wiki途径。使用生物信息学工具创建了蛋白质-蛋白质相互作用(PPI)网络,以了解分子相互作用。通过对PPI网络的拓扑分析,我们确定了hub基因模块并探索了基因调控网络(GRN).此外,该研究扩展到基于综合分析为已鉴定的相互DEG提出潜在的药物分子.通过深入了解潜在的治疗干预措施,这些方法可能有助于了解COVID-19在糖尿病周围神经病变患者中的分子复杂性。
    The coronavirus disease 2019 (COVID-19) pandemic has had a significant impact globally, resulting in a higher death toll and persistent health issues for survivors, particularly those with pre-existing medical conditions. Numerous studies have demonstrated a strong correlation between catastrophic COVID-19 results and diabetes. To gain deeper insights, we analysed the transcriptome dataset from COVID-19 and diabetic peripheral neuropathic patients. Using the R programming language, differentially expressed genes (DEGs) were identified and classified based on up and down regulations. The overlaps of DEGs were then explored between these groups. Functional annotation of those common DEGs was performed using Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Bio-Planet, Reactome, and Wiki pathways. A protein-protein interaction (PPI) network was created with bioinformatics tools to understand molecular interactions. Through topological analysis of the PPI network, we determined hub gene modules and explored gene regulatory networks (GRN). Furthermore, the study extended to suggesting potential drug molecules for the identified mutual DEG based on the comprehensive analysis. These approaches may contribute to understanding the molecular intricacies of COVID-19 in diabetic peripheral neuropathy patients through insights into potential therapeutic interventions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    肾移植(KT)是治疗终末期肾病的最佳方法。尽管随着免疫抑制剂的发展,移植物的长期和短期生存率显着提高,急性排斥反应(AR)仍然是攻击移植物和患者的主要危险因素。先天免疫应答在排斥反应中起重要作用。因此,我们的目标是确定KT后与AR相关的先天性免疫的生物标志物,并为未来的研究提供支持.
    基于来自NCBI基因表达合成数据库(GEO)的数据集GSE174020进行差异表达基因(DEGs)分析,然后与分子特征数据库中鉴定的GSE5099M1巨噬细胞相关基因组合。然后,我们鉴定了DEGs中与M1巨噬细胞相关的基因,定义为DEM1Gs,并进行了基因本体论(GO)和京都基因组百科全书(KEGG)富集分析。使用Cibersort分析AR期间的免疫细胞浸润。同时,我们使用蛋白质-蛋白质相互作用(PPI)网络和Cytoscape软件来确定关键基因。数据集,来自儿科患者的GSE14328,GSE138043和GSE9493来源于成人患者,用于验证Hub基因。另外的验证是大鼠KT模型,用于进行HE染色,免疫组织化学染色,西方的Blot。在HPA数据库中搜索Hub基因以确认它们的表达。最后,我们构建了转录因子(TF)-Hub基因和miRNA-Hub基因的相互作用网络。
    与正常组相比,366个基因上调,AR组中有423个基因下调。然后,在这些基因中发现了106个与M1巨噬细胞相关的基因。GO和KEGG富集分析表明,这些基因主要参与细胞因子的结合,抗原结合,NK细胞介导的细胞毒性,激活免疫受体和免疫反应,和炎症NF-κB信号通路的激活。两个Hub基因,即CCR7和CD48,通过PPI和Cytoscape分析鉴定。它们已经在外部验证集中进行了验证,起源于儿科患者和成人患者,和动物实验。在HPA数据库中,CCR7和CD48主要在T细胞中表达,B细胞,巨噬细胞,以及这些免疫细胞分布的组织。除了免疫浸润,CD4+T,CD8+T,NK细胞,NKT细胞,AR组单核细胞显著增加,这与Hub基因筛选的结果高度一致。最后,我们预测19个TFs和32个miRNAs可能与Hub基因相互作用。
    通过全面的生物信息学分析,我们的研究结果可能为KT后AR提供预测和治疗靶点.
    UNASSIGNED: Kidney transplantation (KT) is the best treatment for end-stage renal disease. Although long and short-term survival rates for the graft have improved significantly with the development of immunosuppressants, acute rejection (AR) remains a major risk factor attacking the graft and patients. The innate immune response plays an important role in rejection. Therefore, our objective is to determine the biomarkers of congenital immunity associated with AR after KT and provide support for future research.
    UNASSIGNED: A differential expression genes (DEGs) analysis was performed based on the dataset GSE174020 from the NCBI gene Expression Synthesis Database (GEO) and then combined with the GSE5099 M1 macrophage-related gene identified in the Molecular Signatures Database. We then identified genes in DEGs associated with M1 macrophages defined as DEM1Gs and performed gene ontology (GO) and Kyoto Encyclopedia of Genomes (KEGG) enrichment analysis. Cibersort was used to analyze the immune cell infiltration during AR. At the same time, we used the protein-protein interaction (PPI) network and Cytoscape software to determine the key genes. Dataset, GSE14328 derived from pediatric patients, GSE138043 and GSE9493 derived from adult patients, were used to verify Hub genes. Additional verification was the rat KT model, which was used to perform HE staining, immunohistochemical staining, and Western Blot. Hub genes were searched in the HPA database to confirm their expression. Finally, we construct the interaction network of transcription factor (TF)-Hub genes and miRNA-Hub genes.
    UNASSIGNED: Compared to the normal group, 366 genes were upregulated, and 423 genes were downregulated in the AR group. Then, 106 genes related to M1 macrophages were found among these genes. GO and KEGG enrichment analysis showed that these genes are mainly involved in cytokine binding, antigen binding, NK cell-mediated cytotoxicity, activation of immune receptors and immune response, and activation of the inflammatory NF-κB signaling pathway. Two Hub genes, namely CCR7 and CD48, were identified by PPI and Cytoscape analysis. They have been verified in external validation sets, originated from both pediatric patients and adult patients, and animal experiments. In the HPA database, CCR7 and CD48 are mainly expressed in T cells, B cells, macrophages, and tissues where these immune cells are distributed. In addition to immunoinfiltration, CD4+T, CD8+T, NK cells, NKT cells, and monocytes increased significantly in the AR group, which was highly consistent with the results of Hub gene screening. Finally, we predicted that 19 TFs and 32 miRNAs might interact with the Hub gene.
    UNASSIGNED: Through a comprehensive bioinformatic analysis, our findings may provide predictive and therapeutic targets for AR after KT.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    骨不连是一种常见的骨折并发症,可严重影响患者的预后。然而,其机制尚未完全理解。这项研究使用差异分析和加权共表达网络分析(WGCNA)来确定与骨折愈合相关的易感性模块和枢纽基因。两个数据集,GSE125289和GSE213891是从GEO网站下载的,和差异表达的miRNA和基因被分析并用于构建WGCNA网络。差异表达基因的基因本体论(GO)分析显示细胞因子和炎症因子分泌富集,吞噬作用,和跨高尔基网络调控途径。利用生物信息学位点预测和交叉基因搜索,miR-29b-3p被鉴定为LIN7A表达的调节因子,可能对骨折愈合产生负面影响。探讨了骨不愈合机制中潜在的miRNA-mRNA相互作用,miRNA-29-3p和LIN7A被鉴定为骨骼不愈合的生物标志物。使用qRT-PCR和ELISA验证了来自骨折不愈合患者的血液样品中miRNA-29b-3p和LIN7A的表达。总的来说,这项研究确定了与骨折不愈合相关的特征模块和关键基因,并提供了对其分子机制的见解。发现下调的miRNA-29b-3p下调LIN7A蛋白表达,这可能会影响骨折不愈合患者骨折后的愈合过程。这些发现可作为骨不愈合的预后生物标志物和潜在的治疗靶标。
    Bone non-union is a common fracture complication that can severely impact patient outcomes, yet its mechanism is not fully understood. This study used differential analysis and weighted co-expression network analysis (WGCNA) to identify susceptibility modules and hub genes associated with fracture healing. Two datasets, GSE125289 and GSE213891, were downloaded from the GEO website, and differentially expressed miRNAs and genes were analysed and used to construct the WGCNA network. Gene ontology (GO) analysis of the differentially expressed genes showed enrichment in cytokine and inflammatory factor secretion, phagocytosis, and trans-Golgi network regulation pathways. Using bioinformatic site prediction and crossover gene search, miR-29b-3p was identified as a regulator of LIN7A expression that may negatively affect fracture healing. Potential miRNA-mRNA interactions in the bone non-union mechanism were explored, and miRNA-29-3p and LIN7A were identified as biomarkers of skeletal non-union. The expression of miRNA-29b-3p and LIN7A was verified in blood samples from patients with fracture non-union using qRT-PCR and ELISA. Overall, this study identified characteristic modules and key genes associated with fracture non-union and provided insight into its molecular mechanisms. Downregulated miRNA-29b-3p was found to downregulate LIN7A protein expression, which may affect the healing process after fracture in patients with bone non-union. These findings may serve as a prognostic biomarker and potential therapeutic target for bone non-union.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:从基因表达数据中提取信息的一种广泛使用的方法是构建基因共表达网络和随后的基因簇计算检测,称为模块。WGCNA和相关方法是模块检测的事实上的标准。这项工作的目的是研究更复杂的算法对设计一种替代方法的适用性,该方法具有增强的提取生物学有意义的模块的潜力。
    结果:我们介绍了自学习基因聚类管道(SGCP),用于检测基因共表达网络中的模块的光谱方法。SGCP包含多个功能,使其与以前的工作不同,包括在自我学习步骤中利用基因本体论(GO)信息的新步骤。与在12个真实基因表达数据集上广泛使用的现有框架相比,我们表明SGCP产生具有较高GO富集的模块。此外,SGCP对与基线报告的术语大不相同的GO术语赋予最高的统计重要性。
    结论:在基因共表达网络中发现基因簇的现有框架是基于相对简单的算法组件。SGCP依赖于更新的算法技术,使高度丰富的模块具有独特的特点的计算,从而为基因共表达分析提供了一种新的替代工具。
    BACKGROUND: A widely used approach for extracting information from gene expression data employs the construction of a gene co-expression network and the subsequent computational detection of gene clusters, called modules. WGCNA and related methods are the de facto standard for module detection. The purpose of this work is to investigate the applicability of more sophisticated algorithms toward the design of an alternative method with enhanced potential for extracting biologically meaningful modules.
    RESULTS: We present self-learning gene clustering pipeline (SGCP), a spectral method for detecting modules in gene co-expression networks. SGCP incorporates multiple features that differentiate it from previous work, including a novel step that leverages gene ontology (GO) information in a self-leaning step. Compared with widely used existing frameworks on 12 real gene expression datasets, we show that SGCP yields modules with higher GO enrichment. Moreover, SGCP assigns highest statistical importance to GO terms that are mostly different from those reported by the baselines.
    CONCLUSIONS: Existing frameworks for discovering clusters of genes in gene co-expression networks are based on relatively simple algorithmic components. SGCP relies on newer algorithmic techniques that enable the computation of highly enriched modules with distinctive characteristics, thus contributing a novel alternative tool for gene co-expression analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    最近已证明封闭测试对于同时进行真实发现比例控制是最佳的。是的,然而,构建真正的发现保证程序具有挑战性,因为它将权力集中在用户根据他们的特定兴趣或专业知识选择的某些特征集上。我们提出了一个程序,允许用户以预定的功能集为目标电源,也就是说,\"焦点集。“尽管如此,该方法还允许推断事后选择的特征集,也就是说,\"非焦点集,\“为此,我们推导了一个由插值限制的真正的发现较低的置信度。我们的程序是由部分真实发现保证程序与Holm\的程序相结合而构建的,是封闭测试程序的保守捷径。仿真研究证实,对于焦点集,我们方法的统计能力相对较高,以非聚焦集的功率为代价,根据需要。此外,我们研究了具有特定结构的集合的功率属性,例如,树和有向无环图。我们还在可复制性分析的背景下将我们的方法与AdaFilter进行了比较。通过基因本体分析在基因表达数据中说明了我们方法的应用。
    Closed testing has recently been shown to be optimal for simultaneous true discovery proportion control. It is, however, challenging to construct true discovery guarantee procedures in such a way that it focuses power on some feature sets chosen by users based on their specific interest or expertise. We propose a procedure that allows users to target power on prespecified feature sets, that is, \"focus sets.\" Still, the method also allows inference for feature sets chosen post hoc, that is, \"nonfocus sets,\" for which we deduce a true discovery lower confidence bound by interpolation. Our procedure is built from partial true discovery guarantee procedures combined with Holm\'s procedure and is a conservative shortcut to the closed testing procedure. A simulation study confirms that the statistical power of our method is relatively high for focus sets, at the cost of power for nonfocus sets, as desired. In addition, we investigate its power property for sets with specific structures, for example, trees and directed acyclic graphs. We also compare our method with AdaFilter in the context of replicability analysis. The application of our method is illustrated with a gene ontology analysis in gene expression data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:双RNA测序是一种强大的工具,可以全面了解植物-微生物相互作用背后的分子动力学。RNA测序(RNA-seq)在植物-细菌相互作用的转录分析中存在技术障碍。尤其是在细菌转录组学中,由于存在丰富的核糖体RNA(rRNA),这可能限制了基本转录本的覆盖范围。因此,为了实现细菌转录组的经济有效和全面的测序,有必要设计有效的方法来消除rRNA和提高细菌mRNA的比例。在这项研究中,我们修改了链特异性双RNA-seq方法,目的是富集细菌感染植物样品中细菌mRNA的比例.富集方法包括通过polyA选择和rRNA去除以进行细菌mRNA富集,然后进行链特异性RNA-seq文库制备步骤来顺序分离植物mRNA。与传统方法相比,我们通过采用各种植物-细菌相互作用来评估富集方法的效率,包括宿主和非宿主与病原菌的抗性相互作用,以及分别使用辣椒和番茄植物与有益的根际相关细菌的相互作用。
    结果:在检查的所有植物-细菌相互作用的情况下,使用富集方法观察到作图效率的增加,尽管它产生了较低的读段计数.特别是在与Xanthmonascampestrispv的相容相互作用中。Vesicatoria比赛3(Xcv3),富集方法提高了Xcv3感染的辣椒样品对其自身基因组的作图率(15.09%;增加1.45倍)和CDS(8.92%;增加1.49倍)。在所研究的所有倍数变化阈值水平下,富集方法始终显示出比常规RNA-seq方法更多的差异表达基因(DEG)。特别是在辣椒中Xcv3感染的早期阶段。基因本体论(GO)富集分析显示,DEGs主要富集在蛋白水解中,激酶,丝氨酸型内肽酶和血红素结合活性。
    结论:本研究中证明的富集方法将作为现有RNA-seq方法的合适替代方法来富集细菌mRNA,并为植物-细菌相互作用中复杂的转录组改变提供新的见解。
    BACKGROUND: Dual RNA sequencing is a powerful tool that enables a comprehensive understanding of the molecular dynamics underlying plant-microbe interactions. RNA sequencing (RNA-seq) poses technical hurdles in the transcriptional analysis of plant-bacterial interactions, especially in bacterial transcriptomics, owing to the presence of abundant ribosomal RNA (rRNA), which potentially limits the coverage of essential transcripts. Therefore, to achieve cost-effective and comprehensive sequencing of the bacterial transcriptome, it is imperative to devise efficient methods for eliminating rRNA and enhancing the proportion of bacterial mRNA. In this study, we modified a strand-specific dual RNA-seq method with the goal of enriching the proportion of bacterial mRNA in the bacteria-infected plant samples. The enriched method involved the sequential separation of plant mRNA by poly A selection and rRNA removal for bacterial mRNA enrichment followed by strand specific RNA-seq library preparation steps. We assessed the efficiency of the enriched method in comparison to the conventional method by employing various plant-bacterial interactions, including both host and non-host resistance interactions with pathogenic bacteria, as well as an interaction with a beneficial rhizosphere associated bacteria using pepper and tomato plants respectively.
    RESULTS: In all cases of plant-bacterial interactions examined, an increase in mapping efficiency was observed with the enriched method although it produced a lower read count. Especially in the compatible interaction with Xanthmonas campestris pv. Vesicatoria race 3 (Xcv3), the enriched method enhanced the mapping ratio of Xcv3-infected pepper samples to its own genome (15.09%; 1.45-fold increase) and the CDS (8.92%; 1.49-fold increase). The enriched method consistently displayed a greater number of differentially expressed genes (DEGs) than the conventional RNA-seq method at all fold change threshold levels investigated, notably during the early stages of Xcv3 infection in peppers. The Gene Ontology (GO) enrichment analysis revealed that the DEGs were predominantly enriched in proteolysis, kinase, serine type endopeptidase and heme binding activities.
    CONCLUSIONS: The enriched method demonstrated in this study will serve as a suitable alternative to the existing RNA-seq method to enrich bacterial mRNA and provide novel insights into the intricate transcriptomic alterations within the plant-bacterial interplay.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:本研究旨在鉴定失调的基因,分子途径,和人乳头瘤病毒(HPV)相关宫颈癌的调节机制。我们研究了疾病相关基因以及基因本体论,生存预后,转录因子和微小RNA(miRNA)参与宫颈癌的发生,能够更深入地理解与HPV相关的宫颈癌。
    方法:我们使用10个可公开获取的基因表达综合(GEO)数据集来检查宫颈癌中的基因表达模式。差异表达基因(DEGs),这显示了宫颈癌和健康组织样本之间的明显区别,使用GEO2R工具进行分析。使用其他生物信息学技术进行途径分析和功能富集,以及分析基因表达改变与HPV感染之间的联系。
    结果:总计,与健康组织相比,48个DEGs被鉴定为在宫颈癌组织中差异表达。在DEG中,CCND1、CCNA2和SPP1是HPV相关宫颈癌的关键失调基因。针对这些基因鉴定的五种常见miRNA是miR-7-5p,miR-16-5p,miR-124-3p,miR-10b-5p和miR-27a-3p。miRNAhsa-miR-27a-3p靶向的hub-DEG受共同转录因子SP1控制。
    结论:本研究已经确定了参与HPV相关宫颈癌进展的DEGs以及调节它们的各种分子途径和转录因子。这些发现使人们更好地了解宫颈癌,从而开发和确定可能的治疗和干预目标。分别。
    BACKGROUND: The present study aimed to identify dysregulated genes, molecular pathways, and regulatory mechanisms in human papillomavirus (HPV)-associated cervical cancers. We have investigated the disease-associated genes along with the Gene Ontology, survival prognosis, transcription factors and the microRNA (miRNA) that are involved in cervical carcinogenesis, enabling a deeper comprehension of cervical cancer linked to HPV.
    METHODS: We used 10 publicly accessible Gene Expression Omnibus (GEO) datasets to examine the patterns of gene expression in cervical cancer. Differentially expressed genes (DEGs), which showed a clear distinction between cervical cancer and healthy tissue samples, were analyzed using the GEO2R tool. Additional bioinformatic techniques were used to carry out pathway analysis and functional enrichment, as well as to analyze the connection between altered gene expression and HPV infection.
    RESULTS: In total, 48 DEGs were identified to be differentially expressed in cervical cancer tissues in comparison to healthy tissues. Among DEGs, CCND1, CCNA2 and SPP1 were the key dysregulated genes involved in HPV-associated cervical cancer. The five common miRNAs that were identified against these genes are miR-7-5p, miR-16-5p, miR-124-3p, miR-10b-5p and miR-27a-3p. The hub-DEGs targeted by miRNA hsa-miR-27a-3p are controlled by the common transcription factor SP1.
    CONCLUSIONS: The present study has identified DEGs involved in HPV-associated cervical cancer progression and the various molecular pathways and transcription factors regulating them. These findings have led to a better understanding of cervical cancer resulting in the development and identification of possible therapeutic and intervention targets, respectively.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:胃癌是受遗传和环境因素影响的癌症相关死亡的主要原因。磷酸三苯酯(TPP)是一种普遍使用的阻燃剂,但其对健康的影响仍有待彻底理解。
    目的:通过检测基因表达模式和建立预测模型,探索TPP暴露与胃癌之间的联系。
    方法:基因表达数据来源于癌症基因组图谱(TCGA)和比较毒性基因组学数据库(CTD)。采用基因本体论(GO)和京都基因和基因组百科全书(KEGG)途径进行分析。单样品基因组富集分析(ssGSEA)用于获得磷酸酯阻燃剂相关评分。通过差异分析构建了预测模型,单变量COX回归,和LASSO回归。进行分子对接以评估蛋白质与TPP的相互作用。
    结果:ssGSEA确定了与胃癌中磷酸盐阻燃剂相关的分数,与免疫相关的性状有很强的相关性。鉴定了几种与TPP相关的基因,并将其用于开发具有临床意义的预后模型。分子对接显示了TPP与MTTP的高结合亲和力,与脂质代谢有关的基因。通路分析表明,TPP暴露通过脂质代谢过程促进胃癌的发生。
    结论:该研究建立了TPP暴露与胃癌发病之间的潜在相关性,精确定位涉及的关键基因和途径。这强调了环境因素在胃癌研究中的重要性,并为临床应用提供了潜在的诊断工具。
    BACKGROUND: Gastric cancer is a leading cause of cancer-related deaths influenced by both genetic and environmental factors. Triphenyl phosphate (TPP) is a prevalent flame retardant, but its health implications remain to be thoroughly understood.
    OBJECTIVE: To explore the link between TPP exposure and gastric cancer by examining gene expression patterns and developing a predictive model.
    METHODS: Gene expression data were sourced from The Cancer Genome Atlas (TCGA) and the Comparative Toxicogenomics Database (CTD). Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were employed for analysis. Single-sample Gene Set Enrichment Analysis (ssGSEA) was used to obtain phosphate flame retardant-related scores. A predictive model was constructed through differential analysis, univariate COX regression, and LASSO regression. Molecular docking was performed to assess protein interactions with TPP.
    RESULTS: ssGSEA identified scores related to phosphate flame retardants in gastric cancer, which had a strong association with immune-related traits. Several genes associated with TPP were identified and used to develop a prognostic model that has clinical significance. Molecular docking showed a high binding affinity of TPP with MTTP, a gene related to lipid metabolism. Pathway analysis indicated that TPP exposure contributes to gastric cancer through lipid metabolic processes.
    CONCLUSIONS: The study establishes a potential correlation between TPP exposure and gastric cancer onset, pinpointing key genes and pathways involved. This underscores the significance of environmental factors in gastric cancer research and presents a potential diagnostic tool for clinical application.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    本研究旨在研究与先兆子痫(PE)相关的免疫评分和基质评分相关的特征,并使用生物信息学分析确定诊断PE的关键基因。四个微阵列数据集,GSE75010、GSE25906、GSE44711和GSE10588从基因表达综合数据库获得。GSE75010用于差异表达基因(DEGs)分析。随后,生物信息学工具,如基因本体论,京都基因和基因组百科全书,加权基因相关网络分析,和基因集富集分析用于功能表征参与PE发病机理的候选靶基因。采用最小绝对收缩和选择算子回归方法来识别关键基因并建立预测模型。该方法还促进了接收器工作特性(ROC)曲线的创建,启用模型精度的评估。此外,该模型通过其他三个数据集进行了外部验证.在正常和PE组织之间总共鉴定了3286个DEGs。基因本体论和京都百科全书的基因和基因组分析揭示了与细胞趋化性相关的功能的丰富,细胞因子结合,和细胞因子-细胞因子受体相互作用。加权基因相关网络分析确定了2个颜色模块与免疫和基质评分密切相关。在将DEGs与免疫和基质相关基因相交后,选择13个基因并添加到最小绝对收缩和选择算子回归。最终,筛选出7个基因,建立区分子痫前期与对照组的风险模型,每个基因的ROC曲线下面积>0.70。构建的风险模型表明,内部和其他三个外部数据集的ROC曲线下面积均大于0.80。鉴定了7基因风险特征以建立潜在的诊断模型,并在PE患者的外部验证组中表现良好。这些发现表明,免疫和基质细胞在PE的发展过程中发挥了重要作用。
    This study aimed to investigate immune score and stromal score-related signatures associated with preeclampsia (PE) and identify key genes for diagnosing PE using bioinformatics analysis. Four microarray datasets, GSE75010, GSE25906, GSE44711, and GSE10588 were obtained from the Gene Expression Omnibus database. GSE75010 was utilized for differential expressed gene (DEGs) analysis. Subsequently, bioinformatic tools such as gene ontology, Kyoto Encyclopedia of Genes and Genomes, weighted gene correlation network analysis, and gene set enrichment analysis were employed to functionally characterize candidate target genes involved in the pathogenesis of PE. The least absolute shrinkage and selection operator regression approach was employed to identify crucial genes and develop a predictive model. This method also facilitated the creation of receiver operating characteristic (ROC) curves, enabling the evaluation of the model\'s precision. Furthermore, the model underwent external validation through the other three datasets. A total of 3286 DEGs were identified between normal and PE tissues. Gene ontology and Kyoto Encyclopedia of Genes and Genomes analyses revealed enrichments in functions related to cell chemotaxis, cytokine binding, and cytokine-cytokine receptor interaction. weighted gene correlation network analysis identified 2 color modules strongly correlated with immune and stromal scores. After intersecting DEGs with immune and stromal-related genes, 13 genes were selected and added to the least absolute shrinkage and selection operator regression. Ultimately, 7 genes were screened out to establish the risk model for discriminating preeclampsia from controls, with each gene having an area under the ROC curve >0.70. The constructed risk model demonstrated that the area under the ROC curves in internal and the other three external datasets were all greater than 0.80. A 7-gene risk signature was identified to build a potential diagnostic model and performed well in the external validation group for PE patients. These findings illustrated that immune and stromal cells played essential roles in PE during its progression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:生物背景知识在生物数据库记录的手动质量保证(QA)中起着重要作用。一个这样的QA任务是检测基于文献的基因本体注释(GOA)中的不一致性。此手动验证基于对用作证据的文献的全面审查,确保了GO注释的准确性。基因本体论(GO)术语,以及GOA记录中的注释基因。虽然已经开发了用于检测GOA中语义不一致的自动方法,它们在预定的环境中运作,缺乏利用更广泛证据的能力,特别是相关领域的背景知识。本文研究了各种类型的背景知识,这些知识可以改善对GOA中普遍存在的不一致性的检测。此外,本文提出了几种将背景知识集成到自动GOA不一致检测过程中的方法。
    结果:我们使用几种与GOA相关的背景知识扩展了以前开发的GOA不一致数据集,包括GeneRIF声明,证据文本中提到的生物学概念,GO的分级和现有的GO注解特定基因。作为自动GOA不一致检测过程的一部分,我们提出了几种有效的方法来集成背景知识。所提出的方法可以改善对自我一致性和几种最普遍的不一致类型的自动检测。
    这是第一个探索利用背景知识的优势并提出一种实用的方法来将知识纳入自动GOA不一致性检测的研究。我们为这项任务的绩效建立了新的基准。我们的方法可能适用于涉及结合生物学背景知识的各种任务。
    方法:https://github.com/jiyuc/de-inconsistency。
    BACKGROUND: Biological background knowledge plays an important role in the manual quality assurance (QA) of biological database records. One such QA task is the detection of inconsistencies in literature-based Gene Ontology Annotation (GOA). This manual verification ensures the accuracy of the GO annotations based on a comprehensive review of the literature used as evidence, Gene Ontology (GO) terms, and annotated genes in GOA records. While automatic approaches for the detection of semantic inconsistencies in GOA have been developed, they operate within predetermined contexts, lacking the ability to leverage broader evidence, especially relevant domain-specific background knowledge. This paper investigates various types of background knowledge that could improve the detection of prevalent inconsistencies in GOA. In addition, the paper proposes several approaches to integrate background knowledge into the automatic GOA inconsistency detection process.
    RESULTS: We have extended a previously developed GOA inconsistency dataset with several kinds of GOA-related background knowledge, including GeneRIF statements, biological concepts mentioned within evidence texts, GO hierarchy and existing GO annotations of the specific gene. We have proposed several effective approaches to integrate background knowledge as part of the automatic GOA inconsistency detection process. The proposed approaches can improve automatic detection of self-consistency and several of the most prevalent types of inconsistencies.
    This is the first study to explore the advantages of utilizing background knowledge and to propose a practical approach to incorporate knowledge in automatic GOA inconsistency detection. We establish a new benchmark for performance on this task. Our methods may be applicable to various tasks that involve incorporating biological background knowledge.
    METHODS: https://github.com/jiyuc/de-inconsistency.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号