differential expression

差异表达
  • 文章类型: Journal Article
    基因调控网络定义了细胞中DNA产物与其他物质之间的相互作用。增加对这些网络的了解提高了描述引发不同疾病的过程的细节水平,并促进了新治疗靶标的开发。这些网络通常用图表示,它们正确构造的主要来源通常是来自差异表达数据的时间序列。在文献中,从这种数据类型推断网络的方法有所不同。大多数情况下,计算学习技术已经实现,它们最终在特定数据集中显示了一些专业化。出于这个原因,需要创建新的和更强大的策略,以根据先前的结果达成共识,以获得特定的概括能力。本文介绍了GENECI(GENE网络共识推断),一种进化机器学习方法,充当构建集合的组织者,以处理文献中报告的主要推理技术的结果,并优化从中得出的共识网络,根据它们的置信水平和拓扑特征。经过设计,该提案面临着从学术基准(DREAM挑战和IRMA网络)收集的数据集,以量化其准确性。随后,它被应用于黑色素瘤患者的真实世界生物学网络,其结果可以与文献中收集的医学研究进行对比。最后,已经证明,它能够优化多个网络的共识,导致突出的鲁棒性和准确性,在面对多个数据集的推理后,获得一定的泛化能力。源代码托管在GitHub的公共存储库中,MIT许可证为https://github.com/AdrianSeguraOrtiz/GENECI。此外,为了方便其安装和使用,与此实现相关的软件已封装在PyPI:https://pypi.org/project/geneci/上的python包中。
    Gene regulatory networks define the interactions between DNA products and other substances in cells. Increasing knowledge of these networks improves the level of detail with which the processes that trigger different diseases are described and fosters the development of new therapeutic targets. These networks are usually represented by graphs, and the primary sources for their correct construction are usually time series from differential expression data. The inference of networks from this data type has been approached differently in the literature. Mostly, computational learning techniques have been implemented, which have finally shown some specialization in specific datasets. For this reason, the need arises to create new and more robust strategies for reaching a consensus based on previous results to gain a particular capacity for generalization. This paper presents GENECI (GEne NEtwork Consensus Inference), an evolutionary machine learning approach that acts as an organizer for constructing ensembles to process the results of the main inference techniques reported in the literature and to optimize the consensus network derived from them, according to their confidence levels and topological characteristics. After its design, the proposal was confronted with datasets collected from academic benchmarks (DREAM challenges and IRMA network) to quantify its accuracy. Subsequently, it was applied to a real-world biological network of melanoma patients whose results could be contrasted with medical research collected in the literature. Finally, it has been proved that its ability to optimize the consensus of several networks leads to outstanding robustness and accuracy, gaining a certain generalization capacity after facing the inference of multiple datasets. The source code is hosted in a public repository at GitHub under MIT license: https://github.com/AdrianSeguraOrtiz/GENECI. Moreover, to facilitate its installation and use, the software associated with this implementation has been encapsulated in a python package available at PyPI: https://pypi.org/project/geneci/.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    转录组学方法越来越多地用于生殖医学以识别候选的子宫内膜生物标志物。然而,众所周知,子宫内膜在月经周期的分子生物学中的进展是可能影响疾病相关基因发现的主要因素。因此,这项研究的目的是系统回顾目前考虑月经周期效应的实践,并证明其在鉴定潜在生物标志物方面存在偏倚.从符合标准的35项研究中,31.43%的人没有登记月经周期阶段。我们分析了11篇论文(包括12项研究)的月经周期效应基因表达综合:三个评价子宫内膜异位症,两个评估复发性植入失败,一个评估复发性妊娠丢失,一项评估子宫肌瘤和五项对照研究,收集整个月经周期的子宫内膜样本。使用线性模型消除月经周期偏倚后,平均鉴定出44.2%以上的基因。即使研究平衡了在不同子宫内膜阶段或仅在分泌中期收集的样品的比例,也可以观察到这种效果。我们的偏倚校正方法通过检索比每相独立分析更多的候选基因来提高统计能力。由于这种做法,我们发现了544个新的在位子宫内膜异位症候选基因,158个基因为异位卵巢子宫内膜异位症和27个基因为复发性种植失败。总之,我们证明月经周期进展掩盖了分子生物标志物,提供了新的指南来解开它们,并提出了一种新的分类,以区分疾病或/和月经周期进展的生物标志物。
    Transcriptomic approaches are increasingly used in reproductive medicine to identify candidate endometrial biomarkers. However, it is known that endometrial progression in the molecular biology of the menstrual cycle is a main factor that could affect the discovery of disorder-related genes. Therefore, the aim of this study was to systematically review current practices for considering the menstrual cycle effect and to demonstrate its bias in the identification of potential biomarkers. From the 35 studies meeting the criteria, 31.43% did not register the menstrual cycle phase. We analysed the menstrual cycle effect in 11 papers (including 12 studies) from Gene Expression Omnibus: three evaluating endometriosis, two evaluating recurrent implantation failure, one evaluating recurrent pregnancy loss, one evaluating uterine fibroids and five control studies, which collected endometrial samples throughout menstrual cycle. An average of 44.2% more genes were identified after removing menstrual cycle bias using linear models. This effect was observed even if studies were balanced in the proportion of samples collected at different endometrial stages or only in the mid-secretory phase. Our bias correction method increased the statistical power by retrieving more candidate genes than per-phase independent analyses. Thanks to this practice, we discovered 544 novel candidate genes for eutopic endometriosis, 158 genes for ectopic ovarian endometriosis and 27 genes for recurrent implantation failure. In conclusion, we demonstrate that menstrual cycle progression masks molecular biomarkers, provides new guidelines to unmask them and proposes a new classification that distinguishes between biomarkers of disorder or/and menstrual cycle progression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    The development of genome-wide gene expression profiling technologies over the past two decades has produced great opportunity for researchers to explore the transcriptome and to better understand biological systems and their perturbation. In this chapter we provide an overview of microarray and massively parallel sequencing technologies and their application to gene expression analysis. We discuss factors that impact expression data generation and analysis that which should be considered in the application of these technology platforms. We further present the results of a simple illustration study to highlight performance similarities and differences in expression profiling of protein-coding mRNAs with each platform. Based on technical and analytical differences between the two platforms, reports in the literature comparing arrays and RNA-Seq for gene expression, and our own example study and experience, we provide recommendations for platform selection for gene expression studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号