genotype to phenotype map

  • 文章类型: Preprint
    基因表达反应的进化是适应可变环境的关键组成部分。预测DNA序列如何影响表达是具有挑战性的,因为基因型到表型图谱对于顺式调控元件没有很好的解决。转录因子结合,监管互动,和表观遗传特征,更不用说这些因素对环境的反应了。我们测试了灵活的机器学习模型是否可以学习一些潜在的顺式调节基因型到表型图谱。我们在5个不同的拟南芥种质中使用冷响应转录组谱测试了这种方法。我们首先测试了顺式调节在环境响应中起作用的证据,发现14个和15个基序在冷反应差异调节基因(DEGs)的上游和下游区域显着富集。我们接下来应用卷积神经网络(CNN),它学习DNA序列中的从头顺式调控基序,以预测对环境的表达反应。我们发现CNN以中等精度预测差异表达,有证据表明,生物调控的复杂性和巨大的潜在调控代码阻碍了预测。总的来说,可以根据顺式调控序列的变化来预测特定环境之间的DEG,尽管需要纳入更多信息,并且可能需要更好的模型。
    The evolution of gene expression responses are a critical component of adaptation to variable environments. Predicting how DNA sequence influences expression is challenging because the genotype to phenotype map is not well resolved for cis regulatory elements, transcription factor binding, regulatory interactions, and epigenetic features, not to mention how these factors respond to environment. We tested if flexible machine learning models could learn some of the underlying cis-regulatory genotype to phenotype map. We tested this approach using cold-responsive transcriptome profiles in 5 diverse Arabidopsis thaliana accessions. We first tested for evidence that cis regulation plays a role in environmental response, finding 14 and 15 motifs that were significantly enriched within the up- and down-stream regions of cold-responsive differentially regulated genes (DEGs). We next applied convolutional neural networks (CNNs), which learn de novo cis-regulatory motifs in DNA sequences to predict expression response to environment. We found that CNNs predicted differential expression with moderate accuracy, with evidence that predictions were hindered by biological complexity of regulation and the large potential regulatory code. Overall, DEGs between specific environments can be predicted based on variation in cis-regulatory sequences, although more information needs to be incorporated and better models may be required.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    The emergence of infectious agents with pandemic potential present scientific challenges from detection to data interpretation to understanding determinants of risk and forecasts. Mathematical models could play an essential role in how we prepare for future emergent pathogens. Here, we describe core directions for expansion of the existing tools and knowledge base, including: using mathematical models to identify critical directions and paths for strengthening data collection to detect and respond to outbreaks of novel pathogens; expanding basic theory to identify infectious agents and contexts that present the greatest risks, over both the short and longer term; by strengthening estimation tools that make the most use of the likely range and uncertainties in existing data; and by ensuring modelling applications are carefully communicated and developed within diverse and equitable collaborations for increased public health benefit.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在这篇综述中,我们总结了我们最近的努力,试图通过使用神经网络来表征从癌细胞基因型和环境到其表型的映射的不同方面来理解异质性在癌症进展中的作用。我们的核心前提是,癌症是一个不断发展的系统,需要突变和选择,这些过程发生的主要渠道是癌细胞,其行为在多个生物学尺度上受到调节。选择压力主要由肿瘤生长的微环境驱动,这直接作用于细胞表型。反过来,表型由基因型调节的细胞内途径驱动。整合所有这些过程是一项艰巨的任务,需要弥合许多生物学尺度(即基因型,通路,表型和环境),我们将在这篇综述中只触及表面。我们将专注于使用神经网络作为连接这些不同生物尺度的手段的模型,因为它们使我们能够轻松地创建用于选择的异质性,并且重要的是这种异质性可以在不同的生物学尺度上实现。更具体地说,我们考虑了三个不同的神经网络,它们桥接了这些尺度的不同方面以及与微环境的对话,(I)微环境对进化动力学的影响,(ii)在药物诱导的扰动下从基因型到表型的作图,以及(iii)在不同微环境条件下正常细胞和癌细胞中的途径活性。
    In this review we summarise our recent efforts in trying to understand the role of heterogeneity in cancer progression by using neural networks to characterise different aspects of the mapping from a cancer cells genotype and environment to its phenotype. Our central premise is that cancer is an evolving system subject to mutation and selection, and the primary conduit for these processes to occur is the cancer cell whose behaviour is regulated on multiple biological scales. The selection pressure is mainly driven by the microenvironment that the tumour is growing in and this acts directly upon the cell phenotype. In turn, the phenotype is driven by the intracellular pathways that are regulated by the genotype. Integrating all of these processes is a massive undertaking and requires bridging many biological scales (i.e. genotype, pathway, phenotype and environment) that we will only scratch the surface of in this review. We will focus on models that use neural networks as a means of connecting these different biological scales, since they allow us to easily create heterogeneity for selection to act upon and importantly this heterogeneity can be implemented at different biological scales. More specifically, we consider three different neural networks that bridge different aspects of these scales and the dialogue with the micro-environment, (i) the impact of the micro-environment on evolutionary dynamics, (ii) the mapping from genotype to phenotype under drug-induced perturbations and (iii) pathway activity in both normal and cancer cells under different micro-environmental conditions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Proteins change over the course of evolutionary time. New protein-coding genes and gene families emerge and diversify, ultimately affecting an organism\'s phenotype and interactions with its environment. Here we survey the range of structural protein change observed in plants and review the role these changes have had in the evolution of plant form and function. Verified examples tying evolutionary change in protein structure to phenotypic change remain scarce. We will review the existing examples, as well as draw from investigations into domestication, and quantitative trait locus (QTL) cloning studies searching for the molecular underpinnings of natural variation. The evolutionary significance of many cloned QTL has not been assessed, but all the examples identified so far have begun to reveal the extent of protein structural diversity tolerated in natural systems. This molecular (and phenotypic) diversity could come to represent part of natural selection\'s source material in the adaptive evolution of novel traits. Protein structure and function can change in many distinct ways, but the changes we identified in studies of natural diversity and protein evolution were predicted to fall primarily into one of six categories: altered active and binding sites; altered protein-protein interactions; altered domain content; altered activity as an activator or repressor; altered protein stability; and hypomorphic and hypermorphic alleles. There was also variability in the evolutionary scale at which particular changes were observed. Some changes were detected at both micro- and macroevolutionary timescales, while others were observed primarily at deep or shallow phylogenetic levels. This variation might be used to determine the trajectory of future investigations in structural molecular evolution.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号