genotype to phenotype

  • 文章类型: Journal Article
    产量预测是基因组选择(GS)辅助作物育种的主要目标。由于产量是一个复杂的数量性状,从基因型数据进行预测是具有挑战性的。迁移学习可以通过利用来自不同,但相关,源域,被认为是一种通过整合多性状数据来提高产量预测的巨大潜在方法。然而,由于缺乏有效的实施框架,该方法之前尚未应用于基因型-表型预测.因此,我们开发了TrG2P,基于迁移学习的框架。TrG2P首先采用卷积神经网络(CNN),使用非产量性状表型和基因型数据来训练模型,从而获得预训练的模型。随后,来自这些预训练模型的卷积层参数被转移到产量预测任务,完全连接的层被重新训练,从而获得微调模型。最后,将微调模型的卷积层和第一个全连接层融合,最后一个完全连接的层被训练以增强预测性能。我们将TrG2P应用于来自玉米(Zeamays)的五组基因型和表型数据,水稻(水稻),和小麦(小麦),并将模型精度与其他七个流行的GS工具进行了比较:rrBLUP,随机森林,支持向量回归,LightGBM,CNN,深度,DNNGP。TrG2P将产量预测精度提高了39.9%,6.8%,在大米中占1.8%,玉米,小麦,分别,与性能最佳的比较模型生成的预测进行比较。因此,我们的工作表明,迁移学习是通过整合非产量性状数据中的信息来改善产量预测的有效策略。我们将增强的预测准确性归因于可从与产量相关的性状中获得的有价值的信息以及训练数据集的增强。TrG2P的Python实现可在https://github.com/lijinlong1991/TrG2P获得。基于Web的工具可在http://trg2p获得。ebreed.cn:81.
    Yield prediction is the primary goal of genomic selection (GS)-assisted crop breeding. Because yield is a complex quantitative trait, making predictions from genotypic data is challenging. Transfer learning can produce an effective model for a target task by leveraging knowledge from a different, but related, source domain and is considered a great potential method for improving yield prediction by integrating multi-trait data. However, it has not previously been applied to genotype-to-phenotype prediction owing to the lack of an efficient implementation framework. We therefore developed TrG2P, a transfer-learning-based framework. TrG2P first employs convolutional neural networks (CNN) to train models using non-yield-trait phenotypic and genotypic data, thus obtaining pre-trained models. Subsequently, the convolutional layer parameters from these pre-trained models are transferred to the yield prediction task, and the fully connected layers are retrained, thus obtaining fine-tuned models. Finally, the convolutional layer and the first fully connected layer of the fine-tuned models are fused, and the last fully connected layer is trained to enhance prediction performance. We applied TrG2P to five sets of genotypic and phenotypic data from maize (Zea mays), rice (Oryza sativa), and wheat (Triticum aestivum) and compared its model precision to that of seven other popular GS tools: ridge regression best linear unbiased prediction (rrBLUP), random forest, support vector regression, light gradient boosting machine (LightGBM), CNN, DeepGS, and deep neural network for genomic prediction (DNNGP). TrG2P improved the accuracy of yield prediction by 39.9%, 6.8%, and 1.8% in rice, maize, and wheat, respectively, compared with predictions generated by the best-performing comparison model. Our work therefore demonstrates that transfer learning is an effective strategy for improving yield prediction by integrating information from non-yield-trait data. We attribute its enhanced prediction accuracy to the valuable information available from traits associated with yield and to training dataset augmentation. The Python implementation of TrG2P is available at https://github.com/lijinlong1991/TrG2P. The web-based tool is available at http://trg2p.ebreed.cn:81.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    生物学的中心目标是了解遗传变异如何产生表型变异,已被描述为基因型到表型(G到P)图。植物形态由内在发育和外在环境输入不断塑造,因此,植物表型是高度多变量的,需要全面的方法来完全量化。然而,植物表型鉴定工作中的一个常见假设是,一些预先选择的测量可以充分描述相关的表型空间。我们对根系结构的遗传基础了解不足至少部分是这种不一致的结果。根系是复杂的3D结构,通常以相对简单的单变量特征测量的2D表示进行研究。在之前的工作中,我们证明了持续的同源性,一种拓扑数据分析方法,不预先假定数据的显著特征,可以扩展表型性状空间,并从常用的2D根表型平台识别新的G到P关系。在这里,我们将工作扩展到来自作图种群的玉米幼苗的整个3D根系结构,该作图种群旨在了解玉米-氮关系的遗传基础。使用84个单变量性状的面板,为3D分支开发的持续同源方法,和集体特征空间的多元向量,我们发现每种方法都能捕获有关根系变异的不同信息,大多数非重叠QTL证明了这一点,因此,根表型性状空间不容易耗尽。这项工作提供了一种数据驱动的方法来评估3D根结构,并强调了非规范表型对于更准确地表示G到P图的重要性。
    A central goal of biology is to understand how genetic variation produces phenotypic variation, which has been described as a genotype to phenotype (G to P) map. The plant form is continuously shaped by intrinsic developmental and extrinsic environmental inputs, and therefore plant phenomes are highly multivariate and require comprehensive approaches to fully quantify. Yet a common assumption in plant phenotyping efforts is that a few pre-selected measurements can adequately describe the relevant phenome space. Our poor understanding of the genetic basis of root system architecture is at least partially a result of this incongruence. Root systems are complex 3D structures that are most often studied as 2D representations measured with relatively simple univariate traits. In prior work, we showed that persistent homology, a topological data analysis method that does not pre-suppose the salient features of the data, could expand the phenotypic trait space and identify new G to P relations from a commonly used 2D root phenotyping platform. Here we extend the work to entire 3D root system architectures of maize seedlings from a mapping population that was designed to understand the genetic basis of maize-nitrogen relations. Using a panel of 84 univariate traits, persistent homology methods developed for 3D branching, and multivariate vectors of the collective trait space, we found that each method captures distinct information about root system variation as evidenced by the majority of non-overlapping QTL, and hence that root phenotypic trait space is not easily exhausted. The work offers a data-driven method for assessing 3D root structure and highlights the importance of non-canonical phenotypes for more accurate representations of the G to P map.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Preprint
    背景:准确的模型对于从高通量基因组数据中估计表型至关重要。虽然遗传和表型数据是敏感的,安全模型对于保护私人信息至关重要。因此,构建准确、安全的模型对表型的安全推断具有重要意义。方法:我们提出了一种具有加密线性模型的同态加密基因型数据的安全推理协议。首先,用Xgboost或Adaboost按特征重要性缩放基因型数据,然后训练线性模型以明文预测表型。其次,使用CKKS方案对模型参数和测试数据进行加密,以进行安全推断。第三,预测CKKS同态加密计算下的表型。最后,客户端对加密的预测进行解密,以计算1-NRMSE/AUC,用于模型评估。结果:使用具有20390个变体的3000个样品的5个表型来验证安全推断协议的性能。该方案在测试数据中实现了3种连续表型的0.9548、0.9639、0.9673(1-NRMSE)和2种类别表型的0.9943、0.99290(AUC)。此外,该方案在100次随机抽样中显示出鲁棒性。此外,该协议在198个样本的加密测试集中达到0.9725(平均准确度),它只需要4.32s的整体推理。这些有助于该协议在iDASH-2022track2挑战中排名第一。结论:我们提出了一种准确且安全的协议来预测基因型的表型,并且需要几秒钟才能获得所有表型的数百个预测。
    UNASSIGNED: Accurate models are crucial to estimate the phenotypes from high throughput genomic data. While the genetic and phenotypic data are sensitive, secure models are essential to protect the private information. Therefore, construct an accurate and secure model is significant in secure inference of phenotypes.
    UNASSIGNED: We propose a secure inference protocol on homomorphically encrypted genotype data with encrypted linear models. Firstly, scale the genotype data by feature importance with Xgboost or Adaboost then train linear models to predict the phenotypes in plaintext. Secondly, encrypt the model parameters and test data with CKKS scheme for secure inference. Thirdly, predict the phenotypes under CKKS homomorphically encryption computation. Finally, decrypt the encrypted predictions by client to compute the 1-NRMSE/AUC for model evaluation.
    UNASSIGNED: 5 phenotypes of 3000 samples with 20390 variants are used to validate the performance of the secure inference protocol. The protocol achieves 0.9548, 0.9639, 0.9673 (1-NRMSE) for 3 continuous phenotypes and 0.9943, 0.99290 (AUC) for 2 category phenotypes in test data. Moreover, the protocol shows robust in 100 times of random sampling. Furthermore, the protocol achieves 0.9725 (the average accuracy) in an encrypted test set with 198 samples, and it only takes 4.32s for the overall inference. These help the protocol rank top one in the iDASH-2022 track2 challenge.
    UNASSIGNED: We propose an accurate and secure protocol to predict the phenotype from genotype and it takes seconds to obtain hundreds of predictions for all phenotypes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基因组之间密码子频率的差异,基因,或者沿着基因的位置,调节转录和翻译效率,导致表型和功能差异。这里,我们提出了一个多尺度分析的影响,同义密码子重新编码在异源基因表达在人类细胞中,量化不同分子和细胞水平的密码子使用偏倚的表型后果,强调翻译的延伸。产生了六个同义版本的抗生素抗性基因,融合到一个荧光报告,并且在HEK293细胞中独立表达。通过定量转录组和蛋白质组评估分析了多尺度表型,作为基因表达的代理;细胞荧光,作为单细胞水平表达的代表;和在不存在或存在抗生素的情况下的实时细胞增殖,作为细胞适应性的代理。我们表明,密码子使用偏差的差异强烈影响分子和细胞表型:(i)它们导致mRNA水平和蛋白质水平的巨大差异,导致翻译效率差异超过15倍;(ii)它们引入未预测的剪接事件;(iii)它们导致可重复的表型异质性;和(iv)它们导致抗生素抗性的益处和异源表达的负担之间的权衡。在培养的人类细胞中,密码子使用偏倚通过改变mRNA的可用性和翻译的适用性来调节基因表达,导致蛋白质水平的差异,并最终引发功能表型的变化。
    Differences in codon frequency between genomes, genes, or positions along a gene, modulate transcription and translation efficiency, leading to phenotypic and functional differences. Here, we present a multiscale analysis of the effects of synonymous codon recoding during heterologous gene expression in human cells, quantifying the phenotypic consequences of codon usage bias at different molecular and cellular levels, with an emphasis on translation elongation. Six synonymous versions of an antibiotic resistance gene were generated, fused to a fluorescent reporter, and independently expressed in HEK293 cells. Multiscale phenotype was analyzed by means of quantitative transcriptome and proteome assessment, as proxies for gene expression; cellular fluorescence, as a proxy for single-cell level expression; and real-time cell proliferation in absence or presence of antibiotic, as a proxy for the cell fitness. We show that differences in codon usage bias strongly impact the molecular and cellular phenotype: (i) they result in large differences in mRNA levels and protein levels, leading to differences of over 15 times in translation efficiency; (ii) they introduce unpredicted splicing events; (iii) they lead to reproducible phenotypic heterogeneity; and (iv) they lead to a trade-off between the benefit of antibiotic resistance and the burden of heterologous expression. In human cells in culture, codon usage bias modulates gene expression by modifying mRNA availability and suitability for translation, leading to differences in protein levels and eventually eliciting functional phenotypic changes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    家族性心肌病是心力衰竭和心源性猝死的前兆。在过去的几十年里,研究人员已经发现了许多基因突变主要在肌节蛋白和细胞骨架蛋白引起两种不同的疾病表型:肥大性(HCM)和扩张型(DCM)心肌病.然而,将基因型与表型联系起来的分子机制仍不清楚。这里,我们采用系统方法,整合临床前研究的实验结果(例如,鼠数据)进入一个内聚的信号网络,以仔细检查基因型到表型机制。我们利用基于逻辑的微分方程方法开发了HCM/DCM信令网络模型,并评估了模型性能,以预测来自四个上下文的实验数据(HCM,DCM,压力过载,和体积过载)。该模型的总体预测精度为83.8%,在HCM背景下(90%)比DCM(75%)具有更高的准确度。全局敏感性分析确定关键信号反应,以钙介导的肌丝力发育和钙-钙调蛋白激酶信号传导排名最高。结构修订分析表明,潜在的缺失相互作用主要控制钙调节蛋白,提高模型预测精度。联合药物治疗分析表明,钙等信号成分的下调,肌动蛋白及其相关蛋白质,生长因子受体,ERK1/2和PI3K-AKT可以抑制HCM中的肌细胞生长。在患者特异性iPSC衍生心肌细胞(MLP-W4R;MYH7-R723CiPSC-CM)的实验中,ERK1/2和PI3K-AKT的联合抑制拯救了HCM表型,正如模型所预测的那样。在DCM中,PI3K-AKT-NFAT下调与Ras/ERK1/2或肌动蛋白或Gq蛋白上调相结合可改善心肌细胞形态。模型结果表明,通过提高钙敏感性增加主动力的HCM突变可以通过平行生长因子增加ERK活性并减少偏心率。Gq介导的,和Titin途径。此外,该模型模拟了现有药物对HCM和DCM患者心脏生长的影响.该HCM/DCM信号传导模型证明了在家族性心肌病中研究基因型到表型机制的实用性。
    Familial cardiomyopathy is a precursor of heart failure and sudden cardiac death. Over the past several decades, researchers have discovered numerous gene mutations primarily in sarcomeric and cytoskeletal proteins causing two different disease phenotypes: hypertrophic (HCM) and dilated (DCM) cardiomyopathies. However, molecular mechanisms linking genotype to phenotype remain unclear. Here, we employ a systems approach by integrating experimental findings from preclinical studies (e.g., murine data) into a cohesive signaling network to scrutinize genotype to phenotype mechanisms. We developed an HCM/DCM signaling network model utilizing a logic-based differential equations approach and evaluated model performance in predicting experimental data from four contexts (HCM, DCM, pressure overload, and volume overload). The model has an overall prediction accuracy of 83.8%, with higher accuracy in the HCM context (90%) than DCM (75%). Global sensitivity analysis identifies key signaling reactions, with calcium-mediated myofilament force development and calcium-calmodulin kinase signaling ranking the highest. A structural revision analysis indicates potential missing interactions that primarily control calcium regulatory proteins, increasing model prediction accuracy. Combination pharmacotherapy analysis suggests that downregulation of signaling components such as calcium, titin and its associated proteins, growth factor receptors, ERK1/2, and PI3K-AKT could inhibit myocyte growth in HCM. In experiments with patient-specific iPSC-derived cardiomyocytes (MLP-W4R;MYH7-R723C iPSC-CMs), combined inhibition of ERK1/2 and PI3K-AKT rescued the HCM phenotype, as predicted by the model. In DCM, PI3K-AKT-NFAT downregulation combined with upregulation of Ras/ERK1/2 or titin or Gq protein could ameliorate cardiomyocyte morphology. The model results suggest that HCM mutations that increase active force through elevated calcium sensitivity could increase ERK activity and decrease eccentricity through parallel growth factors, Gq-mediated, and titin pathways. Moreover, the model simulated the influence of existing medications on cardiac growth in HCM and DCM contexts. This HCM/DCM signaling model demonstrates utility in investigating genotype to phenotype mechanisms in familial cardiomyopathy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Editorial
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    功能结构植物模型(FSPM)已经发展了20多年,它们的未来发展。在某种程度上,取决于在作物科学中潜在应用的价值。迄今为止,通过确定适应不利环境的新品种的有价值的性状来稳定作物生产是作物科学的主题。因此,这项研究将研究FSPM如何能够应对作物科学中的可持续作物生产的新挑战。开发的FSPM模拟器官发生,形态发生,和各种环境下的生理活动,并且可以缩小到组织,细胞,和分子水平或升级到整个植物和生态水平。在具有独立和交互式模块的建模框架中,先进的算法提供各种尺度的形态生理学细节。FSPM被证明能够:(i)有效地提供作物理想型,以优化资源分配和使用,以提高生产率和降低疾病风险,(ii)通过将分子基础与植物表型联系起来,指导分子设计育种,并通过额外的建筑尺寸来丰富作物模型,以协助育种,和(iii)在包含三维(3D)建筑性状的分子育种中与植物表型相互作用。这项研究表明,FSPM在加快特定环境的精确育种方面具有巨大的前景,因为它具有指导和整合理想型的能力,表型,分子设计,并将分子基础与目标表型联系起来。因此,FSPM在作物科学中的巨大应用前景广阔,反过来,加速他们的进化,反之亦然。
    Functional-structural plant models (FSPMs) have been evolving for over 2 decades and their future development, to some extent, depends on the value of potential applications in crop science. To date, stabilizing crop production by identifying valuable traits for novel cultivars adapted to adverse environments is topical in crop science. Thus, this study will examine how FSPMs are able to address new challenges in crop science for sustainable crop production. FSPMs developed to simulate organogenesis, morphogenesis, and physiological activities under various environments and are amenable to downscale to the tissue, cellular, and molecular level or upscale to the whole plant and ecological level. In a modeling framework with independent and interactive modules, advanced algorithms provide morphophysiological details at various scales. FSPMs are shown to be able to: (i) provide crop ideotypes efficiently for optimizing the resource distribution and use for greater productivity and less disease risk, (ii) guide molecular design breeding via linking molecular basis to plant phenotypes as well as enrich crop models with an additional architectural dimension to assist breeding, and (iii) interact with plant phenotyping for molecular breeding in embracing three-dimensional (3D) architectural traits. This study illustrates that FSPMs have great prospects in speeding up precision breeding for specific environments due to the capacity for guiding and integrating ideotypes, phenotyping, molecular design, and linking molecular basis to target phenotypes. Consequently, the promising great applications of FSPMs in crop science will, in turn, accelerate their evolution and vice versa.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    抗菌素耐药性(AMR)是全球范围内对人类健康和粮食安全的最严重威胁之一。在畜牧业生产中使用抗微生物剂可导致AMR的出现,可以通过人畜共患疾病的传播对人类产生直接影响。猪具有特殊的风险,因为它们是人畜共患疾病的来源,并且比大多数其他牲畜接受更多的抗生素。在这里,我们使用大规模基因组方法来表征猪链球菌中的AMR,在大多数猪身上发现的共生菌,但这也会导致猪和人类的严重疾病。
    我们获得了16种抗生素的最低抑制浓度(MIC)的重复测量,在678个分离株的小组中,来自世界主要产猪地区。对于几种药物,没有自然分离为“抗性”和“易感”,强调需要将MIC视为数量性状。我们发现国家之间中等收入国家的差异,与他们的抗菌药物使用模式一致。即使不用于治疗猪链球菌的药物,AMR水平也很高,有许多多重耐药的分离株。在与人畜共患传播相关的地区的猪和人类中发现了相似的抗性水平。接下来,我们使用每个分离株的全基因组序列来鉴定43个候选抗性决定子,其中22是S.suis的小说。这些决定因素的存在解释了MIC的大部分变化。但也有有趣的并发症,包括上位性互动,已知的抗性等位基因在某些遗传背景中没有影响。β-内酰胺抗性涉及许多影响小的核心基因组变异,以特征顺序出现。
    我们提供了一个大型数据集,可以分析猪链球菌AMR的多个影响因素。我们观察到的猪链球菌中AMR的高水平反映在抗生素的使用模式上,但我们的结果证实了基因组数据有助于对抗AMR的潜力。
    Antimicrobial resistance (AMR) is among the gravest threats to human health and food security worldwide. The use of antimicrobials in livestock production can lead to emergence of AMR, which can have direct effects on humans through spread of zoonotic disease. Pigs pose a particular risk as they are a source of zoonotic diseases and receive more antimicrobials than most other livestock. Here we use a large-scale genomic approach to characterise AMR in Streptococcus suis, a commensal found in most pigs, but which can also cause serious disease in both pigs and humans.
    We obtained replicated measures of Minimum Inhibitory Concentration (MIC) for 16 antibiotics, across a panel of 678 isolates, from the major pig-producing regions of the world. For several drugs, there was no natural separation into \'resistant\' and \'susceptible\', highlighting the need to treat MIC as a quantitative trait. We found differences in MICs between countries, consistent with their patterns of antimicrobial usage. AMR levels were high even for drugs not used to treat S. suis, with many multidrug-resistant isolates. Similar levels of resistance were found in pigs and humans from regions associated with zoonotic transmission. We next used whole genome sequences for each isolate to identify 43 candidate resistance determinants, 22 of which were novel in S. suis. The presence of these determinants explained most of the variation in MIC. But there were also interesting complications, including epistatic interactions, where known resistance alleles had no effect in some genetic backgrounds. Beta-lactam resistance involved many core genome variants of small effect, appearing in a characteristic order.
    We present a large dataset allowing the analysis of the multiple contributing factors to AMR in S. suis. The high levels of AMR in S. suis that we observe are reflected by antibiotic usage patterns but our results confirm the potential for genomic data to aid in the fight against AMR.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    个体的表型不仅会受到个体自身基因型的影响,被称为直接遗传效应(DGE),还有相互作用伴侣的基因型,间接遗传效应(IGE)。已经使用多基因模型在多个物种中检测到IGE,包括实验室老鼠和人类。然而,潜在的机制在很大程度上仍然未知。IGE(igeGWAS)的全基因组关联研究可以指向IGE基因,但尚未应用于由“同行”引起并影响生物医学表型的非家族性IGE。此外,igeGWAS将在多大程度上识别未被dgeGWAS识别的基因座仍然是一个悬而未决的问题.最后,igeGWAS的发现尚未通过实验操作得到证实。
    我们利用170个行为数据集,生理,和在1812只遗传异质实验室小鼠中测量的形态表型,以研究同性之间产生的IGE,成人,不相关的小鼠被关在同一个笼子里.在这种情况下,我们开发并应用了igeGWAS方法,并鉴定了17种表型(FDR<10%)的24个重要IGE基因座。我们观察到相同表型的IGE基因座和DGE基因座之间没有重叠,这与使用多基因模型估计的相同表型的DGE和IGE之间的中度遗传相关性一致。最后,我们将七个重要的IGE基因座精细映射到单个基因,并在基因敲除模型的实验中找到了支持性证据,证明Epha4在压力应对策略和伤口愈合方面引起了IGE。
    我们的结果证明了igeGWAS识别IGE基因并揭示同伴影响机制的潜力。
    The phenotype of an individual can be affected not only by the individual\'s own genotypes, known as direct genetic effects (DGE), but also by genotypes of interacting partners, indirect genetic effects (IGE). IGE have been detected using polygenic models in multiple species, including laboratory mice and humans. However, the underlying mechanisms remain largely unknown. Genome-wide association studies of IGE (igeGWAS) can point to IGE genes, but have not yet been applied to non-familial IGE arising from \"peers\" and affecting biomedical phenotypes. In addition, the extent to which igeGWAS will identify loci not identified by dgeGWAS remains an open question. Finally, findings from igeGWAS have not been confirmed by experimental manipulation.
    We leverage a dataset of 170 behavioral, physiological, and morphological phenotypes measured in 1812 genetically heterogeneous laboratory mice to study IGE arising between same-sex, adult, unrelated mice housed in the same cage. We develop and apply methods for igeGWAS in this context and identify 24 significant IGE loci for 17 phenotypes (FDR < 10%). We observe no overlap between IGE loci and DGE loci for the same phenotype, which is consistent with the moderate genetic correlations between DGE and IGE for the same phenotype estimated using polygenic models. Finally, we fine-map seven significant IGE loci to individual genes and find supportive evidence in an experiment with a knockout model that Epha4 gives rise to IGE on stress-coping strategy and wound healing.
    Our results demonstrate the potential for igeGWAS to identify IGE genes and shed light into the mechanisms of peer influence.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    鸟类丰富多彩的表型长期以来为进化生物学家提供了丰富的原始材料。鸟类羽毛,喙,皮肤,和鸡蛋-表现出一系列惊人的神秘和显眼的形式-启发早期的自适应着色工作。最近,鸟类的颜色推动了生理上的发现,发展,以及越来越多的导致表型变异的遗传机制。可以相对容易地量化鸟类颜色特征,这使鸟类成为发现表型和基因型之间联系的有吸引力的系统。因此,鸟类着色遗传学领域正在蓬勃发展。在这次审查中,我们强调了与鸟类颜色的遗传基础相关的最新进展和新出现的问题。我们首先描述与2类色素相关的突破:产生红色的类胡萝卜素,黄色,在大多数鸟类和鹦鹉中产生类似颜色的鹦鹉和橙色。然后我们讨论结构颜色,它是通过光与纳米级材料的相互作用产生的,极大地扩展了羽毛调色板。结构性颜色遗传学仍未得到充分研究,但这种范式正在发生变化。接下来,我们探索色素和结构机制之间相互作用产生的颜色如何受到共表达或共调节的基因的控制。我们还确定了研究介导羽毛内部微图案化以及裸露部分和卵着色的基因的机会。我们通过聚焦2个研究领域得出的结论是,色觉和色彩生产之间的机械联系,和物种形成——它们被遗传洞察力所激励,随着新的基因组方法应用于非模型物种,这种趋势可能会继续下去。
    The colorful phenotypes of birds have long provided rich source material for evolutionary biologists. Avian plumage, beaks, skin, and eggs-which exhibit a stunning range of cryptic and conspicuous forms-inspired early work on adaptive coloration. More recently, avian color has fueled discoveries on the physiological, developmental, and-increasingly-genetic mechanisms responsible for phenotypic variation. The relative ease with which avian color traits can be quantified has made birds an attractive system for uncovering links between phenotype and genotype. Accordingly, the field of avian coloration genetics is burgeoning. In this review, we highlight recent advances and emerging questions associated with the genetic underpinnings of bird color. We start by describing breakthroughs related to 2 pigment classes: carotenoids that produce red, yellow, and orange in most birds and psittacofulvins that produce similar colors in parrots. We then discuss structural colors, which are produced by the interaction of light with nanoscale materials and greatly extend the plumage palette. Structural color genetics remain understudied-but this paradigm is changing. We next explore how colors that arise from interactions among pigmentary and structural mechanisms may be controlled by genes that are co-expressed or co-regulated. We also identify opportunities to investigate genes mediating within-feather micropatterning and the coloration of bare parts and eggs. We conclude by spotlighting 2 research areas-mechanistic links between color vision and color production, and speciation-that have been invigorated by genetic insights, a trend likely to continue as new genomic approaches are applied to non-model species.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号