Codon

密码子
  • 文章类型: Journal Article
    密码子最优性的调节是细胞和组织特异性蛋白表达控制的一个日益受到重视的层面。这里,我们使用密码子修饰的报告基因显示果蝇神经干细胞分化为神经元能够从稀有密码子富集的基因中表达蛋白质.从候选人屏幕上看,我们确定了细胞质聚腺苷酸化元件结合(CPEB)蛋白Orb2是神经元中稀有密码子依赖性mRNA稳定性的正调节因子.使用RNA测序,我们发现,具有丰富Orb2结合位点的大脑中Orb2上调的mRNA具有罕见密码子偏好。从这些Orb2调控的mRNA中,我们证明稀有密码子富集对于代谢型谷氨酸受体(mGluR)的mRNA稳定性和社会行为功能很重要。我们的发现揭示了神经干细胞分化改变遗传密码调控以实现关键mRNA稳定性和蛋白质表达的分子机制。
    Regulation of codon optimality is an increasingly appreciated layer of cell- and tissue-specific protein expression control. Here, we use codon-modified reporters to show that differentiation of Drosophila neural stem cells into neurons enables protein expression from rare-codon-enriched genes. From a candidate screen, we identify the cytoplasmic polyadenylation element binding (CPEB) protein Orb2 as a positive regulator of rare-codon-dependent mRNA stability in neurons. Using RNA sequencing, we reveal that Orb2-upregulated mRNAs in the brain with abundant Orb2 binding sites have a rare-codon bias. From these Orb2-regulated mRNAs, we demonstrate that rare-codon enrichment is important for mRNA stability and social behavior function of the metabotropic glutamate receptor (mGluR). Our findings reveal a molecular mechanism by which neural stem cell differentiation shifts genetic code regulation to enable critical mRNA stability and protein expression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    中心教条将核糖体视为一种分子机器,当它将每个氨基酸添加到其生长的肽链时,一次读取一个mRNA密码子。然而,本研究和先前的研究表明,核糖体实际上感知成对的相邻密码子,因为它们沿着mRNA采取三个核苷酸的步骤。我们检查了GNN密码子,我们发现,在真核生物蛋白质编码开放阅读框(ORF)中,尤其是在NNU密码子之后。酵母中的核糖体谱分析实验表明,当NNU紧随GNN密码子(3')时,在其氨基酰基(A)位点具有NNU的核糖体密度特别高,表明NNU密码子从核糖体A到肽基(P)位点的mRNA穿线较慢。此外,如果评估仅限于最近才到达下一个密码子的核糖体,通过检查21个核苷酸的核糖体足迹(21-ntRFP),当跟随GNN时,观察到多个密码子类别的密度升高。相邻的5'-NNNGNN密码子对的这种惊人的翻译减慢可能是介导的,在某种程度上,通过核糖体的汽车表面,在核糖体易位过程中充当A位点tRNA反密码子的延伸,并通过氢键和pi堆叠与GNN密码子相互作用。5'-NNNGNN密码子邻接的功能后果预计会影响蛋白质编码序列的进化。
    The central dogma treats the ribosome as a molecular machine that reads one mRNA codon at a time as it adds each amino acid to its growing peptide chain. However, this and previous studies suggest that ribosomes actually perceive pairs of adjacent codons as they take three-nucleotide steps along the mRNA. We examined GNN codons, which we find are surprisingly overrepresented in eukaryote protein-coding open reading frames (ORFs), especially immediately after NNU codons. Ribosome profiling experiments in yeast revealed that ribosomes with NNU at their aminoacyl (A) site have particularly elevated densities when NNU is immediately followed (3\') by a GNN codon, indicating slower mRNA threading of the NNU codon from the ribosome\'s A to peptidyl (P) sites. Moreover, if the assessment was limited to ribosomes that have only recently arrived at the next codon, by examining 21-nucleotide ribosome footprints (21-nt RFPs), elevated densities were observed for multiple codon classes when followed by GNN. This striking translation slowdown at adjacent 5\'-NNN GNN codon pairs is likely mediated, in part, by the ribosome\'s CAR surface, which acts as an extension of the A-site tRNA anticodon during ribosome translocation and interacts through hydrogen bonding and pi stacking with the GNN codon. The functional consequences of 5\'-NNN GNN codon adjacency are expected to influence the evolution of protein coding sequences.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:线虫是地球上最丰富和多样化的后生动物,并且已知会严重影响生态系统功能。更好地了解他们的生物学和生态学,包括对不同栖息地和生活方式的潜在适应,是了解他们对全球变化情景的反应的关键。线粒体基因组提供了高物种水平的表征,测序成本低,和易于数据处理,可以提供对线虫进化压力的见解。
    结果:一般来说,线虫线粒体基因组表现出相似的结构特征(例如,基因大小和GC含量),但围绕这些一般模式显示出显著的可变性。组成链偏差显示出强烈的密码子位置特异性G偏斜,并且与线虫生命特征(尤其是寄生摄食习性)的关系等于或大于与预测的系统发育。平均而言,线虫线粒体基因组显示出低的非同义替换率,而且与这些手段的具体偏差也很高。尽管存在显著的突变饱和,非同义(dN)和同义(dS)替代率仍然可以通过摄食习惯和/或栖息地来解释。dN:dS比率低,特别是与寄生虫的生活方式有关,建议存在强大的净化选择。
    结论:线虫线粒体基因组表现出积累成分多样性的能力,结构,和内容,同时仍然保持功能基因。此外,他们展示了快速进化变化的能力,指出多层次选择压力和快速进化之间的潜在相互作用。总之,这项研究有助于为我们理解形成线虫线粒体基因组的潜在进化压力奠定基础,同时概述了未来可能的调查路线。
    BACKGROUND: Nematodes are the most abundant and diverse metazoans on Earth, and are known to significantly affect ecosystem functioning. A better understanding of their biology and ecology, including potential adaptations to diverse habitats and lifestyles, is key to understanding their response to global change scenarios. Mitochondrial genomes offer high species level characterization, low cost of sequencing, and an ease of data handling that can provide insights into nematode evolutionary pressures.
    RESULTS: Generally, nematode mitochondrial genomes exhibited similar structural characteristics (e.g., gene size and GC content), but displayed remarkable variability around these general patterns. Compositional strand biases showed strong codon position specific G skews and relationships with nematode life traits (especially parasitic feeding habits) equal to or greater than with predicted phylogeny. On average, nematode mitochondrial genomes showed low non-synonymous substitution rates, but also high clade specific deviations from these means. Despite the presence of significant mutational saturation, non-synonymous (dN) and synonymous (dS) substitution rates could still be significantly explained by feeding habit and/or habitat. Low ratios of dN:dS rates, particularly associated with the parasitic lifestyles, suggested the presence of strong purifying selection.
    CONCLUSIONS: Nematode mitochondrial genomes demonstrated a capacity to accumulate diversity in composition, structure, and content while still maintaining functional genes. Moreover, they demonstrated a capacity for rapid evolutionary change pointing to a potential interaction between multi-level selection pressures and rapid evolution. In conclusion, this study helps establish a background for our understanding of the potential evolutionary pressures shaping nematode mitochondrial genomes, while outlining likely routes of future inquiry.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    人类副流感病毒3型(HPIV3)是一种主要的儿科呼吸道病原体,缺乏可用的疫苗或抗病毒药物。我们通过密码子对去优化(CPD)产生了活的减毒HPIV3疫苗候选物。HPIV3开放阅读框(ORFs)编码核蛋白(N),磷蛋白(P),矩阵(M),融合(F),血凝素-神经氨酸酶(HN),和聚合酶(L)被单独或组合修饰以产生12种病毒,命名为Min-N,Min-P,Min-M,Min-FHN,Min-L,Min-NP,Min-NPM,最小不良贷款,Min-PM,Min-PFHN,Min-MFHN,和Min-PMFHN。N或L的CPD严重降低了体外生长,没有进一步评估。P或M的CPD与体外干扰素(IFN)反应增加和减少有关,分别,但对病毒复制影响不大。在Vero细胞中,F和HN延迟病毒复制的CPD,但最终滴度与野生型(wt)HPIV3相当。在人肺上皮A549细胞中,CPDF和HN诱导更强的IFN应答,病毒滴度降低了100倍,F和HN蛋白的表达显着降低,而不影响N或P或蛋白质在病毒体中的相对包装。仓鼠鼻内感染后,对于携带CPDF和HN的病毒,鼻甲和肺中的复制倾向于减少最多,最大减少约10倍。尽管体内复制减少(体外CPDF和HN的表达降低),所有病毒均诱导与wt相似的血清HPIV3中和抗体滴度,并提供针对HPIV3攻击的完全保护。总之,HPIV3的CPD产生了适合进一步开发的有希望的疫苗候选物。
    Human parainfluenza virus type 3 (HPIV3) is a major pediatric respiratory pathogen lacking available vaccines or antiviral drugs. We generated live-attenuated HPIV3 vaccine candidates by codon-pair deoptimization (CPD). HPIV3 open reading frames (ORFs) encoding the nucleoprotein (N), phosphoprotein (P), matrix (M), fusion (F), hemagglutinin-neuraminidase (HN), and polymerase (L) were modified singly or in combination to generate 12 viruses designated Min-N, Min-P, Min-M, Min-FHN, Min-L, Min-NP, Min-NPM, Min-NPL, Min-PM, Min-PFHN, Min-MFHN, and Min-PMFHN. CPD of N or L severely reduced growth in vitro and was not further evaluated. CPD of P or M was associated with increased and decreased interferon (IFN) response in vitro, respectively, but had little effect on virus replication. In Vero cells, CPD of F and HN delayed virus replication, but final titers were comparable to wild-type (wt) HPIV3. In human lung epithelial A549 cells, CPD F and HN induced a stronger IFN response, viral titers were reduced 100-fold, and the expression of F and HN proteins was significantly reduced without affecting N or P or the relative packaging of proteins into virions. Following intranasal infection in hamsters, replication in the nasal turbinates and lungs tended to be the most reduced for viruses bearing CPD F and HN, with maximum reductions of approximately 10-fold. Despite decreased in vivo replication (and lower expression of CPD F and HN in vitro), all viruses induced titers of serum HPIV3-neutralizing antibodies similar to wt and provided complete protection against HPIV3 challenge. In summary, CPD of HPIV3 yielded promising vaccine candidates suitable for further development.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    一些代谢最多样化的细菌物种(例如,放线菌)在其DNA中具有较高的GC含量,密码子使用差异很大,与大肠杆菌等易于处理的表达宿主相比,具有不同的蛋白质折叠环境。因此,在大肠杆菌中表达来自这些细菌的生物合成基因簇(BGC)通常会导致有关蛋白质表达和折叠的无数不可预测的问题,延缓新天然产物的生化表征。当前实现可溶性的策略,这些酶在可处理的宿主中的活性表达可能是一个漫长的试错过程。无细胞表达(CFE)已成为有价值的表达平台,作为快速原型表达参数的测试平台。这里,我们使用来自灰色链霉菌的III型聚酮合成酶,RppA,它能催化红色色素的形成,作为研究BGC重构技术的记者。我们应用具有启动子和rppA编码序列的不同组合的构建体文库来研究启动子和密码子使用之间的协同作用。随后,我们评估了无细胞系统在细胞中实施这些重构策略之前对其进行原型化的实用性。总的来说,在无细胞和细胞环境中,密码子协调比传统的密码子优化更能改善天然产物的合成。更重要的是,编码序列和启动子的选择协同影响蛋白质表达,这应该被考虑为未来的努力使用CFE进行高产量的蛋白质表达。当应用于RppA时,启动子策略与GFP观察到的不完全相关,表明不同的启动子策略应适用于不同的蛋白质。体内实验表明存在相关性,但在无细胞和体内表达之间没有完全对齐。通过CFE重构启动子和/或编码序列可以是快速筛选从BCG催化功能产生酶的有价值的策略。这推动了CFE作为天然产品研究工具的发展。
    Some of the most metabolically diverse species of bacteria (e.g., Actinobacteria) have higher GC content in their DNA, differ substantially in codon usage, and have distinct protein folding environments compared to tractable expression hosts like Escherichia coli. Consequentially, expressing biosynthetic gene clusters (BGCs) from these bacteria in E. coli often results in a myriad of unpredictable issues with regard to protein expression and folding, delaying the biochemical characterization of new natural products. Current strategies to achieve soluble, active expression of these enzymes in tractable hosts can be a lengthy trial-and-error process. Cell-free expression (CFE) has emerged as a valuable expression platform as a testbed for rapid prototyping expression parameters. Here, we use a type III polyketide synthase from Streptomyces griseus, RppA, which catalyzes the formation of the red pigment flaviolin, as a reporter to investigate BGC refactoring techniques. We applied a library of constructs with different combinations of promoters and rppA coding sequences to investigate the synergies between promoter and codon usage. Subsequently, we assess the utility of cell-free systems for prototyping these refactoring tactics prior to their implementation in cells. Overall, codon harmonization improves natural product synthesis more than traditional codon optimization across cell-free and cellular environments. More importantly, the choice of coding sequences and promoters impact protein expression synergistically, which should be considered for future efforts to use CFE for high-yield protein expression. The promoter strategy when applied to RppA was not completely correlated with that observed with GFP, indicating that different promoter strategies should be applied for different proteins. In vivo experiments suggest that there is correlation, but not complete alignment between expressing in cell free and in vivo. Refactoring promoters and/or coding sequences via CFE can be a valuable strategy to rapidly screen for catalytically functional production of enzymes from BCGs, which advances CFE as a tool for natural product research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    遗传密码由61个编码20个氨基酸的密码子组成。这些密码子由在蛋白质合成过程中与特定密码子结合的转移RNA(tRNA)识别。由于碱基对摆动,所有生物体利用少于全部61种可能的反密码子:与密码子在其第三个核苷酸处具有错配的能力。先前的研究观察到细菌的tRNA池与其各自环境的温度之间存在相关性。然而,目前尚不清楚这些模式是否代表生物适应,以维持不同环境中蛋白质合成的效率和准确性。mRNA翻译的机械数学模型用于基于生物体的tRNA池定量每个密码子的预期伸长率和错误率。对一系列细菌进行比较分析,以量化环境温度对tRNA库进化的影响。我们发现,嗜热菌通常比中温菌或嗜冷菌在其tRNA池中代表更多的反密码子。根据我们的模型,这种增加的多样性预计会导致错觉错误的增加。讨论了这对嗜热菌中蛋白质进化的影响。
    The genetic code consists of 61 codons coding for 20 amino acids. These codons are recognized by transfer RNAs (tRNAs) that bind to specific codons during protein synthesis. All organisms utilize less than all 61 possible anticodons due to base pair wobble: the ability to have a mismatch with a codon at its third nucleotide. Previous studies observed a correlation between the tRNA pool of bacteria and the temperature of their respective environments. However, it is unclear if these patterns represent biological adaptations to maintain the efficiency and accuracy of protein synthesis in different environments. A mechanistic mathematical model of mRNA translation is used to quantify the expected elongation rates and error rate for each codon based on an organism\'s tRNA pool. A comparative analysis across a range of bacteria that accounts for covariance due to shared ancestry is performed to quantify the impact of environmental temperature on the evolution of the tRNA pool. We find that thermophiles generally have more anticodons represented in their tRNA pool than mesophiles or psychrophiles. Based on our model, this increased diversity is expected to lead to increased missense errors. The implications of this for protein evolution in thermophiles are discussed.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Galium属属于茜草科,由大约14,000种组成。与其知名亲戚相比,到目前为止,尚未探索Galium属的质体。该属的质体具有典型的,四方结构,但是基因含量不同,因为infA基因在Galiumpalustre和Galiumtrifidum中缺失。对使用整个叶绿体基因组序列作为超级条形码进行精确植物物种鉴定的有效性的评估表明,这种方法在属和部落内进行分子划界的潜力很大。trnE-UUC-psbD区域显示出最大数量的诊断(诊断核苷酸),这可能是新的潜在条形码,不仅在Galium中,但也在其他密切相关的属。相对同义密码子使用(RSCU)似乎与茜草科的系统发育有关,表明在进化过程中,植物开始偏爱特定的密码子而不是其他。
    Galium genus belongs to the Rubiaceae family, which consists of approximately 14,000 species. In comparison to its well-known relatives, the plastomes of the Galium genus have not been explored so far. The plastomes of this genus have a typical, quadripartite structure, but differ in gene content, since the infA gene is missing in Galium palustre and Galium trfidum. An evaluation of the effectiveness of using entire chloroplast genome sequences as superbarcodes for accurate plant species identification revealed the high potential of this method for molecular delimitation within the genus and tribe. The trnE-UUC-psbD region showed the biggest number of diagnostides (diagnostic nucleotides) which might be new potential barcodes, not only in Galium, but also in other closely related genera. Relative synonymous codon usage (RSCU) appeared to be connected with the phylogeny of the Rubiaceae family, showing that during evolution, plants started preferring specific codons over others.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    最近的研究表明,精氨酸是大多数癌症类型中氨基酸改变的最有利靶标,并且已经表明,对精氨酸突变的高度偏好反映了该氨基酸在蛋白质功能中的关键作用。癌症中精氨酸残基的高突变率,然而,也可能是由于CGN家族的精氨酸密码子的突变性增加,因为这些密码子的CpG二核苷酸可能被甲基化。在目前的工作中,我们分析了癌症基因的单碱基替换的光谱(癌基因,肿瘤抑制基因)和癌组织中的过客基因,以评估CpG高突变性和选择对精氨酸突变的贡献。我们的研究表明,与缺乏CpG二核苷酸的精氨酸密码子AGA和AGG相比,CGN密码子家族编码的精氨酸在癌症基因和过客基因中显示出更高的突变率。表明精氨酸突变在癌症中占主导地位主要是由于CpG的高突变性,而不是选择精氨酸替代。然而,我们的结果还表明,精氨酸的CGN密码子可能是癌症基因的致命弱点。原癌基因关键精氨酸的CpG超突变性,导致驱动突变的高复发率,对致癌作用有重要贡献。同样,我们的结果表明,CGA密码子的CpG二核苷酸的超突变性(将其转换为TGA终止密码子)显着导致肿瘤抑制基因的反复截短和失活。
    Recent studies have revealed that arginine is the most favorable target of amino acid alteration in most cancer types and it has been suggested that the high preference for arginine mutations reflects the critical roles of this amino acid in the function of proteins. High rates of mutations of arginine residues in cancer, however, might also be due to increased mutability of arginine codons of the CGN family as the CpG dinucleotides of these codons may be methylated. In the present work we have analyzed spectra of single base substitutions of cancer genes (oncogenes, tumor suppressor genes) and passenger genes in cancer tissues to assess the contributions of CpG hypermutability and selection to arginine mutations. Our studies have shown that arginines encoded by the CGN codon family display higher rates of mutation in both cancer genes and passenger genes than arginine codons AGA and AGG that are devoid of CpG dinucleotide, suggesting that the predominance of arginine mutations in cancer is primarily due to CpG hypermutability, rather than selection for arginine replacement. Nevertheless, our results also suggest that CGN codons for arginines may serve as Achilles\' heels of cancer genes. CpG hypermutability of key arginines of proto-oncogenes, leading to high rates of recurrence of driver mutations, contributes significantly to carcinogenesis. Similarly, our results indicate that hypermutability of the CpG dinucleotide of CGA codons (converting them to TGA stop codons) contributes significantly to recurrent truncation and inactivation of tumor suppressor genes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Rutaceae家族包括经济上重要的植物,因为它们在香料中的广泛应用,食物,油,医学,等。芦荟科植物能够通过生物技术更好地利用。现代生物技术方法主要依赖于功能蛋白在不同载体中的异源表达。然而,几种蛋白质很难在其天然环境之外表达。功能基因在异源系统中的表达潜力可以通过用功能基因的优选最佳密码子替换载体中的稀有同义密码子来最大化。密码子使用偏差在基于生物基因工程的研究和开发中起着至关重要的作用。在目前的研究中,分析了从10个Rutaceae植物家族成员的叶绿体基因组获得的727个编码序列(CDS)的密码子使用偏差。密码子的核苷酸组成分析表明,这些密码子富含A/T(U)碱基和优选的A/T(U)结尾。中立图的分析,有效密码子数(ENC)图,并对ENC与密码子适应指数(CAI)进行相关性分析,这揭示了自然选择是芦荟科植物家族密码子使用偏好的主要驱动力,其次是碱基突变。在ENC与CAI图,Rutaceae家族的密码子使用偏差与基因表达水平的关系可以忽略不计。对于每个样本,我们同时筛选了12个密码子作为首选密码子和高频密码子,其中GCU编码Ala,UUA编码Leu,编码Arg的AGA是最优选的密码子。一起来看,我们的研究揭示了菊科家族的同义密码子使用模式,为今后对芦荟科植物物种进行基因工程研究提供有价值的信息。
    Rutaceae family comprises economically important plants due to their extensive applications in spices, food, oil, medicine, etc. The Rutaceae plants is able to better utilization through biotechnology. Modern biotechnological approaches primarily rely on the heterologous expression of functional proteins in different vectors. However, several proteins are difficult to express outside their native environment. The expression potential of functional genes in heterologous systems can be maximized by replacing the rare synonymous codons in the vector with preferred optimal codons of functional genes. Codon usage bias plays a critical role in biogenetic engineering-based research and development. In the current study, 727 coding sequences (CDSs) obtained from the chloroplast genomes of ten Rutaceae plant family members were analyzed for codon usage bias. The nucleotide composition analysis of codons showed that these codons were rich in A/T(U) bases and preferred A/T(U) endings. Analyses of neutrality plots, effective number of codons (ENC) plots, and correlations between ENC and codon adaptation index (CAI) were conducted, which revealed that natural selection is a major driving force for the Rutaceae plant family\'s codon usage bias, followed by base mutation. In the ENC vs. CAI plot, codon usage bias in the Rutaceae family had a negligible relationship with gene expression level. For each sample, we screened 12 codons as preferred and high-frequency codons simultaneously, of which GCU encoding Ala, UUA encoding Leu, and AGA encoding Arg were the most preferred codons. Taken together, our study unraveled the synonymous codon usage pattern in the Rutaceae family, providing valuable information for the genetic engineering of Rutaceae plant species in the future.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    标准遗传密码定义了地球上几乎所有生命形式的翻译规则。它还确定了通过单核苷酸突变可获得的氨基酸变化,从而影响蛋白质进化能力——突变带来蛋白质功能适应性变异的能力。标准遗传密码最显著的特征之一是它对突变的鲁棒性,然而,这种健壮性是促进还是阻碍了蛋白质的进化,这仍然是一个悬而未决的问题。为了回答这个问题,我们使用来自大规模平行序列到功能测定的数据来构建和分析数十万重新连接的遗传密码下的6种经验适应性景观,包括与蛋白质工程和合成生物学相关的密码子压缩方案。我们发现,强大的遗传密码倾向于通过渲染平滑的适应性景观来增强蛋白质的进化能力,从整个序列空间都很容易接近。然而,标准遗传密码在这方面很少例外,因为许多替代代码比标准代码渲染更平滑的景观。通过构建这些景观的低维可视化,每个包含超过1600万个mRNA序列,我们表明,这种替代代码从根本上改变了高适应度基因型网络的拓扑特征。而优化可进化性的遗传密码在一定程度上取决于氨基酸序列和蛋白质功能之间的详细关系,我们还揭示了工程非标准遗传密码的一般设计原则,以增强和减少可进化性,这可能有助于定向蛋白质进化实验和合成生物的生物保护,分别。
    The standard genetic code defines the rules of translation for nearly every life form on Earth. It also determines the amino acid changes accessible via single-nucleotide mutations, thus influencing protein evolvability-the ability of mutation to bring forth adaptive variation in protein function. One of the most striking features of the standard genetic code is its robustness to mutation, yet it remains an open question whether such robustness facilitates or frustrates protein evolvability. To answer this question, we use data from massively parallel sequence-to-function assays to construct and analyze 6 empirical adaptive landscapes under hundreds of thousands of rewired genetic codes, including those of codon compression schemes relevant to protein engineering and synthetic biology. We find that robust genetic codes tend to enhance protein evolvability by rendering smooth adaptive landscapes with few peaks, which are readily accessible from throughout sequence space. However, the standard genetic code is rarely exceptional in this regard, because many alternative codes render smoother landscapes than the standard code. By constructing low-dimensional visualizations of these landscapes, which each comprise more than 16 million mRNA sequences, we show that such alternative codes radically alter the topological features of the network of high-fitness genotypes. Whereas the genetic codes that optimize evolvability depend to some extent on the detailed relationship between amino acid sequence and protein function, we also uncover general design principles for engineering nonstandard genetic codes for enhanced and diminished evolvability, which may facilitate directed protein evolution experiments and the bio-containment of synthetic organisms, respectively.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号