Directed evolution

定向进化
  • 文章类型: Journal Article
    如果该行业要继续产生新药,则药物发现方面的研究和开发将需要获得显着的效率收益。人们非常期望机器学习(ML)提高研发生产率,但是为了充分利用机器学习的潜力,新一代,高质量的数据集将是必要的。这里,作者提出了一个将高通量显示和选择数据生成与ML相结合的平台。更具体地说,深度学习用于使用DNA库合成来告知新型生物疗法的定向进化,超高吞吐量选择,和下一代测序。通过结合多个计算机模型的学习,他们的平台实现了跨多个重要蛋白质特征的多参数优化。他们还提出了一个模型,用于根据其基础计算机模型的准确性对这些ML驱动的药物发现平台进行基准测试。结合他们的经验实验的吞吐量。
    Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple in silico models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying in silico models, in conjunction with the throughput of their empirical experimentation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    植物进化产生的酶可能不是当今农业环境和植物生物技术应用中产量和质量最大化的最佳酶。通过提高酶的性能,应该可以减轻动力学特性或酶不稳定性对产量和质量的限制。通过应用定向进化,酶可以比自然更快地优化,这需要在体外突变靶基因并筛选或选择突变的基因产物以获得所需特征。连续定向进化是一种更有效和可扩展的版本,它通过目标基因的易错复制和宿主细胞的生长速率与目标基因的功能耦合,在体内同时完成诱变和选择步骤。然而,公开的连续系统需要定制的质粒组装,和方便的多用途平台不可用。我们讨论了两种适用于酶连续定向进化的系统,酿酒酵母中的OrthoRep和大肠杆菌中的EvolvR,以及我们为适应高通量植物酶工程的每个系统所做的试点努力。为了测试我们修改的系统,我们使用了硫胺素合成酶THI4,以前被确定为改善的主要候选酶。我们适应的OrthoRep系统显示了高效植物酶工程的希望。
    Plant evolution has produced enzymes that may not be optimal for maximizing yield and quality in today\'s agricultural environments and plant biotechnology applications. By improving enzyme performance, it should be possible to alleviate constraints on yield and quality currently imposed by kinetic properties or enzyme instability. Enzymes can be optimized more quickly than naturally possible by applying directed evolution, which entails mutating a target gene in vitro and screening or selecting the mutated gene products for the desired characteristics. Continuous directed evolution is a more efficient and scalable version that accomplishes the mutagenesis and selection steps simultaneously in vivo via error-prone replication of the target gene and coupling of the host cell\'s growth rate to the target gene\'s function. However, published continuous systems require custom plasmid assembly, and convenient multipurpose platforms are not available. We discuss two systems suitable for continuous directed evolution of enzymes, OrthoRep in Saccharomyces cerevisiae and EvolvR in Escherichia coli, and our pilot efforts to adapt each system for high-throughput plant enzyme engineering. To test our modified systems, we used the thiamin synthesis enzyme THI4, previously identified as a prime candidate for improvement. Our adapted OrthoRep system shows promise for efficient plant enzyme engineering.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    Protein-protein interactions (PPIs) are involved in nearly all cellular processes. PPIs are particularly crucial for mediating selectivity along signaling pathways. Thus, measuring the competitive interplay between PPIs in a cell is important for both understanding fundamental cellular regulation and developing therapeutics targeting those whose dysregulation is associated with disease. A variety of split protein reporter-based tools are available to measure if two proteins interact within a cell and thereby characterize the general determinants of their interactions. PPIs, however, occur within complex networks facilitated by dynamic biophysical nuances that determine activity and selectivity. Evolved, proximity-dependent split T7 RNA polymerase (RNAP) biosensors have recently been used to perform deep mutational scanning of PPI interfaces, and to create synthetic gene circuits. In this chapter, we present the application of proximity-dependent split RNAP biosensors as a method to measure multidimensional PPIs in live cells. Orthogonal split RNAP \"tags\" encode each interaction in a unique RNA signal, thereby enabling the study of multiple competitive PPIs in live cells. Each unique RNA signal can be quantified via established RNA analysis methods. Herein, we provide advice and protocols to aid other researchers in using the split RNAP biosensor, focusing primarily on how to detect multiple PPIs in mammalian cells, including their dynamic interplay in the presence of small molecule inhibitors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Editorial
    分子进化杂志的编辑们聚集在一起,为分子进化领域提出了一系列关键挑战和未来方向。主题包括益生元化学和RNA世界的挑战和新方向,早期细胞基因组和蛋白质的重建,大分子和功能进化,进化细胞生物学,基因组进化,分子进化生态学,病毒系统动力学,理论人口基因组学,体细胞分子进化,和定向进化。虽然我们的努力并不意味着详尽无遗,它反映了分子进化领域的研究问题和问题,这让我们的编辑感到兴奋。
    A collection of the editors of Journal of Molecular Evolution have gotten together to pose a set of key challenges and future directions for the field of molecular evolution. Topics include challenges and new directions in prebiotic chemistry and the RNA world, reconstruction of early cellular genomes and proteins, macromolecular and functional evolution, evolutionary cell biology, genome evolution, molecular evolutionary ecology, viral phylodynamics, theoretical population genomics, somatic cell molecular evolution, and directed evolution. While our effort is not meant to be exhaustive, it reflects research questions and problems in the field of molecular evolution that are exciting to our editors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    The work aiming to unravel the correlation between protein sequence and function in the absence of structural information can be highly rewarding. We present a new way of considering descriptors from the amino acids index database for modeling and predicting the fitness value of a polypeptide chain. This approach includes the following steps: (i) Calculating Q elementary numerical sequences (Ele_SEQ) depending on the encoding of the amino acid residues, (ii) determining an extended numerical sequence (Ext_SEQ) by concatenating the Q elementary numerical sequences, wherein at least one elementary numerical sequence is a protein spectrum obtained by applying fast Fourier transformation (FFT), and (iii) predicting a value of fitness for polypeptide variants (train and/or validation set). These new descriptors were tested on four sets of proteins of different lengths (GLP-2, TNF alpha, cytochrome P450, and epoxide hydrolase) and activities (cAMP activation, binding affinity, thermostability and enantioselectivity). We show that the use of multiple physicochemical descriptors coupled with the implementation of the FFT, taking into account the interactions between residues of amino amides within the protein sequence, could lead to very significant improvement in the quality of models and predictions. The choice of the descriptor or of the combination of descriptors and/or FFT is dependent on the couple protein/fitness. This approach can provide potential users with value added to existing mutant libraries where screening efforts have so far been unsuccessful in finding improved polypeptide mutants for useful applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    Positions identified in directed evolution campaigns or by (semi)rational design can be recombined iteratively or simultaneously. Iterative recombination has yielded many success stories and is beneficially used if screening capabilities are limited (four iterative SSMs generate 20×4=80 different enzyme variants). Simultaneous site saturation mutagenesis offers significantly higher diversity (204 =160 000 variants) and enables greater improvements to be found, especially if the selected positions are in close proximity to each other (cooperative effects). Here we report a first comprehensive comparison of iterative and simultaneous saturation of four residues in Candida parapsilosis alcohol dehydrogenase 5 (cpADH5) with methyl 3-hydroxyhexanoate as substrate. Screening of 7200 clones from 33 site saturation mutagenesis libraries (exploring 17 recombination paths) yielded the cpADH5 W286A variant, with a 82-fold improved initial activity toward methyl 3-hydroxyhexanoate. Screening 3500 clones from a single OmniChange library with four positions (C57, W116, L119, and W286; 1.8 % of the generated sequence space) yielded the cpADH5 C57V/W286S variant, with a 108-fold improvement in initial activity toward methyl 3-hydroxyhexanoate. A 1.8 % coverage of the sequence space of the simultaneous multisite saturation library was, in comparison to the investigated 17 recombination paths, sufficient to identify a cpADH5 variant with improved activity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    定向进化是工程酶功能的一个非常强大的策略。将这种方法应用于糖苷酶为开发化学糖生物学中高度专业化的工具提供了巨大的潜力。进行酶定向进化需要世代,通过随机诱变,必须在高通量测定中筛选大量变体酶的突变文库。用于文库创建的结构指导的“半理性”方法允许研究人员针对特定的氨基酸位置进行诱变,集中突变可能最有效的地方,以产生可管理大小的突变文库,最大限度地减少筛选努力,同时最大限度地提高发现改良突变体的机会。精心设计的化验,可以使用专门准备的基材,能够有效筛选这些突变文库。本章将详细介绍糖苷酶结构指导定向进化的一般方法,先前已用于工程化血型抗原切割酶。
    Directed evolution is an incredibly powerful strategy for engineering enzyme function. Applying this approach to glycosidases offers enormous potential for the development of highly specialized tools in chemical glycobiology. Performing enzyme directed evolution requires the generation, by random mutagenesis, of mutant libraries from which large numbers of variant enzymes must be screened in high-throughput assays. A structure-guided \"semirational\" method for library creation allows researchers to target specific amino acid positions for mutagenesis, concentrating mutations where they might be most effective in order to produce mutant libraries of a manageable size, minimizing screening effort while maximizing the chances of finding improved mutants. Well-designed assays, which may use specially prepared substrates, enable efficient screening of these mutant libraries. This chapter will detail general methods in the structure-guided directed evolution of glycosidases, which have previously been employed in engineering a blood group antigen-cleaving enzyme.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    蛋白质编码基因中的同义突变显著影响翻译效率。我们合成了一对编码绿色荧光蛋白的基因,这些基因被160个同义突变分开,以研究影响翻译效率的关键因素。一个序列针对大肠杆菌(GFP(Eco))进行优化,另一个针对枯草芽孢杆菌(GFP(Bsu))进行优化。当这些基因在大肠杆菌中表达时,GFP(Eco)发出的荧光比GFP(Bsu)强12倍,证实GFP(Bsu)基因的次优性质。然后我们采用定向进化来改善GFP(Bsu)的表达。随机诱变和DNA改组用于产生突变文库,对其进行荧光筛选。鉴定了显示6倍荧光增强的变体,其在Gly-4的罕见密码子中含有单个突变(G10A)。然而,取代产生了另一种罕见的密码子,AGA,对于Arg,这表明这种改善是由罕见密码子以外的因素引起的。我们接下来将饱和诱变应用于Gly-4。最暗的变体含有Gly-4的GGG密码子(GFP(Bsu)-G)。考虑到突变的位置,我们假设起始密码子周围mRNA二级结构的不稳定改善了表达。然后,我们将GFP(Bsu)的5'-非翻译区(5'UTR)中的核苷酸三联体随机化,与Gly-4密码子互补。鉴定了显示6倍荧光增强的变体,表现出不稳定的二级结构。当这个5'UTR序列与GFP(Bsu)-G组合时,实现了22倍的荧光改善。总的来说,起始密码子周围mRNA二级结构的稳定性主要影响翻译效率。
    Synonymous mutations in protein coding genes significantly impact translation efficiency. We synthesized a pair of genes encoding green fluorescent protein that were separated by 160 synonymous mutations to investigate key factors that affect translation efficiency. One sequence was optimized for Escherichia coli (GFP(Eco)) and the other for Bacillus subtilis (GFP(Bsu)). When the genes were expressed in E. coli, GFP(Eco) fluoresced 12-fold stronger than GFP(Bsu), confirming the suboptimal nature of the GFP(Bsu) gene. We then employed directed evolution to improve the expression of GFP(Bsu). Random mutagenesis and DNA shuffling was used to generate mutant libraries, which were screened for fluorescence. A variant showing 6-fold fluorescence enhancement was identified, which contained a single mutation (G10A) in a rare codon for Gly-4. However, the substitution generated another type of rare codon, AGA, for Arg, suggesting that the improvement was caused by a factor other than the rare codon. We next applied saturation mutagenesis to Gly-4. The darkest variant contained a GGG codon (GFP(Bsu)-G) for Gly-4. Taking the location of the mutation into account, we hypothesized that destabilization of the mRNA secondary structure around the initiation codon improved the expression. We then randomized the nucleotide triplet in 5\'-untranslated region (5\'UTR) of GFP(Bsu), which is complementary to the Gly-4 codon. A variant showing 6-fold fluorescence enhancement was identified, which exhibited a destabilized secondary structure. When this 5\'UTR sequence was combined with GFP(Bsu)-G, 22-fold fluorescent improvement was achieved. Collectively, the stability of the mRNA secondary structure around the initiation codon predominantly affected the translation efficiency.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号