Deep mutational scanning

深度突变扫描
  • 文章类型: Journal Article
    B细胞使用其表面表达的B细胞抗原受体(BCR)监视身体的异物,四聚体复合物,其包含结合抗原的膜束缚抗体(mIg)和将这种相互作用传递至B细胞的信号传导二聚体(CD79AB)。IgM和IgG同种型BCR的最新低温电子显微镜(cryo-EM)结构提供了其结构的第一个完整视图。揭示了mIg和CD79AB之间最大的相互作用表面在它们的跨膜结构域(TMD)中。这些结构支持数十年的生化工作,以询问功能性BCR的组装要求,并为解释突变的影响提供基础。在这里,我们报告了集中的饱和诱变,以全面表征BCR表面表达所需的mIgTMD中相互作用的性质。我们在合并竞争测定中同时检查了600个单氨基酸变化的影响,并通过下一代测序量化了它们的影响。我们的深度突变扫描结果反映了一个特征丰富的TMD序列,有些位置完全不能耐受突变,而另一些位置需要特定的生化特性,如电荷,极性或疏水性,强调饱和诱变的高价值,例如,丙氨酸扫描。数据与已发表的诱变和低温EM结构密切相关,同时还突出了几个位置和表面,这些位置和表面以前没有被表征或具有难以纯粹基于结构合理化的效果。这个无偏见和完整的诱变数据集作为知情假设检验的参考和框架,设计疗法以调节BCR表面表达并注释患者突变。
    B cells surveil the body for foreign matter using their surface-expressed B cell antigen receptor (BCR), a tetrameric complex comprising a membrane-tethered antibody (mIg) that binds antigens and a signaling dimer (CD79AB) that conveys this interaction to the B cell. Recent cryogenic electron microscopy (cryo-EM) structures of IgM and IgG isotype BCRs provide the first complete views of their architecture, revealing that the largest interaction surfaces between the mIg and CD79AB are in their transmembrane domains (TMDs). These structures support decades of biochemical work interrogating the requirements for assembly of a functional BCR and provide the basis for explaining the effects of mutations. Here we report a focused saturating mutagenesis to comprehensively characterize the nature of the interactions in the mIg TMD that are required for BCR surface expression. We examined the effects of 600 single-amino-acid changes simultaneously in a pooled competition assay and quantified their effects by next-generation sequencing. Our deep mutational scanning results reflect a feature-rich TMD sequence, with some positions completely intolerant to mutation and others requiring specific biochemical properties such as charge, polarity or hydrophobicity, emphasizing the high value of saturating mutagenesis over, for example, alanine scanning. The data agree closely with published mutagenesis and the cryo-EM structures, while also highlighting several positions and surfaces that have not previously been characterized or have effects that are difficult to rationalize purely based on structure. This unbiased and complete mutagenesis dataset serves as a reference and framework for informed hypothesis testing, design of therapeutics to regulate BCR surface expression and to annotate patient mutations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    多结构域酶可以通过结构域间相互作用和催化结构域固有的结构特征来调节。酪氨酸磷酸酶SHP2是由域间相互作用调节的多结构域蛋白的典型实例。该酶具有蛋白酪氨酸磷酸酶(PTP)结构域和两个磷酸酪氨酸识别结构域(N-SH2和C-SH2),它们通过自抑制性相互作用调节磷酸酶活性。SHP2通过与SH2结构域结合的磷蛋白被规范激活,这导致了大的域间重排,但自身抑制也可能被疾病相关突变破坏。SHP2激活机制的许多细节仍不清楚,生理相关的活性构象仍然难以捉摸,和数百种SHP2的人类变体尚未被功能表征。这里,我们对全长SHP2及其分离的PTP域进行深度突变扫描,以检查突变对域间调控和催化活性的影响。我们的实验提供了SHP2突变敏感性的全面图谱,在存在和不存在域间调控的情况下。再加上分子动力学模拟,我们的研究揭示了控制SHP2自抑制和活性状态稳定性的新结构特征。我们的分析还确定了SHP2活性位点之外的控制PTP结构域动力学和固有催化活性的关键残基。这项工作扩展了我们对SHP2调控的理解,并为SHP2致病性提供了新的见解。
    Multi-domain enzymes can be regulated by both inter-domain interactions and structural features intrinsic to the catalytic domain. The tyrosine phosphatase SHP2 is a quintessential example of a multi-domain protein that is regulated by inter-domain interactions. This enzyme has a protein tyrosine phosphatase (PTP) domain and two phosphotyrosine-recognition domains (N-SH2 and C-SH2) that regulate phosphatase activity through autoinhibitory interactions. SHP2 is canonically activated by phosphoprotein binding to the SH2 domains, which causes large inter-domain rearrangements, but autoinhibition can also be disrupted by disease-associated mutations. Many details of the SHP2 activation mechanism are still unclear, the physiologically-relevant active conformations remain elusive, and hundreds of human variants of SHP2 have not been functionally characterized. Here, we perform deep mutational scanning on both full-length SHP2 and its isolated PTP domain to examine mutational effects on inter-domain regulation and catalytic activity. Our experiments provide a comprehensive map of SHP2 mutational sensitivity, both in the presence and absence of inter-domain regulation. Coupled with molecular dynamics simulations, our investigation reveals novel structural features that govern the stability of the autoinhibited and active states of SHP2. Our analysis also identifies key residues beyond the SHP2 active site that control PTP domain dynamics and intrinsic catalytic activity. This work expands our understanding of SHP2 regulation and provides new insights into SHP2 pathogenicity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    提交给MaveDB的变体功能测定的大规模实验措施有可能为解决不确定意义的变体提供关键信息,但是报告与测定序列相关的结果阻碍了它们的下游效用。变异效应图谱联盟将变异效应数据的多重分析映射到人类参考序列,创建一组健壮的机器可读的同源性映射。这种方法在MaveDB中处理了大约250万个蛋白质和基因组变体,成功地映射98.61%的检查变体和传播数据的资源,如UCSC基因组浏览器和Ensembl变异效应预测。
    The large-scale experimental measures of variant functional assays submitted to MaveDB have the potential to provide key information for resolving variants of uncertain significance, but the reporting of results relative to assayed sequence hinders their downstream utility. The Atlas of Variant Effects Alliance mapped multiplexed assays of variant effect data to human reference sequences, creating a robust set of machine-readable homology mappings. This method processed approximately 2.5 million protein and genomic variants in MaveDB, successfully mapping 98.61% of examined variants and disseminating data to resources such as the UCSC Genome Browser and Ensembl Variant Effect Predictor.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    腺相关病毒2(AAV2)是以其感染人类细胞和类似生物体的能力而闻名的微小病毒。它们最近成为基因治疗领域的杰出候选者,主要归因于它们在人类中固有的非致病性以及与它们的操纵相关的安全性。AAV2作为基因治疗载体的功效取决于它们浸润宿主细胞的能力,一种依赖于它们构建能够破坏靶细胞细胞核的衣壳的能力的现象。为了增强他们的感染潜力,研究人员通过将突变引入衣壳来广泛审查各种组合文库,旨在提高他们的效率。高通量实验技术的出现,比如深度突变扫描(DMS),已经使通过实验评估这些图书馆的适用性达到预期目的变得可行。值得注意的是,机器学习开始展示其在从序列数据解决突变景观中的预测方面的潜力。在这种情况下,我们引入了一个生物物理启发的模型,旨在预测DMS实验中遗传变异的生存能力。该模型是针对AAV2衣壳蛋白中CAP区域的特定片段而定制的。为了评估其有效性,我们用不同的数据集进行模型训练,每个人都量身定制,以探索受选择过程影响的突变景观的不同方面。我们对生物物理模型的评估集中在两个主要目标上:(i)为变体的对数选择性提供定量预测,以及(ii)将其部署为二元分类器以将序列分类为可行和非可行类别。
    Adeno-associated viruses 2 (AAV2) are minute viruses renowned for their capacity to infect human cells and akin organisms. They have recently emerged as prominent candidates in the field of gene therapy, primarily attributed to their inherent non-pathogenic nature in humans and the safety associated with their manipulation. The efficacy of AAV2 as gene therapy vectors hinges on their ability to infiltrate host cells, a phenomenon reliant on their competence to construct a capsid capable of breaching the nucleus of the target cell. To enhance their infection potential, researchers have extensively scrutinized various combinatorial libraries by introducing mutations into the capsid, aiming to boost their effectiveness. The emergence of high-throughput experimental techniques, like deep mutational scanning (DMS), has made it feasible to experimentally assess the fitness of these libraries for their intended purpose. Notably, machine learning is starting to demonstrate its potential in addressing predictions within the mutational landscape from sequence data. In this context, we introduce a biophysically-inspired model designed to predict the viability of genetic variants in DMS experiments. This model is tailored to a specific segment of the CAP region within AAV2\'s capsid protein. To evaluate its effectiveness, we conduct model training with diverse datasets, each tailored to explore different aspects of the mutational landscape influenced by the selection process. Our assessment of the biophysical model centers on two primary objectives: (i) providing quantitative forecasts for the log-selectivity of variants and (ii) deploying it as a binary classifier to categorize sequences into viable and non-viable classes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    由于缺乏足够的临床病例报告,解释在人口规模测序工作中发现的大量罕见遗传变异并破译它们与人类健康和疾病的关联是一个严峻的挑战。克服这个问题的一个有希望的途径是深度突变扫描(DMS),在模型细胞系中引入和评估大规模遗传变异的方法。DMS允许对变体进行公正的调查,包括那些在临床报告中没有发现的,从而改善罕见疾病诊断。目前,限制DMS全部潜力的主要障碍是疾病机制特异性功能测定的可用性。因此,我们探索了适合检查广泛疾病机制的高通量功能方法.我们特别关注不需要机器人或自动化的方法,而是使用精心设计的分子工具将生物机制转化为易于检测的信号。如细胞存活率,荧光或耐药性。这里,我们的目标是弥合疾病相关检测方法与纳入DMS框架之间的差距.
    Interpreting the wealth of rare genetic variants discovered in population-scale sequencing efforts and deciphering their associations with human health and disease present a critical challenge due to the lack of sufficient clinical case reports. One promising avenue to overcome this problem is deep mutational scanning (DMS), a method of introducing and evaluating large-scale genetic variants in model cell lines. DMS allows unbiased investigation of variants, including those that are not found in clinical reports, thus improving rare disease diagnostics. Currently, the main obstacle limiting the full potential of DMS is the availability of functional assays that are specific to disease mechanisms. Thus, we explore high-throughput functional methodologies suitable to examine broad disease mechanisms. We specifically focus on methods that do not require robotics or automation but instead use well-designed molecular tools to transform biological mechanisms into easily detectable signals, such as cell survival rate, fluorescence or drug resistance. Here, we aim to bridge the gap between disease-relevant assays and their integration into the DMS framework.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    深度突变扫描(DMS)分析是通过测量数千个序列变体对蛋白质功能的影响来研究序列-功能关系的强大工具。在DMS实验中,一些技术伪像可能会非线性地扭曲所获得的功能分数,可能会对结果的解释产生偏差。因此,我们在deepPCA工作流程中测试了几个技术参数,蛋白质-蛋白质相互作用的DMS测定,以识别非线性的技术来源。我们发现许多DMS测定共有的参数,如转化DNA的量,收获和文库组成的时间点可能会导致数据的非线性。以最小化这些非线性效应的方式设计实验将改善突变效应的量化和解释。
    Deep Mutational Scanning (DMS) assays are powerful tools to study sequence-function relationships by measuring the effects of thousands of sequence variants on protein function. During a DMS experiment, several technical artefacts might distort non-linearly the functional score obtained, potentially biasing the interpretation of the results. We therefore tested several technical parameters in the deepPCA workflow, a DMS assay for protein-protein interactions, in order to identify technical sources of non-linearities. We found that parameters common to many DMS assays such as amount of transformed DNA, timepoint of harvest and library composition can cause non-linearities in the data. Designing experiments in a way to minimize these non-linear effects will improve the quantification and interpretation of mutation effects.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    转录因子PAX6的无义和错义突变引起广泛的眼部发育缺陷,包括无虹膜,小眼症和结肠瘤。为了了解PAX6:DNA结合的变化如何导致这些表型,我们将PAX6配对结构域的饱和诱变与酵母单杂交(Y1H)分析相结合,其中PAX6-GAL4融合基因的表达驱动抗生素抗性.我们定量了超过2700个单一氨基酸变体与两个DNA序列元件的结合。N端亚结构域和接头区的面向DNA残基的突变是最有害的,脯氨酸和带负电荷的残基的突变。许多变体引起序列特异性分子功能增益效应,包括在71位的变体,其增加与LE9增强子的结合但降低与SELEX衍生的结合位点的结合。在没有抗生素选择的情况下,保留DNA结合的变体减缓了酵母的生长,可能是因为这些变异扰乱了酵母转录组。针对已知患者变异进行基准测试,并将ACMG/AMP指南应用于变异分类,我们获得了支持性到中度证据,即977个变异体可能是致病性的,1306个变异体可能是良性的.我们的分析表明,PAX6配对结构域中的大多数致病突变可以简单地通过这些突变对PAX6的影响来解释:DNA关联,并建立了Y1H作为解释转录因子变异效应的通用测定法。
    Nonsense and missense mutations in the transcription factor PAX6 cause a wide range of eye development defects, including aniridia, microphthalmia and coloboma. To understand how changes of PAX6:DNA binding cause these phenotypes, we combined saturation mutagenesis of the paired domain of PAX6 with a yeast one-hybrid (Y1H) assay in which expression of a PAX6-GAL4 fusion gene drives antibiotic resistance. We quantified binding of more than 2700 single amino-acid variants to two DNA sequence elements. Mutations in DNA-facing residues of the N-terminal subdomain and linker region were most detrimental, as were mutations to prolines and to negatively charged residues. Many variants caused sequence-specific molecular gain-of-function effects, including variants in position 71 that increased binding to the LE9 enhancer but decreased binding to a SELEX-derived binding site. In the absence of antibiotic selection, variants that retained DNA binding slowed yeast growth, likely because such variants perturbed the yeast transcriptome. Benchmarking against known patient variants and applying ACMG/AMP guidelines to variant classification, we obtained supporting-to-moderate evidence that 977 variants are likely pathogenic and 1306 are likely benign. Our analysis shows that most pathogenic mutations in the paired domain of PAX6 can be explained simply by the effects of these mutations on PAX6:DNA association, and establishes Y1H as a generalisable assay for the interpretation of variant effects in transcription factors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蛋白质的遗传结构——其序列产生其功能的一组因果规则——也决定了其可能的进化轨迹。先前的研究表明,蛋白质的遗传结构非常复杂,具有普遍的上位性相互作用,限制了进化并使功能难以从序列中预测。大多数这项工作只分析了两种感兴趣的蛋白质之间的直接路径-排除了绝大多数可能的基因型和进化轨迹-并且只考虑了单一的蛋白质功能。未解决功能特异性的遗传结构及其对新功能进化的影响。这里,我们开发了一种基于序数逻辑回归的新方法,可以从20态组合深度突变扫描(DMS)实验中直接表征多种蛋白质功能的全局遗传决定因素。我们用它来剖析转录因子对DNA特异性的遗传结构和进化,使用来自古代类固醇激素受体的组合DMS的数据激活两个生物相关DNA元件的转录能力。我们表明,DNA识别的遗传结构由一组密集的主要和配对效应组成,这些效应涉及蛋白质-DNA界面中几乎所有可能的氨基酸状态,但是高阶上位只起到很小的作用。成对相互作用扩大了功能序列集,并且是不同DNA元件特异性的主要决定因素。他们还大量扩大了单残基突变将特异性从一个DNA靶标切换到另一个的机会。通过将具有不同功能的变体在序列空间中靠近在一起,因此,成对上位促进而不是限制新功能的发展。
    A protein\'s genetic architecture - the set of causal rules by which its sequence produces its functions - also determines its possible evolutionary trajectories. Prior research has proposed that the genetic architecture of proteins is very complex, with pervasive epistatic interactions that constrain evolution and make function difficult to predict from sequence. Most of this work has analyzed only the direct paths between two proteins of interest - excluding the vast majority of possible genotypes and evolutionary trajectories - and has considered only a single protein function, leaving unaddressed the genetic architecture of functional specificity and its impact on the evolution of new functions. Here, we develop a new method based on ordinal logistic regression to directly characterize the global genetic determinants of multiple protein functions from 20-state combinatorial deep mutational scanning (DMS) experiments. We use it to dissect the genetic architecture and evolution of a transcription factor\'s specificity for DNA, using data from a combinatorial DMS of an ancient steroid hormone receptor\'s capacity to activate transcription from two biologically relevant DNA elements. We show that the genetic architecture of DNA recognition consists of a dense set of main and pairwise effects that involve virtually every possible amino acid state in the protein-DNA interface, but higher-order epistasis plays only a tiny role. Pairwise interactions enlarge the set of functional sequences and are the primary determinants of specificity for different DNA elements. They also massively expand the number of opportunities for single-residue mutations to switch specificity from one DNA target to another. By bringing variants with different functions close together in sequence space, pairwise epistasis therefore facilitates rather than constrains the evolution of new functions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    乙型肝炎病毒(HBV)是一种小型双链DNA病毒,可慢性感染2.96亿人。超过一半的紧凑基因组在两个重叠的阅读框中编码蛋白质,在进化过程中,多种选择压力可以作用于共享的核苷酸。这项研究将基于RNA的HBV细胞培养系统与深度突变扫描(DMS)相结合,以消除HBV基因组中的顺式和反式作用序列要求。结果支持聚合酶翻译的泄漏核糖体扫描模型,提供HBV聚合酶在单核苷酸分辨率的健身图,并确定与HBV聚合酶终止密码子相邻的保守脯氨酸,使核糖体停滞。进一步的实验表明,停滞的核糖体将新生的聚合酶束缚在其模板RNA上,确保HBV基因组的顺式优先RNA包装和逆转录。
    Hepatitis B virus (HBV) is a small double-stranded DNA virus that chronically infects 296 million people. Over half of its compact genome encodes proteins in two overlapping reading frames, and during evolution, multiple selective pressures can act on shared nucleotides. This study combines an RNA-based HBV cell culture system with deep mutational scanning (DMS) to uncouple cis- and trans-acting sequence requirements in the HBV genome. The results support a leaky ribosome scanning model for polymerase translation, provide a fitness map of the HBV polymerase at single-nucleotide resolution, and identify conserved prolines adjacent to the HBV polymerase termination codon that stall ribosomes. Further experiments indicated that stalled ribosomes tether the nascent polymerase to its template RNA, ensuring cis-preferential RNA packaging and reverse transcription of the HBV genome.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    活动质体病原体,包括布鲁氏锥虫,T.Cruzi,和利什曼原虫物种,早期分歧,真核生物,单细胞寄生虫.对来自这些病原体的许多蛋白质的功能理解受到与来自其他模型生物的蛋白质的有限序列同源性的阻碍。在这里,我们描述了在T.brucei中高通量深度突变扫描方法的开发,该方法有助于快速和无偏见地评估蛋白质中许多可能的氨基酸取代对细胞适应性的影响。通过相对细胞生长来衡量。该方法利用了几种分子技术:具有感兴趣的野生型基因的条件表达和突变变体文库的组成型表达的细胞,degron控制的I-SceI大范围核酸酶的稳定,以介导突变等位基因文库的高效转染,和高通量测序读数,用于有条件敲除野生型基因表达和突变变体的排他性表达后的细胞生长。使用此方法,我们查询了KREPB4(B4)的明显非催化RNaseIII样结构域中氨基酸取代的影响,它是RNA编辑催化复合物(RECs)的重要组成部分,该复合物在布鲁氏菌中进行线粒体RNA编辑。我们测量了数千种B4变体对血流形式细胞生长的影响,并验证了含有单个氨基酸取代的最有害变体。至关重要的是,表型和氨基酸保守性之间没有相关性,证明了这种方法比传统的序列同源性搜索更强大,可以识别功能残基。血流形式细胞生长表型与结构模型相结合,RECC蛋白质接近度数据,并分析了顺环形式布氏T.brucei中的选定取代。这些分析表明,B4RNaseIII样结构域对于维持RECC完整性和RECC蛋白丰度至关重要,并且还参与在血流和前循环形式生命周期阶段之间发生的RECS变化。
    Kinetoplastid pathogens including Trypanosoma brucei, T. cruzi, and Leishmania species, are early diverged, eukaryotic, unicellular parasites. Functional understanding of many proteins from these pathogens has been hampered by limited sequence homology to proteins from other model organisms. Here we describe the development of a high-throughput deep mutational scanning approach in T. brucei that facilitates rapid and unbiased assessment of the impacts of many possible amino acid substitutions within a protein on cell fitness, as measured by relative cell growth. The approach leverages several molecular technologies: cells with conditional expression of a wild-type gene of interest and constitutive expression of a library of mutant variants, degron-controlled stabilization of I-SceI meganuclease to mediate highly efficient transfection of a mutant allele library, and a high-throughput sequencing readout for cell growth upon conditional knockdown of wild-type gene expression and exclusive expression of mutant variants. Using this method, we queried the effects of amino acid substitutions in the apparently non-catalytic RNase III-like domain of KREPB4 (B4), which is an essential component of the RNA Editing Catalytic Complexes (RECCs) that carry out mitochondrial RNA editing in T. brucei. We measured the impacts of thousands of B4 variants on bloodstream form cell growth and validated the most deleterious variants containing single amino acid substitutions. Crucially, there was no correlation between phenotypes and amino acid conservation, demonstrating the greater power of this method over traditional sequence homology searching to identify functional residues. The bloodstream form cell growth phenotypes were combined with structural modeling, RECC protein proximity data, and analysis of selected substitutions in procyclic form T. brucei. These analyses revealed that the B4 RNaseIII-like domain is essential for maintenance of RECC integrity and RECC protein abundances and is also involved in changes in RECCs that occur between bloodstream and procyclic form life cycle stages.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号