colabfold

ColabFold
  • 文章类型: Journal Article
    AlphaFold2(AF2)是近年来出现的一项开创性创新,彻底改变了几个科学领域。特别是结构生物学,药物设计,以及疾病机制的阐明。许多科学家现在每天都在使用AF2,包括非专业用户。本章针对的是后者。这里讨论了充分利用AF2产生高质量生物模型的技巧和技巧。我们建议非专业用户在使用AF2模型时如何保持批判性观点,并提供有关如何正确评估它们的指南。在展示了如何使用ColabFold执行我们自己的结构预测之后,我们列出了通过添加原始AF2模型中缺少的信息来改进AF2模型的几种方法。通过使用AlphaFill等软件向模型中添加辅因子和配体,或在半胱氨酸之间添加二硫键的模型,我们引导用户建立适合药物设计等应用的高质量生物模型,蛋白质相互作用,或分子动力学研究。
    AlphaFold2 (AF2) has emerged in recent years as a groundbreaking innovation that has revolutionized several scientific fields, in particular structural biology, drug design, and the elucidation of disease mechanisms. Many scientists now use AF2 on a daily basis, including non-specialist users. This chapter is aimed at the latter. Tips and tricks for getting the most out of AF2 to produce a high-quality biological model are discussed here. We suggest to non-specialist users how to maintain a critical perspective when working with AF2 models and provide guidelines on how to properly evaluate them. After showing how to perform our own structure prediction using ColabFold, we list several ways to improve AF2 models by adding information that is missing from the original AF2 model. By using software such as AlphaFill to add cofactors and ligands to the models, or MODELLER to add disulfide bridges between cysteines, we guide users to build a high-quality biological model suitable for applications such as drug design, protein interaction, or molecular dynamics studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    人工智能彻底改变了蛋白质结构预测领域。然而,随着更强大、更复杂的软件的开发,它是可访问性和易用性,而不是功能,正在迅速成为最终用户的限制因素。LazyAF是一个基于GoogleColaboratory的管道,它集成了现有的ColabFoldBATCH软件,以简化中等规模的蛋白质-蛋白质相互作用预测过程。LazyAF用于预测在广泛宿主范围的多药抗性质粒RK2上编码的76种蛋白质的相互作用组,证明了管道提供的易用性和可及性。
    Artificial intelligence has revolutionized the field of protein structure prediction. However, with more powerful and complex software being developed, it is accessibility and ease of use rather than capability that is quickly becoming a limiting factor to end users. LazyAF is a Google Colaboratory-based pipeline which integrates the existing ColabFold BATCH software to streamline the process of medium-scale protein-protein interaction prediction. LazyAF was used to predict the interactome of the 76 proteins encoded on the broad-host-range multi-drug resistance plasmid RK2, demonstrating the ease and accessibility the pipeline provides.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    细胞分裂素(CK)是最重要的植物激素之一,可调节植物中的各种过程。CK受体,传感器混合组氨酸激酶,是20多年前发现的,但是对于植物生物学家来说,其信号传导的结构基础仍然是一个挑战。迄今为止,只有CK受体结构的两个片段,传感模块和接收器域,通过实验解决。其他一些区域是通过基于与CK受体同源的蛋白质结构的分子建模来建立的。然而,从长远来看,这些数据已被证明不足以解决全尺寸CK受体的结构。CK受体的功能单位是受体二聚体。在这篇文章中,首次提出了基于AlphaFoldMultimer和ColabFold模型的全长CK受体二聚体形式的分子结构。可见受体与磷酸转移蛋白相互作用后的结构变化。根据数学模拟和现有数据,两种类型的二聚体受体与激素的复合物,一半或完全结合,在触发信号中似乎是活跃的。此外,概述了使用该模型和类似模型来解决CK信令的剩余基本问题的前景。
    Cytokinins (CK) are one of the most important classes of phytohormones that regulate a wide range of processes in plants. A CK receptor, a sensor hybrid histidine kinase, was discovered more than 20 years ago, but the structural basis for its signaling is still a challenge for plant biologists. To date, only two fragments of the CK receptor structure, the sensory module and the receiver domain, were experimentally resolved. Some other regions were built up by molecular modeling based on structures of proteins homologous to CK receptors. However, in the long term, these data have proven insufficient for solving the structure of the full-sized CK receptor. The functional unit of CK receptor is the receptor dimer. In this article, a molecular structure of the dimeric form of the full-length CK receptor based on AlphaFold Multimer and ColabFold modeling is presented for the first time. Structural changes of the receptor upon interacting with phosphotransfer protein are visualized. According to mathematical simulation and available data, both types of dimeric receptor complexes with hormones, either half- or fully liganded, appear to be active in triggering signals. In addition, the prospects of using this and similar models to address remaining fundamental problems of CK signaling were outlined.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Comparative Study
    蛋白质结构测定是生物学研究的一个关键方面,使我们能够了解蛋白质的功能和潜在的应用。深度学习和人工智能的最新进展导致了几种蛋白质结构预测工具的开发。例如AlphaFold2和ColabFold。然而,它们的性能主要是在特征明确的蛋白质上进行评估的,它们在缺乏实验结构的蛋白质上的表现,比如很多蛇毒毒素,受到的审查较少。在这项研究中,我们评估了三种模型工具对1000多种蛇毒毒素结构的预测,这些结构不存在实验结构。我们的研究结果表明,AlphaFold2(AF2)在所有评估参数中表现最好。我们还观察到ColabFold(CF)的得分仅比AF2稍差,而计算强度较低。所有工具都在内在紊乱的区域挣扎,如环和前肽区域,并且在预测功能结构域的结构方面表现良好。总的来说,我们的研究强调了在处理没有实验结构的蛋白质时谨慎行事的重要性,特别是那些大的,包含灵活的区域。尽管如此,利用计算结构预测工具可以为蛋白质与不同靶标相互作用的建模提供有价值的见解,并揭示潜在的结合位点,活跃的网站,和构象变化,以及用于试剂的潜在分子粘合剂的设计,诊断,或治疗目的。
    Protein structure determination is a critical aspect of biological research, enabling us to understand protein function and potential applications. Recent advances in deep learning and artificial intelligence have led to the development of several protein structure prediction tools, such as AlphaFold2 and ColabFold. However, their performance has primarily been evaluated on well-characterised proteins and their ability to predict sturtctures of proteins lacking experimental structures, such as many snake venom toxins, has been less scrutinised. In this study, we evaluated three modelling tools on their prediction of over 1000 snake venom toxin structures for which no experimental structures exist. Our findings show that AlphaFold2 (AF2) performed the best across all assessed parameters. We also observed that ColabFold (CF) only scored slightly worse than AF2, while being computationally less intensive. All tools struggled with regions of intrinsic disorder, such as loops and propeptide regions, and performed well in predicting the structure of functional domains. Overall, our study highlights the importance of exercising caution when working with proteins with no experimental structures available, particularly those that are large and contain flexible regions. Nonetheless, leveraging computational structure prediction tools can provide valuable insights into the modelling of protein interactions with different targets and reveal potential binding sites, active sites, and conformational changes, as well as into the design of potential molecular binders for reagent, diagnostic, or therapeutic purposes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    镁(Mg2)是细胞中最丰富的二价阳离子,对于许多细胞过程至关重要。尽管它很重要,细胞内Mg2转运及其调节的机制知之甚少。MgtE是几乎一半细菌物种的主要Mg2转运系统,是哺乳动物SLC41A1转运蛋白的直系同源物,与神经退行性疾病和癌症有关。直到日期,只有来自嗜热菌(MgtETT)的MgtE主要在洗涤剂胶束中被广泛表征,生物相关膜中与门控相关的结构动力学很少。MgtE同源物从芽孢杆菌(MgtEBF)是独特的,因为它缺乏整个Mg2+传感N-结构域,但在TM域中具有保守的结构基序,用于Mg2转运。在这项工作中,我们已经成功地纯化了这种新的同源物,以稳定和功能的形式,和ColabFold结构预测分析表明同型二聚体。Further,微尺度热电泳(MST)实验表明,MgtEBF结合Mg2+和ATP,类似于MgtETT。重要的是,我们证明,尽管缺少N域,在重建的蛋白脂质体中存在向内定向的Mg2梯度的情况下,MgtEBF介导Mg2转运功能。此外,比较膜模拟物中MgtEBFTM域中Trp残基的组织和动力学,在apo和Mg2+结合形式中,表明Mg2的细胞质结合可能涉及TM结构域的适度的门控相关构象变化。总的来说,我们的结果表明,与门控相关的结构动力学(水化动力学,全长MgtEBF的构象异质性)在功能相关的膜环境中发生了显着变化,强调脂质-蛋白质相互作用在MgtE门控机制中的重要性。
    Magnesium (Mg2+) is the most abundant divalent cation in the cell and is critical for numerous cellular processes. Despite its importance, the mechanisms of intracellular Mg2+ transport and its regulation are poorly understood. MgtE is the main Mg2+ transport system in almost half of bacterial species and is an ortholog of mammalian SLC41A1 transporters, which are implicated in neurodegenerative diseases and cancer. To date, only MgtE from Thermus thermophilus (MgtETT) has been extensively characterized, mostly in detergent micelles, and gating-related structural dynamics in biologically relevant membranes are scarce. The MgtE homolog from Bacillus firmus (MgtEBF) is unique since it lacks the entire Mg2+-sensing N-domain but has conserved structural motifs in the TM-domain for Mg2+ transport. In this work, we have successfully purified this novel homolog in a stable and functional form, and ColabFold structure prediction analysis suggests a homodimer. Further, microscale thermophoresis experiments show that MgtEBF binds Mg2+ and ATP, similar to MgtETT. Importantly, we show that, despite lacking the N-domain, MgtEBF mediates Mg2+ transport function in the presence of an inwardly directed Mg2+ gradient in reconstituted proteoliposomes. Furthermore, comparison of the organization and dynamics of Trp residues in the TM-domain of MgtEBF in membrane mimetics, in apo- and Mg2+-bound forms, suggests that the cytoplasmic binding of Mg2+ might involve modest gating-related conformational changes at the TM-domain. Overall, our results show that the gating-related structural dynamics (hydration dynamics, conformational heterogeneity) of the full-length MgtEBF is significantly changed in functionally pertinent membrane environment, emphasizing the importance of lipid-protein interactions in MgtE gating mechanisms.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    植物作为生产重组蛋白的具有成本效益和可扩展的平台正在获得吸引力。然而,由于其疏水性质,在植物中表达整合膜蛋白具有挑战性。在我们的研究中,我们分别在烟草和茶花中使用瞬时和稳定表达系统来表达SARS-CoV-2E和M整合蛋白,并将它们靶向脂滴(LD)。LD为折叠疏水性蛋白质提供了理想的环境,并有助于通过浮选进行纯化。我们测试了具有不同接头和标签的各种蛋白质融合体,并使用3D结构预测来评估其效果。E和M主要位于本氏烟草叶片的ER中,但当与油质蛋白融合时,E可以靶向积聚油的烟草中的LD。LD积分蛋白。在茶花种子中,然而,发现E和M与纯化的LD相关。通过油质蛋白增强LDs内E和M的积累,我们在纯化的漂浮部分中富集了这些蛋白质。该策略提供了使用植物系统有效生产和纯化疏水性药物和疫苗的替代方法。本文受版权保护。保留所有权利。
    Plants are gaining traction as a cost-effective and scalable platform for producing recombinant proteins. However, expressing integral membrane proteins in plants is challenging due to their hydrophobic nature. In our study, we used transient and stable expression systems in Nicotiana benthamiana and Camelina sativa respectively to express SARS-CoV-2 E and M integral proteins, and target them to lipid droplets (LDs). LDs offer an ideal environment for folding hydrophobic proteins and aid in their purification through flotation. We tested various protein fusions with different linkers and tags and used three dimensional structure predictions to assess their effects. E and M mostly localized in the ER in N. benthamiana leaves but E could be targeted to LDs in oil accumulating tobacco when fused with oleosin, a LD integral protein. In Camelina sativa seeds, E and M were however found associated with purified LDs. By enhancing the accumulation of E and M within LDs through oleosin, we enriched these proteins in the purified floating fraction. This strategy provides an alternative approach for efficiently producing and purifying hydrophobic pharmaceuticals and vaccines using plant systems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    茉莉酸(JAs)是植物激素,在发育和逆境中具有关键作用。它们通过介导称为JAZ的MYC抑制剂的蛋白水解来激活MYC转录因子。在JA缺席的情况下,JAZ蛋白通过组装MYC-JAZ-NINJA-TPL抑制复合物结合和抑制MYC。然而,JAZ和NINJA预计在很大程度上是非结构化的,这排除了他们的实验结构确定。通过生化的结合,突变,以及生物物理分析和AlphaFold衍生的ColabFold建模,我们表征了JAZ-JAZ和JAZ-NINJA相互作用,并生成了详细的模型,高置信度域接口。我们证明JAZ,NINJA,和MYC接口域在隔离中是动态的,并且在复杂的组装后以逐步的顺序变得稳定。相比之下,界面外的大多数JAZ和NINJA区域保持高度动态,无法在单一构象中建模。我们的数据表明,小JAZZIM基序通过单独的表面介导JAZ-JAZ和JAZ-NINJA相互作用,并进一步表明NINJA调节JAZ二聚化。这项研究通过提供对动力学的洞察来推进JA信号传导的知识,互动,和JA阻遏复合物的JAZ-NINJA核心的结构。
    Jasmonates (JAs) are plant hormones with crucial roles in development and stress resilience. They activate MYC transcription factors by mediating the proteolysis of MYC inhibitors called JAZ proteins. In the absence of JA, JAZ proteins bind and inhibit MYC through the assembly of MYC-JAZ-Novel Interactor of JAZ (NINJA)-TPL repressor complexes. However, JAZ and NINJA are predicted to be largely intrinsically unstructured, which has precluded their experimental structure determination. Through a combination of biochemical, mutational, and biophysical analyses and AlphaFold-derived ColabFold modeling, we characterized JAZ-JAZ and JAZ-NINJA interactions and generated models with detailed, high-confidence domain interfaces. We demonstrate that JAZ, NINJA, and MYC interface domains are dynamic in isolation and become stabilized in a stepwise order upon complex assembly. By contrast, most JAZ and NINJA regions outside of the interfaces remain highly dynamic and cannot be modeled in a single conformation. Our data indicate that the small JAZ Zinc finger expressed in Inflorescence Meristem (ZIM) motif mediates JAZ-JAZ and JAZ-NINJA interactions through separate surfaces, and our data further suggest that NINJA modulates JAZ dimerization. This study advances our understanding of JA signaling by providing insights into the dynamics, interactions, and structure of the JAZ-NINJA core of the JA repressor complex.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    FtsQBL是位于细菌分裂体组装中途的重要分子复合物。为了可视化和理解它的结构,以及它的膜锚定的后果,我们使用深度学习预测工具产生了大肠杆菌复合体的模型,AlphaFold2.将异源三聚体模型插入3脂质模型膜中,并进行500ns原子分子动力学模拟。该模型质量极好,捕获了大多数实验得出的结构特征,在二级结构和侧链水平。该模型由所有三种蛋白质的C末端区域贡献的独特互锁模块组成。FtsB和FtsL的功能上重要的收缩控制域残基位于距膜表面约43-49的固定垂直位置。虽然所有三种蛋白质的周质结构域都是明确的和刚性的,每个的单个跨膜螺旋都是柔性的,它们的集体扭曲和弯曲有助于大多数结构变化,根据主成分分析。仅考虑FtsQ,相对于其复合状态,该蛋白质在其自由状态下更灵活-最大的结构变化位于跨膜螺旋和α结构域之间的肘部。FtsQ和FtsL的无序N端结构域与内膜的细胞质表面缔合,而不是自由地进入溶剂。接触网络分析强调了FtsQBL中互锁三聚体模块的形成,在调节复合体的整体结构中起着核心作用。
    The FtsQBL is an essential molecular complex sitting midway through bacterial divisome assembly. To visualize and understand its structure, and the consequences of its membrane anchorage, we produced a model of the E. coli complex using the deep-learning prediction utility, AlphaFold 2. The heterotrimeric model was inserted into a 3-lipid model membrane and subjected to a 500-ns atomistic molecular dynamics simulation. The model is superb in quality and captures most experimentally derived structural features, at both the secondary structure and the side-chain levels. The model consists of a uniquely interlocking module contributed by the C-terminal regions of all three proteins. The functionally important constriction control domain residues of FtsB and FtsL are located at a fixed vertical position of ∼43-49 Å from the membrane surface. While the periplasmic domains of all three proteins are well-defined and rigid, the single transmembrane helices of each are flexible and their collective twisting and bending contribute to most structural variations, according to principal component analysis. Considering FtsQ only, the protein is more flexible in its free state relative to its complexed state-with the biggest structural changes located at the elbow between the transmembrane helix and the α-domain. The disordered N-terminal domains of FtsQ and FtsL associate with the cytoplasmic surface of the inner membrane instead of freely venturing into the solvent. Contact network analysis highlighted the formation of the interlocking trimeric module in FtsQBL as playing a central role in mediating the overall structure of the complex.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    细胞器之间的脂质转移需要在脂质穿过细胞质时屏蔽脂质的疏水部分的蛋白质。在过去的十年中,已经发现了一种新的脂质转移蛋白(LTP)结构形式:由β-折叠构成的长疏水凹槽,在膜接触部位的细胞器之间桥接。真核生物有五个桥状LTPs家族:VPS13,ATG2,SHIP164,霍比特人和Tweek。这些通过由一个域组成的桥梁被统一成一个单一的超家族,称为重复β沟(RBG)域,它构建成具有疏水衬里凹槽和亲水外部的杆状多聚体。这里,对RBG超家族的序列和预测结构进行了深入分析。系统发育学表明,最后一个真核共同祖先包含所有五个RBG蛋白,与复制的VPS13。当前的一组长RBG蛋白似乎已经出现在甚至更早的祖先中,具有4个RBG结构域的较短形式。大多数RBG蛋白的末端具有两亲螺旋,可能是直接或间接双层相互作用的适应,尽管这还有待于测试。对此的一个例外是SHIP164的C末端,其具有卷曲螺旋。最后,RBG桥的外表面显示在其大部分长度上具有保守的残基,指示伴侣互动的网站,几乎所有这些都是未知的。这些发现可以为未来的细胞生物学和生化实验提供信息。
    Lipid transfer between organelles requires proteins that shield the hydrophobic portions of lipids as they cross the cytoplasm. In the last decade a new structural form of lipid transfer protein (LTP) has been found: long hydrophobic grooves made of beta-sheet that bridge between organelles at membrane contact sites. Eukaryotes have five families of bridge-like LTPs: VPS13, ATG2, SHIP164, Hobbit and Tweek. These are unified into a single superfamily through their bridges being composed of just one domain, called the repeating beta groove (RBG) domain, which builds into rod shaped multimers with a hydrophobic-lined groove and hydrophilic exterior. Here, sequences and predicted structures of the RBG superfamily were analyzed in depth. Phylogenetics showed that the last eukaryotic common ancestor contained all five RBG proteins, with duplicated VPS13s. The current set of long RBG protein appears to have arisen in even earlier ancestors from shorter forms with 4 RBG domains. The extreme ends of most RBG proteins have amphipathic helices that might be an adaptation for direct or indirect bilayer interaction, although this has yet to be tested. The one exception to this is the C-terminus of SHIP164, which instead has a coiled-coil. Finally, the exterior surfaces of the RBG bridges are shown to have conserved residues along most of their length, indicating sites for partner interactions almost all of which are unknown. These findings can inform future cell biological and biochemical experiments.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    造礁珊瑚在海洋生态系统中发挥着重要作用,从结构角度分析它们的蛋白质组将对探索它们的生物学产生积极影响。在这里,我们将质谱与新发表的ColabFold集成在一起,以获得主要的造礁珊瑚的数字结构蛋白质组。
    在Acroporamuricata中的8,382个同源蛋白中,Montiporafoliosa,并鉴定出疣状孢子虫,8,166在大约4,060个GPU小时的计算后收到了预测的结构。得到的数据集覆盖了83.6%的残基,具有自信的预测,而25.9%的人有很高的信心。
    我们的工作为珊瑚研究提供了有洞察力的预测,证实了ColabFold在实践中的可靠性,并有望成为结构蛋白质组学即将到来的高通量时代的参考案例。
    Reef-building corals play an important role in the marine ecosystem, and analyzing their proteomes from a structural perspective will exert positive effects on exploring their biology. Here we integrated mass spectrometry with newly published ColabFold to obtain digital structural proteomes of dominant reef-building corals.
    Of the 8,382 homologous proteins in Acropora muricata, Montipora foliosa, and Pocillopora verrucosa identified, 8,166 received predicted structures after about 4,060 GPU hours of computation. The resulting dataset covers 83.6% of residues with a confident prediction, while 25.9% have very high confidence.
    Our work provides insight-worthy predictions for coral research, confirms the reliability of ColabFold in practice, and is expected to be a reference case in the impending high-throughput era of structural proteomics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号