secondary structure

二级结构
  • 文章类型: Journal Article
    来自蛋白质语言模型(pLM)的嵌入正在取代来自多个序列比对(MSA)的进化信息,作为蛋白质预测的最成功输入。这是因为嵌入捕获了进化信息吗?我们测试了各种方法,以明确地将进化信息纳入各种蛋白质预测任务的嵌入中。而较旧的pLM(SeqVec,ProtBert)通过MSA显著改进,最近的pLMProtT5没有受益。对于大多数任务,基于pLM的方法优于基于MSA的方法,两者的结合甚至降低了某些(内在障碍)的性能。我们强调了基于pLM的方法的有效性,并发现集成MSA的好处有限。
    Embeddings from protein Language Models (pLMs) are replacing evolutionary information from multiple sequence alignments (MSAs) as the most successful input for protein prediction. Is this because embeddings capture evolutionary information? We tested various approaches to explicitly incorporate evolutionary information into embeddings on various protein prediction tasks. While older pLMs (SeqVec, ProtBert) significantly improved through MSAs, the more recent pLM ProtT5 did not benefit. For most tasks, pLM-based outperformed MSA-based methods, and the combination of both even decreased performance for some (intrinsic disorder). We highlight the effectiveness of pLM-based methods and find limited benefits from integrating MSAs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    贝塔匝数,其中蛋白质骨架在四个氨基酸残基上突然改变方向,是α螺旋和β折叠后最常见的蛋白质二级结构类型,并发挥关键的结构和功能作用。以前的工作已经产生了多个精度级别的转弯几何分类系统,但是这些在骨干二面角(Ramachandran)空间中运行,并且缺少局部欧几里德空间坐标系和转弯的结构对齐,或转弯骨干形状的任何系统欧几里得空间表征,给可视化带来了挑战,比较和分析了各种转弯构造,转弯的设计以及包含它们的结构。这项工作得出了一个转弯局部坐标系,该坐标系隐式对齐转弯,以及一组几何描述符,这些描述符表征了转弯的整体BB形状,并描述了现有系统未明确捕获的结构变化模式。通过证明描述符值与β转角H键的静电能之间的明确关系,这些模式被证明是有意义的,关键侧链图案的过度表示,以及转弯的结构背景。几何转向描述符补充了Ramachandran空间分类,它们可用于选择与特定侧链相互作用或上下文兼容的转向结构。潜在的应用包括蛋白质设计和其他任务,其中增强的欧几里德空间表征可以提高理解或性能。基于网络的工具探索,MapTurns,和ProfileTurn,可在www。betaturn.com,合并转向局部坐标和转向描述符,并展示它们的效用。
    Beta turns, in which the protein backbone abruptly changes direction over four amino acid residues, are the most common type of protein secondary structure after alpha helices and beta sheets and play key structural and functional roles. Previous work has produced classification systems for turn geometry at multiple levels of precision, but these operate in backbone dihedral-angle (Ramachandran) space, and the absence of a local Euclidean-space coordinate system and structural alignment for turns, or of any systematic Euclidean-space characterization of turn backbone shape, presents challenges for the visualization, comparison and analysis of the wide range of turn conformations and the design of turns and the structures that incorporate them. This work derives a turn-local coordinate system that implicitly aligns turns, together with a set of geometric descriptors that characterize the bulk BB shapes of turns and describe modes of structural variation not explicitly captured by existing systems. These modes are shown to be meaningful by the demonstration of clear relationships between descriptor values and the electrostatic energy of the beta-turn H-bond, the overrepresentations of key side-chain motifs, and the structural contexts of turns. Geometric turn descriptors complement Ramachandran-space classifications, and they can be used to select turn structures for compatibility with particular side-chain interactions or contexts. Potential applications include protein design and other tasks in which an enhanced Euclidean-space characterization of turns may improve understanding or performance. The web-based tools ExploreTurns, MapTurns, and ProfileTurn, available at www.betaturn.com, incorporate turn-local coordinates and turn descriptors and demonstrate their utility.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    通过MD模拟研究β-折叠比率和链长度对全β蛋白的影响。在宽温度范围内模拟由具有各种β-折叠比率或不同数量的重复单元的不同重复单元组成的蛋白质样品。模拟结果表明,β-折叠二级结构比例较高的蛋白质获得了较小的回转半径,其具有较低的非结合能量,蛋白质内具有更多的HBs。均方根偏差(RMSD)和均方根波动(RMSF)均随温度的升高而增大,尤其是在链条较长的情况下。根据重复的二级结构也显示了可见周期。RMSF的几个最小值位于参与β-折叠的Cα原子的骨架上,表明它是一种稳定的二级结构。我们还得出结论,具有短链或较低比例β-折叠的蛋白质可以很容易地将其定向和紧凑的结构转化为其他结构,如随机线圈,turns,甚至是α-螺旋。这些结果阐明了从蛋白质的初级水平到3D结构以及潜在预测的蛋白质折叠的关系。
    The effect of β-sheet ratio and chain length on all-β proteins was investigated by MD simulations. Protein samples composed of different repeating units with various β-sheet ratios or a different number of repeating units were simulated under a broad temperature range. The simulation results show that the smaller radius of gyration was achieved by the protein with the higher proportion of β-sheet secondary structure, which had the lower nonbonded energy with more HBs within the protein. The root mean square deviation (RMSD) and the root mean square fluctuation (RMSF) both increased with temperature, especially in the case of a longer chain. The visible period was also shown according to the repeated secondary structure. Several minimum values of RMSF were located on the skeleton of Cα atoms participating in the β-sheet, indicating that it is a kind of stable secondary structure. We also concluded that proteins with a short chain or a lower ratio of β-sheet could easily transform their oriented and compact structures to other ones, such as random coils, turns, and even α-helices. These results clarified the relationship from the primary level to the 3D structure of proteins and potentially predicted protein folding.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    醇溶蛋白,来自植物的蛋白质,在制药和食品科学中有广泛的应用。九灶是白酒酿造工业的副产品,是醇溶蛋白的重要来源。尽管它很重要,关于白酒九灶(PBJ)醇溶蛋白提取技术和性质的知识仍然有限。反胶束(RM)提取为纯化蛋白质提供了一种有效且具有成本效益的方法。在本研究中,醇溶蛋白是使用RM提取方法从白酒九灶中提取的,随后对其二级结构进行了表征,形态学,和粒度分布。我们的发现表明,使用进一步的RM提取提取的纯化的醇溶蛋白具有更高的α-螺旋含量(13.25%),形成大规模的蛋白质网络,与NaOH-乙醇法获得的粗谷醇溶蛋白相比,粒径分布更窄。这项研究表明,RMs提取在从酿造工业副产品中提取醇溶蛋白方面具有潜在的应用,提供一种环保的白酒九灶回收方式。
    Prolamins, proteins derived from plants, have extensive applications in pharmaceutics and food science. Jiuzao is a byproduct of the Baijiu brewing industry, and is a great source of prolamin. Despite its importance, knowledge regarding the extraction techniques and the properties of prolamin derived from Baijiu Jiuzao (PBJ) remains limited. Reverse micelles (RMs) extraction offers an efficient and cost-effective method for purifying proteins. In the present study, prolamin was extracted from Baijiu Jiuzao using RMs extraction and subsequently characterized in terms of its secondary structure, morphology, and particle size distribution. Our findings indicate that the purified prolamin extracted using further RMs extraction possessed higher α-helix content (+13.25%), forming a large-scale protein network, and narrower particle size distributions compared to the crude prolamin obtained by NaOH-ethanol method. This research suggests that RMs extraction has potential applications in extracting prolamin from brewing industry byproducts, offering an environmentally friendly approach to Baijiu Jiuzao recycling.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    生物活性肽中的N-加帽(N-cap)和C-加帽(C-cap),包括特定的氨基酸或非常规的基团基序,已显示通过干扰肽的二级结构来调节针对药理学靶标的活性,从而产生不寻常的脚手架。已显示在线性肽中插入加帽基序可通过降低其对蛋白水解裂解的敏感性来防止肽降解。线性肽中的N-或C-加帽区域中的不寻常基团对一些官能团的替换已经导致具有改进的二级结构和增强的活性的优化的肽变体。此外,一些必需的氨基酸残基,当放置在抗菌肽(AMP)加帽区域时,能够络合金属,如Cu2+,Ni2+,和Zn2+,产生了被称为金属AMP的家族,能够提高抗菌效果,以及其他活动。因此,这篇综述提出并讨论了在AMP中创建N-和C-cap基序的不同策略,旨在微调这类抗菌药物。
    N-capping (N-cap) and C-capping (C-cap) in biologically active peptides, including specific amino acids or unconventional group motifs, have been shown to modulate activity against pharmacological targets by interfering with the peptide\'s secondary structure, thus generating unusual scaffolds. The insertion of capping motifs in linear peptides has been shown to prevent peptide degradation by reducing its susceptibility to proteolytic cleavage, and the replacement of some functional groups by unusual groups in N- or C-capping regions in linear peptides has led to optimized peptide variants with improved secondary structure and enhanced activity. Furthermore, some essential amino acid residues that, when placed in antimicrobial peptide (AMP) capping regions, are capable of complexing metals such as Cu2+, Ni2+, and Zn2+, give rise to the family known as metallo-AMPs, which are capable of boosting antimicrobial efficacy, as well as other activities. Therefore, this review presents and discusses the different strategies for creating N- and C-cap motifs in AMPs, aiming at fine-tuning this class of antimicrobials.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蛋白质与聚合物的缀合通过产生新的大分子提供了巨大的生物技术潜力。本文介绍了与具有不同分子量的线性可生物降解聚磷酸酯聚合物缀合的麦芽糖结合蛋白(MBP)的结构特性的实验发现。我们研究了同位素对蛋白质和聚合物的影响。圆二色性和荧光光谱以及小角度中子散射表明,共轭过程使蛋白质不稳定,对二级结构的影响大于三级结构,即使在室温下,并且MBP中两个结构域的存在可能导致其观察到的不稳定性。值得注意的是,解折叠温度在天然MBP和缀合物之间不同。特别是,这项研究揭示了复杂的相互作用的因素,如影响蛋白质稳定性的氘代和构象变化的结合过程。在MBP蛋白的情况下,全氘代会影响氢键网络和疏水相互作用。蛋白质的全氘代会影响氢键网络和疏水相互作用。这在氘代MBP蛋白的热稳定性下降中很明显,在共轭中,尤其是高分子聚合物。
    The conjugation of proteins with polymers offers immense biotechnological potential by creating novel macromolecules. This article presents experimental findings on the structural properties of maltose-binding protein (MBP) conjugated with linear biodegradable polyphosphoester polymers with different molecular weights. We studied isotopic effects on both proteins and polymers. Circular dichroism and fluorescence spectroscopy and small-angle neutron scattering reveal that the conjugation process destabilizes the protein, affecting the secondary more than the tertiary structure, even at room temperature, and that the presence of two domains in the MBP may contribute to its observed instability. Notably, unfolding temperatures differ between native MBP and the conjugates. In particular, this study sheds light on the complex interplay of factors such as the deuteration influencing protein stability and conformational changes in the conjugation processes. The perdeuteration influences the hydrogen bond network and hydrophobic interactions in the case of the MBP protein. The perdeuteration of the protein influences the hydrogen bond network and hydrophobic interactions. This is evident in the decreased thermal stability of deuterated MBP protein, in the conjugate, especially with high-molecular-mass polymers.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    具有抗微生物活性或蛋白酶抑制活性的肽是补充传统抗生素或癌症化疗的潜在候选物。然而,许多肽的潜力受到诸如细胞毒性或水解敏感性等缺点的限制。因此,修饰有希望的肽结构的策略可能是开发更有希望的临床候选药物的有效方法。在这项研究中,成熟肽OSTI-1949,一种来自Odorranaschmackeri的Kunitz型抑制剂,并成功合成了四个设计的类似物。与母体肽相反,类似物显示出令人印象深刻的多功能,包括抗菌,抗癌,和胰蛋白酶抑制活性。在安全方面,在最高测试浓度下,溶血活性没有明显变化,和类似物OSTI-2461显示对癌细胞系的活性增加,而对正常细胞(HaCaT)无细胞毒性。总之,通过天然Kunitz型肽的结构修饰,类似物的生物活性得到改善,同时保持低细胞毒性。通过形成人工α-螺旋和β-折叠结构增强螺旋度的策略为开发用于临床治疗的原始生物活性肽提供了有希望的方法。
    Peptides with antimicrobial activity or protease inhibitory activity are potential candidates to supplement traditional antibiotics or cancer chemotherapies. However, the potential of many peptides are limited by drawbacks such as cytotoxicity or susceptibility to hydrolysis. Therefore, strategies to modify the structure of promising peptides may represent an effective approach for developing more promising clinical candidates. In this study, the mature peptide OSTI-1949, a Kunitz-type inhibitor from Odorrana schmackeri, and four designed analogues were successfully synthesised. In contrast to the parent peptide, the analogues showed impressive multi-functionality including antimicrobial, anticancer, and trypsin inhibitory activities. In terms of safety, there were no obvious changes observed in the haemolytic activity at the highest tested concentration, and the analogue OSTI-2461 showed an increase in activity against cancer cell lines without cytotoxicity to normal cells (HaCaT). In summary, through structural modification of a natural Kunitz-type peptide, the biological activity of analogues was improved whilst retaining low cytotoxicity. The strategy of helicity enhancement by forming an artificial α-helix and ß-sheet structure provides a promising way to develop original bioactive peptides for clinical therapeutics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    RAMOSA1(RA1)是一种Cys2-His2型(C2H2)锌指转录因子,可控制植物分生组织的命运和身份,在玉米驯化中起着重要作用。尽管它很重要,RA1的起源未知,植物的进化只被部分理解。在本文中,我们基于来自48个胚胎植物物种的73个氨基酸序列,提出了一个解决良好的系统发育。恢复的树拓扑表明,在草的进化过程中,RA1来自两个连续的SUPERMAN复制,产生三个不同的草序列谱系:RA1样A,RA1-likeB,和RA1;然而,这些副本中的大多数具有未知的功能。我们的发现表明,尽管缺乏传统的核定位信号,但RA1和RA1样在细胞核中起作用。这里,我们报告说,副本的编码区域多样化,有了它,它们的蛋白质结构,提示DNA结合和蛋白质-蛋白质相互作用的不同模式。此外,每个保留的拷贝沿着其启动子区域多样化的调控元件,表明它们在上游监管方面的差异。一起来看,证据表明,草类中的RA1和RA1样基因家族通过基因复制实现亚功能化和新功能化.
    RAMOSA1 (RA1) is a Cys2-His2-type (C2H2) zinc finger transcription factor that controls plant meristem fate and identity and has played an important role in maize domestication. Despite its importance, the origin of RA1 is unknown, and the evolution in plants is only partially understood. In this paper, we present a well-resolved phylogeny based on 73 amino acid sequences from 48 embryophyte species. The recovered tree topology indicates that, during grass evolution, RA1 arose from two consecutive SUPERMAN duplications, resulting in three distinct grass sequence lineages: RA1-like A, RA1-like B, and RA1; however, most of these copies have unknown functions. Our findings indicate that RA1 and RA1-like play roles in the nucleus despite lacking a traditional nuclear localization signal. Here, we report that copies diversified their coding region and, with it, their protein structure, suggesting different patterns of DNA binding and protein-protein interaction. In addition, each of the retained copies diversified regulatory elements along their promoter regions, indicating differences in their upstream regulation. Taken together, the evidence indicates that the RA1 and RA1-like gene families in grasses underwent subfunctionalization and neofunctionalization enabled by gene duplication.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蛋白质-蛋白质和蛋白质-水氢键相互作用在蛋白质在折叠或展开过程中通过过渡态的方式中起着至关重要的作用。但是分子动力学(MD)模拟中大量的这些相互作用使它们难以分析。这里,我们引入状态空间表示和相关的“稀有性”度量来识别和量化过渡状态通过(传输)事件。将此表示应用于捕获GTTWW域的多个折叠和展开事件的长MD模拟轨迹,一种小蛋白质,通常用作折叠过程的模型,我们确定了三个过渡类别:高速公路(更快),曲折(较慢),和模糊(中间)。我们开发了数据超声和可视化工具来分析氢键动力学,during,在这些过渡事件之后。通过这些工具,我们能够确定与\"高速公路\"、\"曲折\"和\"模糊\"过渡相关的特征性氢键模式,并设计可以直接从数据中识别这些相同的折叠途径和关键的蛋白质-水相互作用的算法。高度协同的氢键可以减慢或加速运输。此外,对WW域表面的蛋白质-水氢键动力学的分析表明,氢键寿命从折叠到未折叠的构象增加,其中模糊的跃迁为异常值。总之,氢键动力学为过境的异质性提供了一个直接的窗口,由于复杂的能源景观,持续时间可能变化很大(10倍)。
    Protein-protein and protein-water hydrogen bonding interactions play essential roles in the way a protein passes through the transition state during folding or unfolding, but the large number of these interactions in molecular dynamics (MD) simulations makes them difficult to analyze. Here, we introduce a state space representation and associated \"rarity\" measure to identify and quantify transition state passage (transit) events. Applying this representation to a long MD simulation trajectory that captured multiple folding and unfolding events of the GTT WW domain, a small protein often used as a model for the folding process, we identified three transition categories: Highway (faster), Meander (slower), and Ambiguous (intermediate). We developed data sonification and visualization tools to analyze hydrogen bond dynamics before, during, and after these transition events. By means of these tools, we were able to identify characteristic hydrogen bonding patterns associated with \"Highway\" versus \"Meander\" versus \"Ambiguous\" transitions and to design algorithms that can identify these same folding pathways and critical protein-water interactions directly from the data. Highly cooperative hydrogen bonding can either slow down or speed up transit. Furthermore, an analysis of protein-water hydrogen bond dynamics at the surface of WW domain shows an increase in hydrogen bond lifetime from folded to unfolded conformations with Ambiguous transitions as an outlier. In summary, hydrogen bond dynamics provide a direct window into the heterogeneity of transits, which can vary widely in duration (by a factor of 10) due to a complex energy landscape.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分析导致转录激活域功能的因素仍然是一项关键但具有挑战性的任务,因为它们的序列具有显著的多样性和内在的无序性质。几乎所有旨在预测激活域的现有方法都涉及传统的机器学习方法,如逻辑回归,无法捕获数据或普通卷积神经网络中的复杂模式,并且在结构特征的探索方面受到限制。然而,在检测激活域的结构特性方面有巨大的潜力,以及研究序列中残基特征之间复杂关系的机会。为了解决这些问题,我们利用了图神经网络的强大功能,它可以以节点和边的形式表示结构数据,允许节点之间交换信息。我们已经尝试了两种图形公式,一个将残基作为节点,另一个将原子分配为节点。还开发了逻辑回归模型来分析特征重要性。对于所有的模型,对几个特征组合进行了实验。具有氨基酸类型的残基级GNN模型,残基位置,酸性/碱性/芳香性和二级结构特征组合给出了性能最佳的模型,F1评分和AUROC为97.9%,当应用于我们使用的数据集时,分别为71%和97.1%,优于文献中的其他现有方法。在分析的其他基于结构的特征中,螺旋的两亲性也被证明是分类的重要特征。Logistic回归结果表明,使序列具有功能的最主要特征是序列中不同类型氨基酸的频率。我们的结果一致表明,功能序列具有更多的酸性和芳香族残基,而碱性残基在非功能序列中更多。
    Analysis of factors that lead to the functionality of transcriptional activation domains remains a crucial and yet challenging task owing to the significant diversity in their sequences and their intrinsically disordered nature. Almost all existing methods that have aimed to predict activation domains have involved traditional machine learning approaches, such as logistic regression, that are unable to capture complex patterns in data or plain convolutional neural networks and have been limited in exploration of structural features. However, there is a tremendous potential in the inspection of the structural properties of activation domains, and an opportunity to investigate complex relationships between features of residues in the sequence. To address these, we have utilized the power of graph neural networks which can represent structural data in the form of nodes and edges, allowing nodes to exchange information among themselves. We have experimented with two kinds of graph formulations, one involving residues as nodes and the other assigning atoms to be the nodes. A logistic regression model was also developed to analyze feature importance. For all the models, several feature combinations were experimented with. The residue-level GNN model with amino acid type, residue position, acidic/basic/aromatic property and secondary structure feature combination gave the best performing model with accuracy, F1 score and AUROC of 97.9%, 71% and 97.1% respectively which outperformed other existing methods in the literature when applied on the dataset we used. Among the other structure-based features that were analyzed, the amphipathic property of helices also proved to be an important feature for classification. Logistic regression results showed that the most dominant feature that makes a sequence functional is the frequency of different types of amino acids in the sequence. Our results consistent have shown that functional sequences have more acidic and aromatic residues whereas basic residues are seen more in non-functional sequences.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号