protein quaternary structure

蛋白质四级结构
  • 文章类型: Journal Article
    蛋白质复合物四级结构模型的质量预测,在缺乏其真实结构的情况下,被称为模型精度估计(EMA)。EMA可用于对预测的蛋白质复合物结构进行排名,并在生物医学研究中适当地使用它们,例如蛋白质-蛋白质相互作用研究,蛋白质设计,和药物发现。随着更准确的蛋白质复合物(多聚体)预测工具的出现,例如AlphaFold2-Multimer和ESFold,蛋白质复合物结构的准确性估计引起了越来越多的关注。已经开发了许多深度学习方法来解决这个问题;然而,明显缺乏对这些方法的全面概述,以促进未来的发展。解决这个差距,我们回顾了过去几年开发的蛋白质复合物结构的深度学习EMA方法,分析他们的方法,数据和特征构造。我们还提供了一些潜在的新进展的前瞻性总结,以进一步提高EMA方法的准确性。
    The quality prediction of quaternary structure models of a protein complex, in the absence of its true structure, is known as the Estimation of Model Accuracy (EMA). EMA is useful for ranking predicted protein complex structures and using them appropriately in biomedical research, such as protein-protein interaction studies, protein design, and drug discovery. With the advent of more accurate protein complex (multimer) prediction tools, such as AlphaFold2-Multimer and ESMFold, the estimation of the accuracy of protein complex structures has attracted increasing attention. Many deep learning methods have been developed to tackle this problem; however, there is a noticeable absence of a comprehensive overview of these methods to facilitate future development. Addressing this gap, we present a review of deep learning EMA methods for protein complex structures developed in the past several years, analyzing their methodologies, data and feature construction. We also provide a prospective summary of some potential new developments for further improving the accuracy of the EMA methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Förster共振能量转移(FRET)光谱法是一种用于从FRET效率的分布确定蛋白质寡聚体的四级结构的方法,该FRET效率的分布是从表达目标蛋白质的细胞的荧光图像的像素中提取的。FRET光谱法方案目前依赖于从基于强度的实验获得光谱分辨的荧光数据。另一种成像方法,荧光寿命成像显微镜(FLIM),是从由FRET引起的供体的荧光寿命的减少来计算图像中的每个像素的FRET效率的广泛使用的替代方案。在具有不同比例的供体和受体的寡聚体的FLIM研究中,供体寿命可以通过将时间分辨的荧光衰减数据与预定数量的指数衰减曲线进行拟合来获得。然而,这需要了解样品中荧光蛋白的数量和相对排列,这正是FRET光谱法的目标,因此产生了一个难题,该难题阻止了FLIM仪器的用户执行FRET光谱法。这里,我们描述了通过使用基于积分的方法从荧光衰减曲线计算FRET效率,在时间分辨荧光显微镜上实现FRET光谱法的尝试。这种方法,我们称之为时间集成FRET(或tiFRET),在活细胞的细胞质中表达的寡聚荧光蛋白构建体进行测试。目前的结果表明,tiFRET是实现FRET光谱法的一种有前途的方法,并建议对仪器进行潜在的调整,以提高此类研究的准确性和分辨率。
    Förster resonance energy transfer (FRET) spectrometry is a method for determining the quaternary structure of protein oligomers from distributions of FRET efficiencies that are drawn from pixels of fluorescence images of cells expressing the proteins of interest. FRET spectrometry protocols currently rely on obtaining spectrally resolved fluorescence data from intensity-based experiments. Another imaging method, fluorescence lifetime imaging microscopy (FLIM), is a widely used alternative to compute FRET efficiencies for each pixel in an image from the reduction of the fluorescence lifetime of the donors caused by FRET. In FLIM studies of oligomers with different proportions of donors and acceptors, the donor lifetimes may be obtained by fitting the temporally resolved fluorescence decay data with a predetermined number of exponential decay curves. However, this requires knowledge of the number and the relative arrangement of the fluorescent proteins in the sample, which is precisely the goal of FRET spectrometry, thus creating a conundrum that has prevented users of FLIM instruments from performing FRET spectrometry. Here, we describe an attempt to implement FRET spectrometry on temporally resolved fluorescence microscopes by using an integration-based method of computing the FRET efficiency from fluorescence decay curves. This method, which we dubbed time-integrated FRET (or tiFRET), was tested on oligomeric fluorescent protein constructs expressed in the cytoplasm of living cells. The present results show that tiFRET is a promising way of implementing FRET spectrometry and suggest potential instrument adjustments for increasing accuracy and resolution in this kind of study.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    晶格中生理相关的四元结构(QSs)的鉴定具有挑战性。为了预测特定QS的生理相关性,QSalign搜索其中亚基在相同几何结构中相互作用的同源结构。这种方法被证明是准确的,但仅限于蛋白质数据库(PDB)中已经存在的结构。这里,我们介绍一个网络服务器(www.QSalign.org)允许用户向QSalign管道提交他们选择的同寡聚体结构。给定用户上传的结构,基于序列相似性和PFAM域结构,提取序列并用于搜索同源物。如果在同源物和用户上传的QS之间检测到结构保守,推断生理相关性。Web服务器还使用PISA生成替代QS,并以与提交的查询相同的方式处理它们以扩大预测。结果页还显示了查询的蛋白质家族中的代表性QSs,如果没有检测到QS保守性或蛋白质出现单体,这是有益的。这些代表性QS还可以用作同源性建模的起点。
    The identification of physiologically relevant quaternary structures (QSs) in crystal lattices is challenging. To predict the physiological relevance of a particular QS, QSalign searches for homologous structures in which subunits interact in the same geometry. This approach proved accurate but was limited to structures already present in the Protein Data Bank (PDB). Here, we introduce a webserver (www.QSalign.org) allowing users to submit homo-oligomeric structures of their choice to the QSalign pipeline. Given a user-uploaded structure, the sequence is extracted and used to search homologs based on sequence similarity and PFAM domain architecture. If structural conservation is detected between a homolog and the user-uploaded QS, physiological relevance is inferred. The web server also generates alternative QSs with PISA and processes them the same way as the query submitted to widen the predictions. The result page also shows representative QSs in the protein family of the query, which is informative if no QS conservation was detected or if the protein appears monomeric. These representative QSs can also serve as a starting point for homology modeling.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    对生物分子机制和疾病的准确理解需要有关蛋白质四级结构(QS)的信息。从晶体学数据推断QS信息的关键挑战是将生物界面与偶然的晶体堆积接触区分开。这里,我们使用QS在同源物之间的保守性来推断异源寡聚体的生物学相关性。我们比较了异质低聚物的结构和组成,这让我们可以注释7,810个生理相关的复合物,1,060个可能的错误,和1,432个,具有有关亚基化学计量和组成的比较信息。不包括免疫球蛋白,这些注释涵盖了PDB中超过51%的杂低聚物。我们整理了577个异寡聚复合物的数据集以对这些注释进行基准测试,显示准确率>94%。当同源性信息不可用时,我们比较了不同存储库的QS(PDB,PISA,和EPPIC)得出置信度估计。这项工作提供了高质量的注释以及大型的异质装配基准数据集。
    An accurate understanding of biomolecular mechanisms and diseases requires information on protein quaternary structure (QS). A critical challenge in inferring QS information from crystallography data is distinguishing biological interfaces from fortuitous crystal-packing contacts. Here, we employ QS conservation across homologs to infer the biological relevance of hetero-oligomers. We compare the structures and compositions of hetero-oligomers, which allow us to annotate 7,810 complexes as physiologically relevant, 1,060 as likely errors, and 1,432 with comparative information on subunit stoichiometry and composition. Excluding immunoglobulins, these annotations encompass over 51% of hetero-oligomers in the PDB. We curate a dataset of 577 hetero-oligomeric complexes to benchmark these annotations, which reveals an accuracy >94%. When homology information is not available, we compare QS across repositories (PDB, PISA, and EPPIC) to derive confidence estimates. This work provides high-quality annotations along with a large benchmark dataset of hetero-assemblies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    已经确定了来自盐藻Halobacterium的Lsm蛋白的结构和RNA结合特性。该蛋白的一个显著特征是存在连接β3和β4链的短L4环。由于细菌Lsm蛋白(也称为Hfq蛋白)具有短的L4环并形成六聚体,而古细菌Lsm蛋白(SmAP)具有长L4环并形成七聚体,已经提出L4环的长度可能影响Lsm蛋白的四级结构。此外,L4环覆盖对应于Hfq中一个RNA结合位点的SmAP区域,从而影响蛋白质的RNA结合特性。我们的结果显示来自H.salinarum的SmAP形成七聚体并且具有与具有长L4环的同源蛋白相同的RNA结合特性。因此,L4的长度不控制蛋白质颗粒中单体的数量,并且不影响Lsm蛋白质的RNA结合特性。
    The structure and the RNA-binding properties of the Lsm protein from Halobacterium salinarum have been determined. A distinctive feature of this protein is the presence of a short L4 loop connecting the β3 and β4 strands. Since bacterial Lsm proteins (also called Hfq proteins) have a short L4 loop and form hexamers, whereas archaeal Lsm proteins (SmAP) have a long L4 loop and form heptamers, it has been suggested that the length of the L4 loop may affect the quaternary structure of Lsm proteins. Moreover, the L4 loop covers the region of SmAP corresponding to one of the RNA-binding sites in Hfq, and thus can affect the RNA-binding properties of the protein. Our results show that the SmAP from H. salinarum forms heptamers and possesses the same RNA-binding properties as homologous proteins with the long L4 loop. Therefore, the length of the L4 does not govern the number of monomers in the protein particles and does not affect the RNA-binding properties of Lsm proteins.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    HupZ是A群链球菌血红素获取和利用途径中预期的血红素降解酶。含有C末端V5-His6标签的分离的HupZ蛋白表现出弱的血红素降解活性。这里,我们通过生化重新审视并表征了HupZ-V5-His6蛋白,诱变,蛋白质四级结构,UV-vis,EPR,和共振拉曼光谱。结果表明,三价铁血红素-蛋白质复合物没有显示预期的三价铁EPR信号,并且与HupZ结合的血红素触发了更高的寡聚状态的形成。我们发现血红素与HupZ的结合是一个依赖O2的过程。HupZ序列中的单个组氨酸残基,His111,不与铁血红素结合,也不涉及弱血红素降解活性。由于血红素的缓慢结合以及新发现的弱血红素降解活性与His6标签的关联,我们的结果不赞成血红素加氧酶的分配。总之,数据表明该蛋白质通过其His6标签结合血红素,导致血红素诱导的高阶寡聚结构和血红素堆叠。这项工作强调了在研究血红素利用蛋白期间解释实验观察时考虑外源标签的重要性。
    HupZ is an expected heme degrading enzyme in the heme acquisition and utilization pathway in Group A Streptococcus. The isolated HupZ protein containing a C-terminal V5-His6 tag exhibits a weak heme degradation activity. Here, we revisited and characterized the HupZ-V5-His6 protein via biochemical, mutagenesis, protein quaternary structure, UV-vis, EPR, and resonance Raman spectroscopies. The results show that the ferric heme-protein complex did not display an expected ferric EPR signal and that heme binding to HupZ triggered the formation of higher oligomeric states. We found that heme binding to HupZ was an O2-dependent process. The single histidine residue in the HupZ sequence, His111, did not bind to the ferric heme, nor was it involved with the weak heme-degradation activity. Our results do not favor the heme oxygenase assignment because of the slow binding of heme and the newly discovered association of the weak heme degradation activity with the His6-tag. Altogether, the data suggest that the protein binds heme by its His6-tag, resulting in a heme-induced higher-order oligomeric structure and heme stacking. This work emphasizes the importance of considering exogenous tags when interpreting experimental observations during the study of heme utilization proteins.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    BACKGROUND: The information of quaternary structure attributes of proteins is very important because it is closely related to the biological functions of proteins. With the rapid development of new generation sequencing technology, we are facing a challenge: how to automatically identify the four-level attributes of new polypeptide chains according to their sequence information (i.e., whether they are formed as just as a monomer, or as a hetero-oligomer, or a homo-oligomer).
    OBJECTIVE: In this article, our goal is to find a new way to represent protein sequences, thereby improving the prediction rate of protein quaternary structure.
    METHODS: In this article, we developed a prediction system for protein quaternary structural type in which a protein sequence was expressed by combining the Pfam functional-domain and gene ontology. turn protein features into digital sequences, and complete the prediction of quaternary structure through specific machine learning algorithms and verification algorithm.
    RESULTS: Our data set contains 5495 protein samples. Through the method provided in this paper, we classify proteins into monomer, or as a hetero-oligomer, or a homo-oligomer, and the prediction rate is 74.38%, which is 3.24% higher than that of previous studies. Through this new feature extraction method, we can further classify the four-level structure of proteins, and the results are also correspondingly improved.
    CONCLUSIONS: After the applying the new prediction system, compared with the previous results, we have successfully improved the prediction rate. We have reason to believe that the feature extraction method in this paper has better practicability and can be used as a reference for other protein classification problems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    精确了解蛋白质的四级结构对于阐明其功能和进化至关重要。我们关于第四纪结构的知识的主要部分是从X射线晶体学数据推断出来的,但是这个推理过程是困难和容易出错的。困难在于区分偶然的蛋白质接触,组成蛋白质晶体的晶格,来自天然细胞环境中存在的生物蛋白质接触。这里,我们回顾了旨在区分两种接触类型的方法,并描述了下载蛋白质四级结构信息和识别高置信度四级结构的资源。使用四级结构的高置信度数据集对于结构分析至关重要,功能,和蛋白质的进化特性。
    A precise knowledge of the quaternary structure of proteins is essential to illuminate both their function and their evolution. The major part of our knowledge on quaternary structure is inferred from X-ray crystallography data, but this inference process is hard and error-prone. The difficulty lies in discriminating fortuitous protein contacts, which make up the lattice of protein crystals, from biological protein contacts that exist in the native cellular environment. Here, we review methods devised to discriminate between both types of contacts and describe resources for downloading protein quaternary structure information and identifying high-confidence quaternary structures. The use of high-confidence datasets of quaternary structures will be critical for the analysis of structural, functional, and evolutionary properties of proteins.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Protein quaternary structure complex is also known as a multimer, which plays an important role in a cell. The dimer structure of transcription factors is involved in gene regulation, but the trimer structure of virus-infection-associated glycoproteins is related to the human immunodeficiency virus. The classification of the protein quaternary structure complex for the post-genome era of proteomics research will be of great help. Classification systems among protein quaternary structures have not been widely developed. Therefore, we designed the architecture of a two-layer machine learning technique in this study, and developed the classification system PClass. The protein quaternary structure of the complex is divided into five categories, namely, monomer, dimer, trimer, tetramer, and other subunit classes. In the framework of the bootstrap method with a support vector machine, we propose a new model selection method. Each type of complex is classified based on sequences, entropy, and accessible surface area, thereby generating a plurality of feature modules. Subsequently, the optimal model of effectiveness is selected as each kind of complex feature module. In this stage, the optimal performance can reach as high as 70% of Matthews correlation coefficient (MCC). The second layer of construction combines the first-layer module to integrate mechanisms and the use of six machine learning methods to improve the prediction performance. This system can be improved over 10% in MCC. Finally, we analyzed the performance of our classification system using transcription factors in dimer structure and virus-infection-associated glycoprotein in trimer structure. PClass is available via a web interface at http://predictor.nchu.edu.tw/PClass/.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    The mapping between biological genotypes and phenotypes is central to the study of biological evolution. Here, we introduce a rich, intuitive and biologically realistic genotype-phenotype (GP) map that serves as a model of self-assembling biological structures, such as protein complexes, and remains computationally and analytically tractable. Our GP map arises naturally from the self-assembly of polyomino structures on a two-dimensional lattice and exhibits a number of properties: redundancy (genotypes vastly outnumber phenotypes), phenotype bias (genotypic redundancy varies greatly between phenotypes), genotype component disconnectivity (phenotypes consist of disconnected mutational networks) and shape space covering (most phenotypes can be reached in a small number of mutations). We also show that the mutational robustness of phenotypes scales very roughly logarithmically with phenotype redundancy and is positively correlated with phenotypic evolvability. Although our GP map describes the assembly of disconnected objects, it shares many properties with other popular GP maps for connected units, such as models for RNA secondary structure or the hydrophobic-polar (HP) lattice model for protein tertiary structure. The remarkable fact that these important properties similarly emerge from such different models suggests the possibility that universal features underlie a much wider class of biologically realistic GP maps.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号