Amino Acid Sequence

氨基酸序列
  • 文章类型: Journal Article
    幼体激素受体基因的潜在作用(耐甲氧烯,Met)在七叶球藻(鞘翅目:球藻)(鞘翅目:球藻)的繁殖中,通过克隆进行了调查,通过定量实时PCR分析表达谱,并通过RNA干扰(RNAi)。CsMet编码1518bp的开放阅读框,预测的蛋白质产物为505个氨基酸;后者在氨基酸残基30-83和102-175处包含2个Per-Arnt-Sim重复序列。CsMet在不同的C.septempintunctata幼虫发育阶段表达,在三龄中表达最高。CsMet在成年女性中的表达从20到30d逐渐增加,25和30d时的表达水平显着高于1-15d时的水平。CsMet在20d男性成年人中的表达显着高于1-15d男性。CsMet在男性和女性成年人的脂肪体组织中的表达水平显着高于头部中的表达。胸部,和生殖系统。CsMet-dsRNA注射后5天和10天,CsMet表达比对照组显著降低了75.05%和58.38%,分别。注射CsMet-dsRNA的C.septempunctata中的卵巢发育和卵黄发生显着延迟,并且产生了较少的成熟卵。本研究为七叶草的大规模饲养提供了有价值的信息。
    The potential role of the juvenile hormone receptor gene (methoprene-tolerant, Met) in reproduction of Coccinella septempunctata L. (Coleoptera: Coccinellidae)(Coleoptera: Coccinellidae), was investigated by cloning, analyzing expression profiles by quantitative real-time PCR, and via RNA interference (RNAi). CsMet encoded a 1518-bp open reading frames with a predicted protein product of 505 amino acids; the latter contained 2 Per-Arnt-Sim repeat profile at amino acid residues 30-83 and 102-175. CsMet was expressed in different C. septempunctata larvae developmental stages and was most highly expressed in third instar. CsMet expression in female adults gradually increased from 20 to 30 d, and expression levels at 25 and 30 d were significantly higher than levels at 1-15 d. CsMet expression in 20-d-old male adults was significantly higher than in males aged 1-15 d. CsMet expression levels in fat body tissues of male and female adults were significantly higher than expression in the head, thorax, and reproductive system. At 5 and 10 d after CsMet-dsRNA injection, CsMet expression was significantly lower than the controls by 75.05% and 58.38%, respectively. Ovary development and vitellogenesis in C. septempunctata injected with CsMet-dsRNA were significantly delayed and fewer mature eggs were produced. This study provides valuable information for the large-scale rearing of C. septempunctata.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    一些牛抗体的重链(CDR3H)可变区(VH)的第三个互补决定区高度延伸,由48个或更多的残基组成。这些“超长”CDR3H形成β带状茎,从抗体表面突出,在其顶点处具有二硫键交联的旋钮区,该区域主导了其他CDR环的抗原相互作用。天然配对牛超龙抗体(D08)的Fab片段的结构,通过单个B细胞测序鉴定,已确定为1.6µ分辨率。通过将D08天然轻链与不相关的抗原未知超长抗体的轻链交换,结果表明,可变结构域的CDR3之间的相互作用可能会影响超长CDR3H的精确定位;然而,与其他晶体结构的比较表明,晶体堆积也是一个主要的贡献者。结论是,在平衡中,超长CDR3H环的精确定位很可能是由于晶体堆积的限制。
    The third complementary-determining regions of the heavy-chain (CDR3H) variable regions (VH) of some cattle antibodies are highly extended, consisting of 48 or more residues. These `ultralong\' CDR3Hs form β-ribbon stalks that protrude from the surface of the antibody with a disulfide cross-linked knob region at their apex that dominates antigen interactions over the other CDR loops. The structure of the Fab fragment of a naturally paired bovine ultralong antibody (D08), identified by single B-cell sequencing, has been determined to 1.6 Å resolution. By swapping the D08 native light chain with that of an unrelated antigen-unknown ultralong antibody, it is shown that interactions between the CDR3s of the variable domains potentially affect the fine positioning of the ultralong CDR3H; however, comparison with other crystallographic structures shows that crystalline packing is also a major contributor. It is concluded that, on balance, the exact positioning of ultralong CDR3H loops is most likely to be due to the constraints of crystal packing.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    主要金库蛋白(MVP)是金库复合体的主要成分,它是在大多数真核生物中发现的高度保守的核糖核蛋白复合物。先前已发现MVP或穹顶在多药抗性癌细胞中过表达,并且涉及各种细胞过程,例如细胞信号传导和先天免疫。MVP的精确功能是,然而,了解甚少,其在低等真核生物中的表达和可能的功能没有得到很好的表征。在这项研究中,我们报道大西洋鲑鱼虱子表达三个全长MVP旁系同源物(LsMVP1-3)。此外,我们扩大了搜索范围,并在其他几种蜕皮动物物种中鉴定了MVP直系同源物。LsMVP显示在各种组织中以转录物和蛋白质水平表达。此外,通过进行差速离心,证明了LsMVP组装成库的证据.发现LsMVP在水泥中高度表达,由成年雌性鲑鱼虱中的一对水泥腺产生的细胞外物质。水泥对于形成蛋串很重要,蛋串可作为胚胎发育的保护性涂层。我们的结果表明LsMVP作为分泌性水泥蛋白可能具有新的功能。LsMVP可能在结构或生殖功能中发挥作用,尽管这还需要进一步调查。
    Major vault protein (MVP) is the main component of the vault complex, which is a highly conserved ribonucleoprotein complex found in most eukaryotic organisms. MVP or vaults have previously been found to be overexpressed in multidrug-resistant cancer cells and implicated in various cellular processes such as cell signaling and innate immunity. The precise function of MVP is, however, poorly understood and its expression and probable function in lower eukaryotes are not well characterized. In this study, we report that the Atlantic salmon louse expresses three full-length MVP paralogues (LsMVP1-3). Furthermore, we extended our search and identified MVP orthologues in several other ecdysozoan species. LsMVPs were shown to be expressed in various tissues at both transcript and protein levels. In addition, evidence for LsMVP to assemble into vaults was demonstrated by performing differential centrifugation. LsMVP was found to be highly expressed in cement, an extracellular material produced by a pair of cement glands in the adult female salmon louse. Cement is important for the formation of egg strings that serve as protective coats for developing embryos. Our results imply a possible novel function of LsMVP as a secretory cement protein. LsMVP may play a role in structural or reproductive functions, although this has to be further investigated.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    分析在基于质谱的蛋白质组实验中获得的数据的主要目的是肽和蛋白质鉴定,或将串联质谱正确分配给一个氨基酸序列。将经验片段光谱与理论预测的光谱进行比较或与收集的光谱库进行匹配是通常接受的蛋白质鉴定和定义其氨基酸序列的策略。尽管这些方法被广泛使用,并且对于特征明确的模型生物或测量的蛋白质是相当有效的,它们无法检测到以前没有注释过或罕见的新肽序列。这项研究提出了PowerNovo工具,用于使用在各种类型的质量分析仪和不同的片段化技术中获得的串联质谱对蛋白质进行从头测序。PowerNovo涉及肽测序模型的集合:用于检测串联质谱中的规律性的模型,前体,和碎片离子和自然语言处理模型,它具有肽序列质量评估的功能,并有助于重建噪声序列。测试结果表明,PowerNovo的性能与广泛使用的PointNovo相当,甚至更好,DeepNovo,卡萨诺沃,和Novor包裹。此外,PowerNovo提供质谱数据的完整处理(管道)周期,在预测肽序列的同时,涉及肽组装和蛋白质推断块。
    The primary objective of analyzing the data obtained in a mass spectrometry-based proteomic experiment is peptide and protein identification, or correct assignment of the tandem mass spectrum to one amino acid sequence. Comparison of empirical fragment spectra with the theoretical predicted one or matching with the collected spectra library are commonly accepted strategies of proteins identification and defining of their amino acid sequences. Although these approaches are widely used and are appreciably efficient for the well-characterized model organisms or measured proteins, they cannot detect novel peptide sequences that have not been previously annotated or are rare. This study presents PowerNovo tool for de novo sequencing of proteins using tandem mass spectra acquired in a variety of types of mass analyzers and different fragmentation techniques. PowerNovo involves an ensemble of models for peptide sequencing: model for detecting regularities in tandem mass spectra, precursors, and fragment ions and a natural language processing model, which has a function of peptide sequence quality assessment and helps with reconstruction of noisy sequences. The results of testing showed that the performance of PowerNovo is comparable and even better than widely utilized PointNovo, DeepNovo, Casanovo, and Novor packages. Also, PowerNovo provides complete cycle of processing (pipeline) of mass spectrometry data and, along with predicting the peptide sequence, involves the peptide assembly and protein inference blocks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    诊断方法的整合有望在流行和非流行地区推进疟疾传播的监测。血清学检测是鉴别和界定疟疾传播的有价值的工具,作为快速诊断测试(RDT)和厚涂片显微镜的补充方法。这里,我们评估了针对包含PvMSP-1Sal-I菌株的整个氨基酸序列的肽的抗体作为间日疟原虫暴露的可行血清学生物标志物的潜力。
    我们筛选了包含间日疟原虫孢子表面蛋白1(PvMSP-1)Sal-I菌株的完整氨基酸序列的肽作为间日疟原虫暴露的潜在生物标志物。这里,使用SPOT合成技术,然后进行免疫印迹,鉴定了由感染间日疟原虫的个体的抗体特异性识别的免疫显性肽。在免疫印迹测定中,基于它们的较高且特异的反应性来选择两个15聚体肽。随后,使用SPPS(固相肽合成)以可溶性形式合成肽p70和p314,并通过ELISA(IgG,和子类)。
    这项研究揭示了来自巴西亚马逊地区的大多数间日疟原虫感染个体中针对肽p314的IgG抗体的存在。计算机B细胞表位预测进一步支持利用p314作为评估疟疾传播的潜在生物标志物。由于其氨基酸序列是PvMSP-1保守区块的一部分而得到加强。的确,与感染恶性疟原虫的患者和从未接触过疟疾的未感染个体相比,间日疟原虫感染的患者通过IgG1和IgG3对p314的识别明显更高。
    UNASSIGNED: The integration of diagnostic methods holds promise for advancing the surveillance of malaria transmission in both endemic and non-endemic regions. Serological assays emerge as valuable tools to identify and delimit malaria transmission, serving as a complementary method to rapid diagnostic tests (RDT) and thick smear microscopy. Here, we evaluate the potential of antibodies directed against peptides encompassing the entire amino acid sequence of the PvMSP-1 Sal-I strain as viable serological biomarkers for P. vivax exposure.
    UNASSIGNED: We screened peptides encompassing the complete amino acid sequence of the Plasmodium vivax Merozoite Surface Protein 1 (PvMSP-1) Sal-I strain as potential biomarkers for P. vivax exposure. Here, immunodominant peptides specifically recognized by antibodies from individuals infected with P. vivax were identified using the SPOT-synthesis technique followed by immunoblotting. Two 15-mer peptides were selected based on their higher and specific reactivity in immunoblotting assays. Subsequently, peptides p70 and p314 were synthesized in soluble form using SPPS (Solid Phase Peptide Synthesis) and tested by ELISA (IgG, and subclasses).
    UNASSIGNED: This study unveils the presence of IgG antibodies against the peptide p314 in most P. vivax-infected individuals from the Brazilian Amazon region. In silico B-cell epitope prediction further supports the utilization of p314 as a potential biomarker for evaluating malaria transmission, strengthened by its amino acid sequence being part of a conserved block of PvMSP-1. Indeed, compared to patients infected with P. falciparum and uninfected individuals never exposed to malaria, P. vivax-infected patients have a notably higher recognition of p314 by IgG1 and IgG3.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在生物信息学中,仅根据其氨基酸序列了解蛋白质的功能是一项至关重要但复杂的任务。传统上,事实证明,这一挑战是困难的。然而,近年来见证了深度学习作为一种强大工具的兴起,在蛋白质功能预测方面取得了显著成功。他们的优势在于他们能够自动从蛋白质序列中学习信息特征,然后可以用来预测蛋白质的功能。这项研究建立在这些进步的基础上,提出了一个新的模型:CNN-CBAM+BiGRU。它包含一个卷积块注意模块(CBAM)与BiGRU。CBAM充当聚光灯,指导CNN专注于蛋白质数据中信息最丰富的部分,导致更准确的特征提取。BiGRU,一种循环神经网络(RNN),擅长捕捉蛋白质序列中的远程依赖关系,这对于准确的函数预测至关重要。所提出的模型整合了CNN-CBAM和BiGRU的优势。这项研究的发现,通过实验验证,展示这种组合方法的有效性。对于人类数据集,对于细胞成分,建议的方法优于CNN-BIGRU+ATT模型+1.0%,+1.1%的分子功能,生物过程+0.5%。对于酵母数据集,对于细胞成分,建议的方法优于CNN-BIGRU+ATT模型+2.4%,+1.2%的分子功能,生物过程+0.6%。
    Understanding a protein\'s function based solely on its amino acid sequence is a crucial but intricate task in bioinformatics. Traditionally, this challenge has proven difficult. However, recent years have witnessed the rise of deep learning as a powerful tool, achieving significant success in protein function prediction. Their strength lies in their ability to automatically learn informative features from protein sequences, which can then be used to predict the protein\'s function. This study builds upon these advancements by proposing a novel model: CNN-CBAM+BiGRU. It incorporates a Convolutional Block Attention Module (CBAM) alongside BiGRUs. CBAM acts as a spotlight, guiding the CNN to focus on the most informative parts of the protein data, leading to more accurate feature extraction. BiGRUs, a type of Recurrent Neural Network (RNN), excel at capturing long-range dependencies within the protein sequence, which are essential for accurate function prediction. The proposed model integrates the strengths of both CNN-CBAM and BiGRU. This study\'s findings, validated through experimentation, showcase the effectiveness of this combined approach. For the human dataset, the suggested method outperforms the CNN-BIGRU+ATT model by +1.0 % for cellular components, +1.1 % for molecular functions, and +0.5 % for biological processes. For the yeast dataset, the suggested method outperforms the CNN-BIGRU+ATT model by +2.4 % for the cellular component, +1.2 % for molecular functions, and +0.6 % for biological processes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    蛋白酪氨酸磷酸酶非受体21型(PTPN21)是一种调节细胞生长和侵袭的胞浆蛋白酪氨酸磷酸酶。由于其致癌特性,PTPN21最近已成为癌症的潜在治疗靶标。在这项研究中,PTPN21FERM结构域的三维结构通过X射线晶体学以2.1µ分辨率确定。晶体结构表明,该结构域具有典型的FERM折叠,并由三个亚结构域组成,这些亚结构域通过高度保守的分子内疏水相互作用紧密堆积。与此一致,PTPN21FERM结构域与其他几个FERM结构域具有高度的结构同源性。此外,结构叠加证明了PTPN21FERM结构域的两个推定的蛋白质结合位点,它们被认为与其结合伴侣的相互作用有关,驱动蛋白家族成员1C。因此,这些数据表明,PTPN21的FERM结构域充当介导蛋白质-蛋白质相互作用的模块,像其他FERM域一样。
    Protein tyrosine phosphatase non-receptor type 21 (PTPN21) is a cytosolic protein tyrosine phosphatase that regulates cell growth and invasion. Due to its oncogenic properties, PTPN21 has recently emerged as a potential therapeutic target for cancer. In this study, the three-dimensional structure of the PTPN21 FERM domain was determined at 2.1 Å resolution by X-ray crystallography. The crystal structure showed that this domain harbors canonical FERM folding and consists of three subdomains that are tightly packed via highly conserved intramolecular hydrophobic interactions. Consistent with this, the PTPN21 FERM domain shares high structural homology with several other FERM domains. Moreover, structural superimposition demonstrated two putative protein-binding sites of the PTPN21 FERM domain, which are presumed to be associated with interaction with its binding partner, kinesin family member 1C. Thus, these data suggest that the FERM domain of PTPN21 serves as a module that mediates protein-protein interaction, like other FERM domains.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:激素受体通过与配体结合发挥其功能,这导致由基因组或非基因组机制介导的细胞信号传导激活。tick虫及其宿主Bostaurus的内在分子通讯包括涉及激素的内分泌调节。在本研究中,我们对R.microplus膜相关孕酮受体(RmMAPRC)进行了分子和计算机模拟分析。
    方法:使用生物信息学工具分析RmMAPRC蛋白序列,通过三维建模和分子对接对其结构进行表征。半定量逆转录和聚合酶链反应(sqRT-PCR)评估了蜱器官和胚胎细胞中RmMAPRC基因的存在和相对表达。
    结果:RmMAPRC在唾液腺中的相对表达,卵巢,胚胎细胞显示3%的过表达,13%,24%,分别。生物信息学分析表明,RmMAPRC对应于〜23.7kDa的孕酮受体膜成分1(RmPGRMC1),具有N末端跨膜结构域和C末端细胞色素b5样血红素/类固醇结合结构域。对接结果表明RmPGRMC1可以与孕酮(P4)结合,一些孕激素,和P4拮抗剂。系统发育重建表明,Rhipicephalusspp。MAPRC受体聚集在包括阑尾R.在内的进化枝中,R.sanguineus,和R.microplus(RmMAPRC),哺乳动物和蠕虫MAPRC受体聚集在远离蜱的两个独立的分支中。
    结论:RmPGRMC1的存在突出了作为节肢动物寄生虫成功的保守适应性机制的调控的重要性,使其成为滴答控制的目标。
    BACKGROUND: Hormone receptors exert their function through binding with their ligands, which results in cellular signaling activation mediated by genomic or non-genomic mechanisms. The intrinsic molecular communication of tick Rhipicephalus microplus and its host Bos taurus comprises an endocrine regulation involving hormones. In the present study, we performed a molecular and in silico analysis of a Membrane Associated Progesterone Receptor in R. microplus (RmMAPRC).
    METHODS: The RmMAPRC protein sequence was analyzed with bioinformatics tools, and its structure was characterized by three-dimensional (3D) modeling and molecular docking. A semi-quantitative reverse transcription and polymerase chain reaction (sqRT-PCR) assessed the RmMAPRC gene presence and relative expression in tick organs and embryonic cells.
    RESULTS: RmMAPRC relative expression in salivary glands, ovaries, and embryonic cells showed overexpression of 3%, 13%, and 24%, respectively. Bioinformatic analysis revealed that RmMAPRC corresponded to a Progesterone Receptor Membrane Component 1 (RmPGRMC1) of ~23.7 kDa, with an N-terminal transmembrane domain and a C-terminal Cytochrome b5-like heme/steroid binding domain. The docking results suggest that RmPGRMC1 could bind to progesterone (P4), some progestins, and P4 antagonists. The phylogenetic reconstruction showed that Rhipicephalus spp. MAPRC receptors were clustered in a clade that includes R. appendiculatus, R. sanguineus, and R. microplus (RmMAPRC), and mammals and helminths MAPRC receptors clustered in two separated clades away from ticks.
    CONCLUSIONS: The presence of RmPGRMC1 highlights the importance of transregulation as a conserved adaptive mechanism that has succeeded for arthropod parasites, making it a target for tick control.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    白腐真菌和细菌群落是独特的生态系统,在木材分解过程中发生不同类型的共生相互作用,比如合作,互利,营养竞争,和对抗。几丁质活性裂解多糖单加氧酶(LPMO)在这些共生相互作用中的作用是本研究的主题。
    在这项研究中,生物信息学工具用于分析通过隐马尔可夫模型(HMM)从CucPhuong原始森林中白腐真菌周围收集的腐殖质样品的细菌宏基因组DNA数据库中挖掘的推定LPMO的序列和结构,越南。编码推定的LPMO的两个基因在大肠杆菌中表达并纯化用于酶活性测定。
    通过氨基酸序列比较确认了根据HMM谱注释为推定LPMO的31种全长蛋白质。比较结果表明,尽管蛋白质的氨基酸序列差异很大,它们共有9种保守的氨基酸,包括表征细菌LPMO的活性位点的H1-Hx-Yz基序的两个组氨酸和一个苯丙氨酸。这些蛋白质的结构分析表明它们是具有不同功能的多结构域蛋白质。使用Alphafold2对这些推定的LPMO的催化域3-D结构的预测表明,它们的空间结构在形状上非常相似,尽管它们的蛋白质序列非常不同。测试蛋白质GL0247266和GL0183513的活性的结果显示它们是几丁质活性LPMO。使用Alphafold2预测这两个LPMO的3-D结构表明GL0247266具有五个功能结构域,而GL0183513有四个功能域,其中两个与霍乱弧菌GbpA蛋白的GbpA_2和GbpA_3结构域相似。在其他11种蛋白质中也检测到GbpA_2-GbpA_3复合物。根据功能域的结构特征,可以假设几丁质活性GbpA样LPMO在原始森林中分解树木上共存的真菌和细菌群落之间的关系中的作用。
    UNASSIGNED: White-rot fungi and bacteria communities are unique ecosystems with different types of symbiotic interactions occurring during wood decomposition, such as cooperation, mutualism, nutritional competition, and antagonism. The role of chitin-active lytic polysaccharide monooxygenases (LPMOs) in these symbiotic interactions is the subject of this study.
    UNASSIGNED: In this study, bioinformatics tools were used to analyze the sequence and structure of putative LPMOs mined by hidden Markov model (HMM) profiles from the bacterial metagenomic DNA database of collected humus samples around white-rot fungi in Cuc Phuong primary forest, Vietnam. Two genes encoding putative LPMOs were expressed in E. coli and purified for enzyme activity assay.
    UNASSIGNED: Thirty-one full-length proteins annotated as putative LPMOs according to HMM profiles were confirmed by amino acid sequence comparison. The comparison results showed that although the amino acid sequences of the proteins were very different, they shared nine conserved amino acids, including two histidine and one phenylalanine that characterize the H1-Hx-Yz motif of the active site of bacterial LPMOs. Structural analysis of these proteins revealed that they are multidomain proteins with different functions. Prediction of the catalytic domain 3-D structure of these putative LPMOs using Alphafold2 showed that their spatial structures were very similar in shape, although their protein sequences were very different. The results of testing the activity of proteins GL0247266 and GL0183513 show that they are chitin-active LPMOs. Prediction of the 3-D structures of these two LPMOs using Alphafold2 showed that GL0247266 had five functional domains, while GL0183513 had four functional domains, two of which that were similar to the GbpA_2 and GbpA_3 domains of protein GbpA of Vibrio cholerae bacteria. The GbpA_2 - GbpA_3 complex was also detected in 11 other proteins. Based on the structural characteristics of functional domains, it is possible to hypothesize the role of chitin-active GbpA-like LPMOs in the relationship between fungal and bacterial communities coexisting on decomposing trees in primary forests.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:CBM13家族包含主要存在于酶和几种蓖麻毒素-B凝集素中的碳水化合物结合模块。蓖麻毒素B凝集素结构域在很大程度上类似于CBM13模块。历史上,蓖麻毒素B凝集素和CBM13蛋白被认为是完全不同的,尽管它们的结构和功能相似。
    结果:在这项数据挖掘研究中,我们研究了这些交织在一起的蛋白质组的结构和功能相似性。由于结构和功能的高度相似性,以及几个数据库中命名使用的差异,可能会出现混乱。首先,我们演示了公共蛋白质数据库如何使用不同的命名系统来描述CBM13模块和推定的蓖麻毒素B凝集素结构域。我们建议引入一个新的CBM13域标识符,以及UniProt中CAZy交叉引用的扩展,以保护公共数据库中CAZy和非CAZy条目之间的区别。由于其他凝集素家族和CBM家族可能会出现类似的问题,我们建议在所有现有CBM家族中引入新的CBMInterPro域标识符.第二,我们调查了系统发育,推定的蓖麻毒素B凝集素结构域和CBM13模块之间的命名和结构相似性,利用序列相似性网络。我们得出的结论是,蓖麻毒素B/CBM13超家族可能比最初认为的要大,并且几个推定的蓖麻毒素B凝集素结构域可能显示CAZyme功能,尽管生化证据仍有待提供。
    结论:蓖麻毒素B凝集素结构域和CBM13模块是相关的蛋白质组,其数据库语义目前偏向蓖麻毒素B凝集素。在UniProt中的CAZy交叉引用的修订和在InterPro中的专用CBM13域标识符的引入可以解决此问题。此外,我们的分析表明,几种具有推定的蓖麻毒素B凝集素结构域的蛋白质与CBM13模块显示出非常强的结构相似性。因此,蓖麻毒素-B凝集素结构域和CBM13模块可被认为是较大蓖麻毒素-B/CBM13超家族的远隔成员。
    BACKGROUND: The CBM13 family comprises carbohydrate-binding modules that occur mainly in enzymes and in several ricin-B lectins. The ricin-B lectin domain resembles the CBM13 module to a large extent. Historically, ricin-B lectins and CBM13 proteins were considered completely distinct, despite their structural and functional similarities.
    RESULTS: In this data mining study, we investigate structural and functional similarities of these intertwined protein groups. Because of the high structural and functional similarities, and differences in nomenclature usage in several databases, confusion can arise. First, we demonstrate how public protein databases use different nomenclature systems to describe CBM13 modules and putative ricin-B lectin domains. We suggest the introduction of a novel CBM13 domain identifier, as well as the extension of CAZy cross-references in UniProt to guard the distinction between CAZy and non-CAZy entries in public databases. Since similar problems may occur with other lectin families and CBM families, we suggest the introduction of novel CBM InterPro domain identifiers to all existing CBM families. Second, we investigated phylogenetic, nomenclatural and structural similarities between putative ricin-B lectin domains and CBM13 modules, making use of sequence similarity networks. We concluded that the ricin-B/CBM13 superfamily may be larger than initially thought and that several putative ricin-B lectin domains may display CAZyme functionalities, although biochemical proof remains to be delivered.
    CONCLUSIONS: Ricin-B lectin domains and CBM13 modules are associated groups of proteins whose database semantics are currently biased towards ricin-B lectins. Revision of the CAZy cross-reference in UniProt and introduction of a dedicated CBM13 domain identifier in InterPro may resolve this issue. In addition, our analyses show that several proteins with putative ricin-B lectin domains show very strong structural similarity to CBM13 modules. Therefore ricin-B lectin domains and CBM13 modules could be considered distant members of a larger ricin-B/CBM13 superfamily.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号