Protein Interaction Mapping

蛋白质相互作用作图
  • 文章类型: Journal Article
    已知蛋白质-蛋白质相互作用(PPIs)参与大多数细胞功能,详细了解这种相互作用对于研究它们在正常和病理条件下的作用至关重要。通过计算方法的进步,在识别PPI方面正在取得重大进展。特别是,基于AlphaFold2机器学习的模型已被证明可以通过预测蛋白质复合物的3D结构来加速药物发现过程.在这一章中,提供了用于预测PAR-3与其蛋白质伴侣衔接分子crk之间的蛋白质间相互作用的简单方案。这种基于人工智能和公开可用的方法可以为进一步研究治疗药物靶标提供资源。
    Protein-protein interactions (PPIs) are known to be involved in most cellular functions, and a detailed knowledge of such interactions is essential for studying their role in normal and pathological conditions. Significant progress is being made in the identification of PPIs through advances in computational methods. In particular, the AlphaFold2 machine learning-based model has been shown to accelerate drug discovery process by predicting the 3D structure of protein complexes. In this chapter, a straightforward protocol for predicting interprotein interactions between PAR-3 and its protein partner adapter molecule crk is provided. Such artificial intelligence-based and publicly available approaches can provide a resource for further investigation of therapeutic drug targets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Case Reports
    8-羟基鸟嘌呤(8-oxoG)是最常见的氧化DNA损伤,未修复的8-oxoG与精子中的DNA片段化有关。然而,8-oxoG对精子发生的分子效应尚不完全清楚。这里,我们确定了一个不育公牛(C14)由于弱精子症。我们通过反相液相色谱/质谱(RP-LC/MS)比较了8-oxoG的整体浓度,通过下一代测序(OG-seq)得出的8-oxoG的基因组分布,通过二维聚丙烯酰胺凝胶电泳,然后通过肽质量指纹图谱(2D-PAGE/PMF)在C14和可育公牛(C13)的精子中表达精子蛋白。我们发现,C13和C14精子中8-oxoG的平均水平分别占总dG的0.027%和0.044%,并且在不育精子DNA中明显更高(p=0.0028)。超过81%的8-oxoG基因座分布在转录起始位点(TSS)周围,而携带8-oxoG的165个基因是不育精子所独有的。功能富集和网络分析显示,高尔基体显着富集了不育精子的8-oxoG基因的产物(q=2.2×10-7)。蛋白质组学分析证实顶体相关蛋白,包括顶体结合蛋白(ACRBP),在不育精子中下调。这些初步结果表明,精子发生过程中8-oxoG的形成失调了顶体相关的基因网络,导致精子结构和功能缺陷,导致不育。
    8-Hydroxyguanine (8-oxoG) is the most common oxidative DNA lesion and unrepaired 8-oxoG is associated with DNA fragmentation in sperm. However, the molecular effects of 8-oxoG on spermatogenesis are not entirely understood. Here, we identified one infertile bull (C14) due to asthenoteratozoospermia. We compared the global concentration of 8-oxoG by reverse-phase liquid chromatography/mass spectrometry (RP-LC/MS), the genomic distribution of 8-oxoG by next-generation sequencing (OG-seq), and the expression of sperm proteins by 2-dimensional polyacrylamide gel electrophoresis followed by peptide mass fingerprinting (2D-PAGE/PMF) in the sperm of C14 with those of a fertile bull (C13). We found that the average levels of 8-oxoG in C13 and C14 sperm were 0.027% and 0.044% of the total dG and it was significantly greater in infertile sperm DNA (p = 0.0028). Over 81% of the 8-oxoG loci were distributed around the transcription start site (TSS) and 165 genes harboring 8-oxoG were exclusive to infertile sperm. Functional enrichment and network analysis revealed that the Golgi apparatus was significantly enriched with the products from 8-oxoG genes of infertile sperm (q = 2.2 × 10-7). Proteomic analysis verified that acrosome-related proteins, including acrosin-binding protein (ACRBP), were downregulated in infertile sperm. These preliminary results suggest that 8-oxoG formation during spermatogenesis dysregulated the acrosome-related gene network, causing structural and functional defects of sperm and leading to infertility.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    罕见的基于变异的分析开始确定神经精神疾病和其他疾病的风险基因。然而,鉴定的基因仅占预测的因果基因的一小部分。最近的研究表明,罕见的破坏性变异在特定的基因集中显着富集。能够联合建模罕见变异和基因集以识别富集的基因集并使用这些富集的基因集来优先考虑其他风险基因的方法可以提高对疾病遗传结构的理解。
    我们提出了DECO(从头突变的综合分析,罕见的病例/对照变异和通过基因集的组学信息),罕见变异和基因集分析的综合方法。该方法可以(i)直接在统计模型内测试基因集的富集,和(ii)使用富集的基因集对现有基因进行排序,并优先考虑测试的疾病的其他风险基因。在模拟中,DECO比仅使用变体数据的同源方法表现更好。为了演示所提出的协议的应用,我们已经将这种方法应用于精神分裂症的罕见变异数据集.与仅使用变体信息的方法相比,DECO能够优先考虑其他风险基因。
    DECO可用于分析任何疾病的罕见变异和生物学途径或细胞类型。该软件包可在Githubhttps://github.com/hoangtn/DECO上获得。
    Rare variant-based analyses are beginning to identify risk genes for neuropsychiatric disorders and other diseases. However, the identified genes only account for a fraction of predicted causal genes. Recent studies have shown that rare damaging variants are significantly enriched in specific gene-sets. Methods which are able to jointly model rare variants and gene-sets to identify enriched gene-sets and use these enriched gene-sets to prioritize additional risk genes could improve understanding of the genetic architecture of diseases.
    We propose DECO (Integrated analysis of de novo mutations, rare case/control variants and omics information via gene-sets), an integrated method for rare-variant and gene-set analysis. The method can (i) test the enrichment of gene-sets directly within the statistical model, and (ii) use enriched gene-sets to rank existing genes and prioritize additional risk genes for tested disorders. In simulations, DECO performs better than a homologous method that uses only variant data. To demonstrate the application of the proposed protocol, we have applied this approach to rare-variant datasets of schizophrenia. Compared with a method which only uses variant information, DECO is able to prioritize additional risk genes.
    DECO can be used to analyze rare-variants and biological pathways or cell types for any disease. The package is available on Github https://github.com/hoangtn/DECO.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:正确评分蛋白质-蛋白质对接模型以找出正确的模型是一个开放的挑战,也是CAPRI(预测相互作用的批判性评估)中的评估对象,社区范围内的盲目对接实验。我们在现场介绍了CONSRNK(CONSensusRANKing),第一种纯共识方法。也可作为Web服务器,CONSRank根据对接模型匹配其中最常见的残基间接触的能力对对接模型进行排序。在所有最新的CAPRI回合中,我们一直在盲目地测试CONSRank,我们展示了它与最先进的能量和基于知识的评分功能竞争。最近,我们开发了Clust-CONSRANK,一种引入基于联系人的模型聚类的算法,作为CONSRANK评分过程的初步步骤。在最新的CASP13-CAPRI联合实验中,我们作为得分手参加了一个新颖的管道,结合我们的评分工具,CONSRank和Clust-CONSRank,使用我们的接口分析工具COCOMAPS。正在形成的共识的力量指导了10个提交模型的选择,他们的最终排名得到了界面分析结果的帮助。
    结果:由于上述方法,到目前为止,我们是CASP13-CAPRI排名第一的得分手,在大多数目标中,高/中质量模型都排在前1位(总共19个中有11个)。我们也是排名前十的第一个得分手,与另一组相当,也是前5名的第二得分手。Further,相对于绑定接口的预测,我们排名最高,在所有得分手和预测者中。使用CASP13-CAPRI目标作为案例研究,我们在这里详细说明我们采用的方法。
    结论:在最终模型选择和排名中引入了一些灵活性,以及根据目标区分采用的评分方法是我们非常成功的业绩的关键资产,与之前的CAPRI轮相比。我们提出的方法完全基于社区可用的方法,因此可以由任何用户复制。
    BACKGROUND: Properly scoring protein-protein docking models to single out the correct ones is an open challenge, also object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), a community-wide blind docking experiment. We introduced in the field CONSRANK (CONSensus RANKing), the first pure consensus method. Also available as a web server, CONSRANK ranks docking models in an ensemble based on their ability to match the most frequent inter-residue contacts in it. We have been blindly testing CONSRANK in all the latest CAPRI rounds, where we showed it to perform competitively with the state-of-the-art energy and knowledge-based scoring functions. More recently, we developed Clust-CONSRANK, an algorithm introducing a contact-based clustering of the models as a preliminary step of the CONSRANK scoring process. In the latest CASP13-CAPRI joint experiment, we participated as scorers with a novel pipeline, combining both our scoring tools, CONSRANK and Clust-CONSRANK, with our interface analysis tool COCOMAPS. Selection of the 10 models for submission was guided by the strength of the emerging consensus, and their final ranking was assisted by results of the interface analysis.
    RESULTS: As a result of the above approach, we were by far the first scorer in the CASP13-CAPRI top-1 ranking, having high/medium quality models ranked at the top-1 position for the majority of targets (11 out of the total 19). We were also the first scorer in the top-10 ranking, on a par with another group, and the second scorer in the top-5 ranking. Further, we topped the ranking relative to the prediction of binding interfaces, among all the scorers and predictors. Using the CASP13-CAPRI targets as case studies, we illustrate here in detail the approach we adopted.
    CONCLUSIONS: Introducing some flexibility in the final model selection and ranking, as well as differentiating the adopted scoring approach depending on the targets were the key assets for our highly successful performance, as compared to previous CAPRI rounds. The approach we propose is entirely based on methods made available to the community and could thus be reproduced by any user.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    We propose an effective machine learning approach to identify group of interacting single nucleotide polymorphisms (SNPs), which contribute most to the breast cancer (BC) risk by assuming dependencies among BCAC iCOGS SNPs. We adopt a gradient tree boosting method followed by an adaptive iterative SNP search to capture complex non-linear SNP-SNP interactions and consequently, obtain group of interacting SNPs with high BC risk-predictive potential. We also propose a support vector machine formed by the identified SNPs to classify BC cases and controls. Our approach achieves mean average precision (mAP) of 72.66, 67.24 and 69.25 in discriminating BC cases and controls in KBCP, OBCS and merged KBCP-OBCS sample sets, respectively. These results are better than the mAP of 70.08, 63.61 and 66.41 obtained by using a polygenic risk score model derived from 51 known BC-associated SNPs, respectively, in KBCP, OBCS and merged KBCP-OBCS sample sets. BC subtype analysis further reveals that the 200 identified KBCP SNPs from the proposed method performs favorably in classifying estrogen receptor positive (ER+) and negative (ER-) BC cases both in KBCP and OBCS data. Further, a biological analysis of the identified SNPs reveals genes related to important BC-related mechanisms, estrogen metabolism and apoptosis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Infection and disease progression is the outcome of protein interactions between pathogen and host. Pathogen, the role player of Infection, is becoming a severe threat to life as because of its adaptability toward drugs and evolutionary dynamism in nature. Identifying protein targets by analyzing protein interactions between host and pathogen is the key point. Proteins with higher degree and possessing some topologically significant graph theoretical measures are found to be drug targets. On the other hand, exceptional nodes may be involved in infection mechanism because of some pathway process and biologically unknown factors. In this article, we attempt to investigate characteristics of host-pathogen protein interactions by presenting a comprehensive review of computational approaches applied on different infectious diseases. As an illustration, we have analyzed a case study on infectious disease malaria, with its causative agent Plasmodium falciparum acting as \'Bait\' and host, Homo sapiens/human acting as \'Prey\'. In this pathogen-host interaction network based on some interconnectivity and centrality properties, proteins are viewed as central, peripheral, hub and non-hub nodes and their significance on infection process. Besides, it is observed that because of sparseness of the pathogen and host interaction network, there may be some topologically unimportant but biologically significant proteins, which can also act as Bait/Prey. So, functional similarity or gene ontology mapping can help us in this case to identify these proteins.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Letter
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    In this study, a minimum dominating set based approach was developed and implemented as a Cytoscape plugin to identify critical and redundant proteins in a protein interaction network. We focused on the investigation of the properties associated with critical proteins in the context of the analysis of interaction networks specific to cell cycle in both yeast and human. A total of 132 yeast genes and 129 human proteins have been identified as critical nodes while 950 in yeast and 980 in human have been categorized as redundant nodes. A clear distinction between critical and redundant proteins was observed when examining their topological parameters including betweenness centrality, suggesting a central role of critical proteins in the control of a network. The significant differences in terms of gene coexpression and functional similarity were observed between the two sets of proteins in yeast. Critical proteins were found to be enriched with essential genes in both networks and have a more deleterious effect on the network integrity than their redundant counterparts. Furthermore, we obtained statistically significant enrichments of proteins that govern human diseases including cancer-related and virus-targeted genes in the corresponding set of critical proteins.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Comparative Study
    The overall topology and interfacial interactions play key roles in understanding structural and functional principles of protein complexes. Elastic Network Model (ENM) and Protein Contact Network (PCN) are two widely used methods for high throughput investigation of structures and interactions within protein complexes. In this work, the comparative analysis of ENM and PCN relative to hemoglobin (Hb) was taken as case study. We examine four types of structural and dynamical paradigms, namely, conformational change between different states of Hbs, modular analysis, allosteric mechanisms studies, and interface characterization of an Hb. The comparative study shows that ENM has an advantage in studying dynamical properties and protein-protein interfaces, while PCN is better for describing protein structures quantitatively both from local and from global levels. We suggest that the integration of ENM and PCN would give a potential but powerful tool in structural systems biology.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Accurately predicting the survival outcome of patients is of great importance in clinical cancer research. In the past decade, building survival prediction models based on gene expression data has received increasing interest. However, the existing methods are mainly based on individual gene signatures, which are known to have limited prediction accuracy on independent datasets and unclear biological relevance. Here, we propose a novel pathway-based survival prediction method called DRWPSurv in order to accurately predict survival outcome. DRWPSurv integrates gene expression profiles and prior gene interaction information to topologically infer survival associated pathway activities, and uses the pathway activities as features to construct Lasso-Cox model. It uses topological importance of genes evaluated by directed random walk to enhance the robustness of pathway activities and thereby improve the predictive performance. We applied DRWPSurv on three independent breast cancer datasets and compared the predictive performance with a traditional gene-based method and four pathway-based methods. Results showed that pathway-based methods obtained comparable or better predictive performance than the gene-based method, whereas DRWPSurv could predict survival outcome with better accuracy and robustness among the pathway-based methods. In addition, the risk pathways identified by DRWPSurv provide biologically informative models for breast cancer prognosis and treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号