关键词: Complex identification method Human protein complex Multifunctional protein Protein interaction network SARS-CoV-2-affected complex

Mesh : Humans Computational Biology / methods Protein Interaction Mapping / methods Protein Interaction Maps Proteins / metabolism Saccharomyces cerevisiae / metabolism Algorithms

来  源:   DOI:10.1016/j.gpb.2023.05.001   PDF(Pubmed)

Abstract:
A fundamental principle of biology is that proteins tend to form complexes to play important roles in the core functions of cells. For a complete understanding of human cellular functions, it is crucial to have a comprehensive atlas of human protein complexes. Unfortunately, we still lack such a comprehensive atlas of experimentally validated protein complexes, which prevents us from gaining a complete understanding of the compositions and functions of human protein complexes, as well as the underlying biological mechanisms. To fill this gap, we built Human Protein Complexes Atlas (HPC-Atlas), as far as we know, the most accurate and comprehensive atlas of human protein complexes available to date. We integrated two latest protein interaction networks, and developed a novel computational method to identify nearly 9000 protein complexes, including many previously uncharacterized complexes. Compared with the existing methods, our method achieved outstanding performance on both testing and independent datasets. Furthermore, with HPC-Atlas we identified 751 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)-affected human protein complexes, and 456 multifunctional proteins that contain many potential moonlighting proteins. These results suggest that HPC-Atlas can serve as not only a computing framework to effectively identify biologically meaningful protein complexes by integrating multiple protein data sources, but also a valuable resource for exploring new biological findings. The HPC-Atlas webserver is freely available at http://www.yulpan.top/HPC-Atlas.
摘要:
生物学的基本原理是蛋白质倾向于形成复合物以在细胞的核心功能中发挥重要作用。为了全面了解人类细胞功能,拥有全面的人类蛋白质复合物图谱至关重要。不幸的是,我们仍然缺乏这样一个全面的经过实验验证的蛋白质复合物的图谱,这使我们无法完全了解人类蛋白质复合物的组成和功能以及生物学机制。为了填补这个空白,我们建立了人类蛋白质复合物图谱(HPC-Atlas),据我们所知,迄今为止最准确和最全面的人类蛋白质复合物图谱。我们整合了两个最新的蛋白质相互作用网络,并开发了一种新的计算方法来鉴定近9000种蛋白质复合物,包括许多以前未表征的复合物。与现有工程相比,我们的方法在测试和独立集上都取得了出色的性能。此外,使用HPC-Atlas,我们确定了751种严重急性呼吸综合征冠状病毒2(SARS-CoV-2)影响人类蛋白质复合物,和456种多功能蛋白质,其中包含许多潜在的月光蛋白。这些结果表明,HPC-Atlas不仅可以作为一个计算框架,通过整合多个蛋白质数据源来有效识别生物学上有意义的蛋白质复合物。也是探索新生物学发现的宝贵资源。HPC-Atlas网络服务器可在http://www上免费获得。Yulpan.顶部/HPC-Atlas。
公众号