Cheminformatics

化学信息学
  • 文章类型: Journal Article
    先前的研究表明,分子的三维(3D)几何和电子结构在确定其关键性质和分子间相互作用中起着至关重要的作用。因此,有必要建立一个包含分子最稳定的3D几何构象和电子结构的量子化学(QC)属性数据库。在这项研究中,高质量的QC属性数据库,叫做QuanDB,被开发,其中包括结构多样的分子实体,并具有用户友好的界面。目前,QuanDB包含来自公共数据库和科学文献的154,610种化合物,有10,125个脚手架。元素组成包括九种元素:H,C,O,N,P,S,F,Cl,和Br。对于每个分子,QuanDB提供53个全局和5个局部QC属性以及最稳定的3D构象。这些属性分为三类:几何结构,电子结构,和热力学。B3LYP-D3(BJ)/6-311G(d)/SMD/水和B3LYP-D3(BJ)/def2-TZVP/SMD/水理论水平的几何结构优化和单点能量计算,分别,用于确保高精度的QC属性计算,计算成本超过107个核心小时。QuanDB提供高价值的几何和电子结构信息,用于分子表示模型,这对基于机器学习的分子设计至关重要,从而有助于化合物空间的全面描述。作为QC属性的新的高质量数据集,QuanDB有望成为机器学习模型训练和优化的基准工具,从而进一步推进新药和新材料的开发。QuanDB是免费提供的,没有注册,在https://quandb。cmdrg.com/.
    Previous studies have shown that the three-dimensional (3D) geometric and electronic structure of molecules play a crucial role in determining their key properties and intermolecular interactions. Therefore, it is necessary to establish a quantum chemical (QC) property database containing the most stable 3D geometric conformations and electronic structures of molecules. In this study, a high-quality QC property database, called QuanDB, was developed, which included structurally diverse molecular entities and featured a user-friendly interface. Currently, QuanDB contains 154,610 compounds sourced from public databases and scientific literature, with 10,125 scaffolds. The elemental composition comprises nine elements: H, C, O, N, P, S, F, Cl, and Br. For each molecule, QuanDB provides 53 global and 5 local QC properties and the most stable 3D conformation. These properties are divided into three categories: geometric structure, electronic structure, and thermodynamics. Geometric structure optimization and single point energy calculation at the theoretical level of B3LYP-D3(BJ)/6-311G(d)/SMD/water and B3LYP-D3(BJ)/def2-TZVP/SMD/water, respectively, were applied to ensure highly accurate calculations of QC properties, with the computational cost exceeding 107 core-hours. QuanDB provides high-value geometric and electronic structure information for use in molecular representation models, which are critical for machine-learning-based molecular design, thereby contributing to a comprehensive description of the chemical compound space. As a new high-quality dataset for QC properties, QuanDB is expected to become a benchmark tool for the training and optimization of machine learning models, thus further advancing the development of novel drugs and materials. QuanDB is freely available, without registration, at https://quandb.cmdrg.com/ .
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Editorial
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    复杂的中枢神经系统(CNS)疾病的有效治疗需要具有多药理学和多功能性的药物。然而,设计这种药物仍然是一个挑战。这里,我们提出了一种基于柔性支架的化学信息学方法(FSCA),用于合理设计多药理学药物。FSCA涉及使用不同的结合姿势将柔性支架安装到不同的受体上,例如IHCH-7179,它在5-HT2AR处采用“向下弯曲”结合姿势作为拮抗剂,在5-HT1AR处采用“向上拉伸”结合姿势作为激动剂。IHCH-7179通过阻断5-HT2AR的精神活性症状并激活5-HT1AR以减轻认知障碍,在减轻小鼠的认知障碍和精神活性症状方面表现出了有希望的结果。通过分析胺能受体结构,我们确定了两个有特色的图案,“激动剂过滤器”和“构象整形器”,“确定配体结合姿势并预测胺能受体的活性。有了这些图案,FSCA可应用于其他受体的多药理学配体的设计。
    Effective treatments for complex central nervous system (CNS) disorders require drugs with polypharmacology and multifunctionality, yet designing such drugs remains a challenge. Here, we present a flexible scaffold-based cheminformatics approach (FSCA) for the rational design of polypharmacological drugs. FSCA involves fitting a flexible scaffold to different receptors using different binding poses, as exemplified by IHCH-7179, which adopted a \"bending-down\" binding pose at 5-HT2AR to act as an antagonist and a \"stretching-up\" binding pose at 5-HT1AR to function as an agonist. IHCH-7179 demonstrated promising results in alleviating cognitive deficits and psychoactive symptoms in mice by blocking 5-HT2AR for psychoactive symptoms and activating 5-HT1AR to alleviate cognitive deficits. By analyzing aminergic receptor structures, we identified two featured motifs, the \"agonist filter\" and \"conformation shaper,\" which determine ligand binding pose and predict activity at aminergic receptors. With these motifs, FSCA can be applied to the design of polypharmacological ligands at other receptors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    中草药化合物的治疗效果通常是通过多种成分的协同相互作用来实现的。然而,目前的研究主要集中在单个成分上,忽视了中草药化合物的整体性。本研究提出了一种新的策略,以阐明基于其多组分的中草药化合物的药效物质基础(在中国名为“ZuFen”,它是指具有相似化学结构的多种成分)组成,以仙灵固宝(XLGB)胶囊为例进行研究。在从各种数据库中获取成分后,进行了基于化学信息学的成分划分,共856种成分,分为9种主要成分。此外,XLGB胶囊的药效学成分是通过分析吸收到血液中的成分来确定的。通过这些成分的组合和吸收筛选,八宝经皂苷成分,补骨脂香豆素成分,分离得到淫羊藿黄酮多苷成分。在斑马鱼中评估了这些成分的抗骨质疏松功效,证明了它们逆转泼尼松龙引起的矿化减少的能力。这些发现进一步支持以下观点:这些组分充当XLGB胶囊的药理学功效的物质基础。这项研究提供了一种新的系统策略,用于基于“多组分”观点发现中草药化合物功效的药效学物质基础。
    The therapeutic effects of Chinese herbal compounds are often achieved through the synergistic interactions of multiple ingredients. However, current research predominantly focuses on individual ingredients, neglecting the holistic nature of Chinese herbal compounds. This study proposes a novel strategy to elucidate the pharmacodynamic material basis of Chinese herbal compounds based on their multi-components (components named \'ZuFen\' in China, it refers to multiple ingredients with similar chemical structures) composition, using the Xian-Ling-Gu-Bao (XLGB) capsule as a case study. Cheminformatics-based components partitioning was conducted after sourcing ingredients from various databases, resulting in a total of 856 ingredients which were categorized into nine major components. Furthermore, the pharmacodynamic ingredients of XLGB capsule were determined by analyzing the ingredients that were absorbed into the bloodstream. Through a combination of these ingredients and screening for absorption, the Dipsacus asper saponin components, Psoralea corylifolia coumarin components, and Epimedium flavonoid polyglycosides components were isolated. The anti-osteoporosis efficacy of these components were evaluated in zebrafish, demonstrating their capability to reverse mineralization reduction caused by prednisolone. These findings further support the idea that these components serve as the material basis for the pharmacological efficacy of XLGB capsule. This study provides a novel systematic strategy for discovering the pharmacodynamic material basis of the efficacy of Chinese herbal compounds based on a \'multi-components\' perspective.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    准确评估药代动力学(PK)特性对于选择最佳候选物和避免下游故障至关重要。迁移学习是一种创新的机器学习方法,可以利用有限的数据进行高通量预测。最近,迁移学习方法在预测ADME/PK参数方面显示出希望。鉴于用于PK预测的迁移学习研究的大量增长,必须全面审查其优势和挑战。这项研究探讨了基本原理,分类,用于PK预测的各种迁移学习技术的工具包和应用,通过三个实际案例研究证明它们的效用。这项工作将为药物设计研究人员提供参考。Teaser:探索迁移学习如何彻底改变药代动力学预测,用创新的机器学习技术克服数据稀缺。发现它的分类,药物设计中的应用和实际案例研究。
    Accurate assessment of pharmacokinetic (PK) properties is crucial for selecting optimal candidates and avoiding downstream failures. Transfer learning is an innovative machine learning approach enabling high-throughput prediction with limited data. Recently, transfer learning methods showed promise in predicting ADME/PK parameters. Given the prolific growth of research on transfer learning for PK prediction, a comprehensive review of its advantages and challenges is imperative. This study explores the fundamentals, classifications, toolkits and applications of various transfer learning techniques for PK prediction, demonstrating their utility through three practical case studies. This work will serve as a reference for drug design researchers.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    稀有放线菌作为新型生物活性次级代谢产物的潜在来源受到高度重视。在这些罕见的放线菌中,由于其能够产生多种生物活性次级代谢产物,因此特别值得注意。随着细菌基因组测序的不断进行和生物信息学技术的快速发展,我们对蔗糖素的次级代谢潜力的了解可以变得更加全面,但是这个空间还没有被审查或探索过。这篇综述详细介绍了138个糖原衍生的次级代谢产物的化学结构和生物活性,根据其生物合成途径分为五个不同的组。此外,我们深入研究了九种生物活性代谢物的实验表征的生物合成途径。通过利用化学信息学和生物信息学方法的组合,我们试图建立代谢产物家族和由蔗糖菌株编码的生物合成基因簇家族之间的联系。我们的分析提供了可以与相应的BGC相关的次级代谢产物的全面视角,并突出了糖原的未充分开发的生物合成潜力。本综述也为有针对性地发现和生物合成新型糖原天然产物提供了指导。
    Rare actinomycetes are highly valued as potential sources of novel bioactive secondary metabolites. Among these rare actinomycetes, the genus Saccharothrix is particularly noteworthy due to its ability to produce a diverse range of bioactive secondary metabolites. With the continuous sequencing of bacterial genomes and the rapid development of bioinformatics technologies, our knowledge of the secondary metabolic potential of Saccharothrix can become more comprehensive, but this space has not been reviewed or explored. This review presents a detailed overview of the chemical structures and bioactivities of 138 Saccharothrix-derived secondary metabolites, which are classified into five distinct groups based on their biosynthetic pathways. Furthermore, we delve into experimentally characterized biosynthetic pathways of nine bioactive metabolites. By utilizing a combination of cheminformatic and bioinformatic approaches, we attempted to establish connections between the metabolite families and the biosynthetic gene cluster families encoded by Saccharothrix strains. Our analysis provides a comprehensive perspective on the secondary metabolites that can be linked to corresponding BGCs and highlights the underexplored biosynthetic potential of Saccharothrix. This review also provides guidance for the targeted discovery and biosynthesis of novel natural products from Saccharothrix.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    宋内志贺氏菌是革兰氏阴性细菌,是发达国家志贺氏菌病的主要原因。据报道,亚洲的患病率异常上升,中东,和拉丁美洲。迄今为止,目前尚无针对S.sonnei感染的预防性疫苗.该病原体已显示出对一线和二线抗生素的抗性。因此,针对志贺氏菌病的有效广谱疫苗开发是必不可少的。在本研究中,采用疫苗组学辅助的免疫信息学策略,从S.sonnei全蛋白质组数据中鉴定潜在的候选疫苗.与人类和人类肠道微生物组蛋白质组非同源的病原体必需蛋白,为此目的是可行的候选人。基于反向疫苗学方法,将三种抗原外膜蛋白优先用于预测前导表位。使用前导B-和T-细胞表位与合适的接头和佐剂肽序列组合来设计基于多表位的嵌合疫苗,以增强针对所设计疫苗的免疫应答。SS-MEVC构建体基于多种物理化学,免疫学特性,和免疫受体对接得分。免疫模拟分析预测了设计的疫苗构建体的强免疫原性应答能力。分子动力学模拟分析确保了前导疫苗构建体与宿主受体的稳定分子相互作用。计算机限制和克隆分析预测了SS-MEVC构建体在大肠杆菌表达系统中的可行克隆能力。预计拟议的疫苗构建体更安全,有效且能够诱导针对S.sonnei感染的强大免疫应答,并且可能值得通过体外/体内测定进行检查。
    Shigella sonnei is a gram-negative bacterium and is the primary cause of shigellosis in advanced countries. An exceptional rise in the prevalence of the disease has been reported in Asia, the Middle East, and Latin America. To date, no preventive vaccine is available against S. sonnei infections. This pathogen has shown resistances towards both first- and second-line antibiotics. Therefore, an effective broad spectrum vaccine development against shigellosis is indispensable. In the present study, vaccinomics-aided immunoinformatics strategies were pursued to identify potential vaccine candidates from the S. sonnei whole proteome data. Pathogen essential proteins that are non-homologous to human and human gut microbiome proteome set, are feasible candidates for this purpose. Three antigenic outer membrane proteins were prioritized to predict lead epitopes based on reverse vaccinology approach. Multi-epitope-based chimeric vaccines was designed using lead B- and T-cell epitopes combined with suitable linker and adjuvant peptide sequences to enhance immune responses against the designed vaccine. The SS-MEVC construct was prioritized based on multiple physicochemical, immunological properties, and immune-receptors docking scores. Immune simulation analysis predicted strong immunogenic response capability of the designed vaccine construct. The Molecular dynamic simulations analysis ensured stable molecular interactions of lead vaccine construct with the host receptors. In silico restriction and cloning analysis predicted feasible cloning capability of the SS-MEVC construct within the E. coli expression system. The proposed vaccine construct is predicted to be more safe, effective and capable of inducing robust immune responses against S. sonnei infections and may be worthy of examination via in vitro/in vivo assays.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    小分子中的味道测定在食品化学中是关键的,但是传统的实验方法可能是耗时的。因此,计算技术已经成为这项任务的有价值的工具。在这项研究中,我们使用各种分子特征表示来探索味觉预测,并在包含2601个分子的数据集上评估不同机器学习算法的性能.结果表明,基于GNN的模型在口味预测方面优于其他方法。此外,结合不同分子表示的共识模型显示出改进的性能。其中,分子指纹+GNN共识模型是表现最好的,突出GNN和分子指纹的互补优势。这些发现对食品化学研究和相关领域具有重要意义。通过利用这些计算方法,味道预测可以加快,在理解各种食物成分和相关化合物的分子结构和味觉之间的关系方面取得了进展。
    Taste determination in small molecules is critical in food chemistry but traditional experimental methods can be time-consuming. Consequently, computational techniques have emerged as valuable tools for this task. In this study, we explore taste prediction using various molecular feature representations and assess the performance of different machine learning algorithms on a dataset comprising 2601 molecules. The results reveal that GNN-based models outperform other approaches in taste prediction. Moreover, consensus models that combine diverse molecular representations demonstrate improved performance. Among these, the molecular fingerprints + GNN consensus model emerges as the top performer, highlighting the complementary strengths of GNNs and molecular fingerprints. These findings have significant implications for food chemistry research and related fields. By leveraging these computational approaches, taste prediction can be expedited, leading to advancements in understanding the relationship between molecular structure and taste perception in various food components and related compounds.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蛋白质-配体盲对接是在药物和生物学研究中广泛使用的用于研究配体和受体的结合位点和位姿的方法。最近,我们名为CB-Dock2的新型盲对接服务器已经发布,目前正在被世界各地的研究人员使用。CB-Dock2优于最先进的方法,因为它在结合位点识别和结合姿态预测方面具有准确性,由其基于知识的对接引擎启用。这种高度自动化的服务器提供交互式和直观的输入和输出Web界面,使其成为生物信息学和化学信息学社区的高效和用户友好的工具。本章简要概述了这些方法,其次是使用CB-Dock2服务器的详细指南。此外,我们提供了一个案例研究,评估使用该工具的蛋白质-配体盲对接的性能。
    Protein-ligand blind docking is a widely used method for studying the binding sites and poses of ligands and receptors in pharmaceutical and biological research. Recently, our new blind docking server named CB-Dock2 has been released and is currently being utilized by researchers worldwide. CB-Dock2 outperforms state-of-the-art methods due to its accuracy in binding site identification and binding pose prediction, which are enabled by its knowledge-based docking engine. This highly automated server offers interactive and intuitive input and output web interfaces, making it an efficient and user-friendly tool for the bioinformatics and cheminformatics communities. This chapter provides a brief overview of the methods, followed by a detailed guide on using the CB-Dock2 server. Additionally, we present a case study that evaluates the performance of protein-ligand blind docking using this tool.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    最近的研究表明,RNA作为有希望的药物靶标。然而,在检测RNA-配体相互作用方面取得了有限的进展。为了指导RNA结合配体的发现,有必要全面地描述它们,特别是在结合特异性方面,结合亲和力和药物样特性。我们建立了一个数据库,RNALID(http://biomed。nscc-gz.cn/RNALID/html/index。html#/database),收集通过低通量实验验证的RNA-配体相互作用。RNALID包含358个RNA-配体相互作用。与其他数据库相比,RNALID中94.5%的配体是完全或部分新的集合,51.78%具有新颖的二维(2D)结构。通过对配体结构的分析,结合亲和力和化学信息学参数我们发现,主要结合RNA重复的多价(MV)配体在2D和3D结构中在结构上比其他配体类型更保守,表现出比配体结合非重复RNA更高的结合特异性和结合亲和力,但远远偏离了利平斯基的5条规则。相反,与病毒RNA结合的小分子(SM)配体表现出更高的亲和力和更相似的蛋白质配体,但可能具有低结合特异性。对28个详细的药物相似度特性的进一步分析表明,RNA配体的开发需要在结合亲和力和药物相似度之间取得平衡,因为两者之间存在显着的线性相关性。比较RNALID配体与FDA批准的药物和没有生物活性的配体表明RNA结合配体在化学性质上与它们不同。结构特性和药物相似性。因此,在多个方面表征RNALID中的RNA-配体相互作用为发现和设计与RNA结合的可药物配体提供了新的见解。
    Recent studies suggest RNAs act as promising drug targets. However, limited development has been achieved in detecting RNA-ligand interactions. To guide the discovery of RNA-binding ligands, it is necessary to characterize them comprehensively, especially in the binding specificity, binding affinity and drug-like properties. We established a database, RNALID (http://biomed.nscc-gz.cn/RNALID/html/index.html#/database), which collects RNA-ligand interactions validated by low-throughput experiment. RNALID contains 358 RNA-ligand interactions. Comparing to the fellow database, 94.5% of ligands in RNALID are completely or partially novel collections, and 51.78% have novel two-dimensional (2D) structures. Through the analysis of ligand structure, binding affinity and cheminformatic parameters we found that multivalent (MV) ligands mainly binding to RNA repeats are more structurally conserved in both 2D and 3D structures than other ligand types, exhibit higher binding specificity and binding affinity than ligands binding to non-repeat RNAs, but deviate far from the Lipinski\'s rule of five. In contrary, small molecule (SM) ligands binding to virus RNA exhibit higher affinity and more resemble protein-ligands, but potentially possess low binding specificity. Further analysis on 28 detailed drug-likeness properties indicated that RNA-ligands\' development need to balance between the binding affinity and the drug-likeness because of the significant linear co-relationship between the two. Comparing RNALID ligands to FDA-approved drugs and ligands without bioactivity indicated that RNA-binding ligands are different from them in chemical properties, structural properties and drug-likeness. Thus, characterizing the RNA-ligand interactions in RNALID in multiple respects provides new insights into discovering and designing druggable ligands binding with RNA.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号