Cheminformatics

化学信息学
  • 文章类型: Journal Article
    化学空间的探索是化学信息学的一个基本方面,特别是当人们探索一个大的化合物数据集,以将化学结构与分子性质联系起来。在这项研究中,我们在药效水平上扩展了我们以前在化学空间可视化方面的工作.而不是使用传统的亲和力二元分类(活性与非活性),我们引入了一种改进的方法,根据化合物的活性水平将其分为四个不同的类别:超活性,非常活跃,活跃,不活跃。这种分类丰富了应用于药效团空间的配色方案,其中药效团假说的颜色表示由相关化合物驱动。以BCR-ABL酪氨酸激酶为例,我们确定了与药效团活性不连续相对应的有趣区域,为结构-活动关系分析提供有价值的见解。
    The exploration of chemical space is a fundamental aspect of chemoinformatics, particularly when one explores a large compound data set to relate chemical structures with molecular properties. In this study, we extend our previous work on chemical space visualization at the pharmacophoric level. Instead of using conventional binary classification of affinity (active vs inactive), we introduce a refined approach that categorizes compounds into four distinct classes based on their activity levels: super active, very active, active, and inactive. This classification enriches the color scheme applied to pharmacophore space, where the color representation of a pharmacophore hypothesis is driven by the associated compounds. Using the BCR-ABL tyrosine kinase as a case study, we identified intriguing regions corresponding to pharmacophore activity discontinuities, providing valuable insights for structure-activity relationships analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    中草药化合物的治疗效果通常是通过多种成分的协同相互作用来实现的。然而,目前的研究主要集中在单个成分上,忽视了中草药化合物的整体性。本研究提出了一种新的策略,以阐明基于其多组分的中草药化合物的药效物质基础(在中国名为“ZuFen”,它是指具有相似化学结构的多种成分)组成,以仙灵固宝(XLGB)胶囊为例进行研究。在从各种数据库中获取成分后,进行了基于化学信息学的成分划分,共856种成分,分为9种主要成分。此外,XLGB胶囊的药效学成分是通过分析吸收到血液中的成分来确定的。通过这些成分的组合和吸收筛选,八宝经皂苷成分,补骨脂香豆素成分,分离得到淫羊藿黄酮多苷成分。在斑马鱼中评估了这些成分的抗骨质疏松功效,证明了它们逆转泼尼松龙引起的矿化减少的能力。这些发现进一步支持以下观点:这些组分充当XLGB胶囊的药理学功效的物质基础。这项研究提供了一种新的系统策略,用于基于“多组分”观点发现中草药化合物功效的药效学物质基础。
    The therapeutic effects of Chinese herbal compounds are often achieved through the synergistic interactions of multiple ingredients. However, current research predominantly focuses on individual ingredients, neglecting the holistic nature of Chinese herbal compounds. This study proposes a novel strategy to elucidate the pharmacodynamic material basis of Chinese herbal compounds based on their multi-components (components named \'ZuFen\' in China, it refers to multiple ingredients with similar chemical structures) composition, using the Xian-Ling-Gu-Bao (XLGB) capsule as a case study. Cheminformatics-based components partitioning was conducted after sourcing ingredients from various databases, resulting in a total of 856 ingredients which were categorized into nine major components. Furthermore, the pharmacodynamic ingredients of XLGB capsule were determined by analyzing the ingredients that were absorbed into the bloodstream. Through a combination of these ingredients and screening for absorption, the Dipsacus asper saponin components, Psoralea corylifolia coumarin components, and Epimedium flavonoid polyglycosides components were isolated. The anti-osteoporosis efficacy of these components were evaluated in zebrafish, demonstrating their capability to reverse mineralization reduction caused by prednisolone. These findings further support the idea that these components serve as the material basis for the pharmacological efficacy of XLGB capsule. This study provides a novel systematic strategy for discovering the pharmacodynamic material basis of the efficacy of Chinese herbal compounds based on a \'multi-components\' perspective.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    科学工作流程通过集成以特定顺序执行的各种软件和工具来促进数据分析任务的自动化。要在工作流中实现透明度和可重用性,实施公平原则至关重要。这里,我们以代谢组学注释工作流(MAW)为例,描述了我们在代谢组学工作流实施FAIR原则的经验.MAW使用通用工作流语言(CWL)指定,允许在不同的工作流引擎上后续执行工作流。使用WorkflowHub上的CWL描述注册MAW。在WorkflowHub上的提交过程中,CWL描述用于使用工作流RO-Crate配置文件包装MAW,其中包括Bioschemas中的元数据。研究人员可以使用这种叙述性讨论作为指南,开始使用FAIR实践进行其生物信息学或化学信息学工作流程,同时纳入针对其研究领域的必要修订。
    Scientific workflows facilitate the automation of data analysis tasks by integrating various software and tools executed in a particular order. To enable transparency and reusability in workflows, it is essential to implement the FAIR principles. Here, we describe our experiences implementing the FAIR principles for metabolomics workflows using the Metabolome Annotation Workflow (MAW) as a case study. MAW is specified using the Common Workflow Language (CWL), allowing for the subsequent execution of the workflow on different workflow engines. MAW is registered using a CWL description on WorkflowHub. During the submission process on WorkflowHub, a CWL description is used for packaging MAW using the Workflow RO-Crate profile, which includes metadata in Bioschemas. Researchers can use this narrative discussion as a guideline to commence using FAIR practices for their bioinformatics or cheminformatics workflows while incorporating necessary amendments specific to their research area.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    To accurately simulate the inner workings of an enzyme active site with quantum mechanics (QM), not only must the reactive species be included in the model but also important surrounding residues, solvent, or coenzymes involved in crafting the microenvironment. Our lab has been developing the Residue Interaction Network Residue Selector (RINRUS) toolkit to utilize interatomic contact network information for automated, rational residue selection and QM-cluster model generation. Starting from an x-ray crystal structure of catechol-O-methyltransferase, RINRUS was used to construct a series of QM-cluster models. The reactant, product, and transition state of the methyl transfer reaction were computed for a total of 550 models, and the resulting free energies of activation and reaction were used to evaluate model convergence. RINRUS-designed models with only 200-300 atoms are shown to converge. RINRUS will serve as a cornerstone for improved and automated cheminformatics-based enzyme model design.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    抗菌素耐药性(AMR)是最严重的全球公共卫生威胁之一,因为它损害了结核病等致命传染病的成功治疗。不断需要新的治疗方法,但探索新的生化空间需要很长时间,而且成本很高。解决此问题的一种方法是重新利用已验证的靶标,并鉴定可以同时结合这些靶标的多个结合口袋的新型化学型,作为新的前导生成策略。这项研究报告了这样一个策略,动态混合药效团模型(DHPM),这代表了与传统方法相反的不同结合袋的组合相互作用特征,其中药效基团模型是从单个结合位点产生的。我们考虑过Mtb-DapB,经过验证的分枝杆菌药物靶标,作为我们的模型系统,探索DHPM筛选新的未开发化合物的有效性。Mtb-DapB具有辅因子结合位点(CBS)和相邻的底物结合位点(SBS)。设计了四种不同的Mtb-DapB模型系统,在相邻SBS中存在/不存在抑制剂2,6-PDC的情况下,NADPH/NADH占据CBS。设计了两个模型系统,其中2,6-PDC与NADPH和NADH连接以形成杂合分子。对六个模型系统进行了200ns的分子动力学模拟,并分析了轨迹以识别稳定的配体-受体相互作用特征。基于这些互动,常规药效基团模型(CPM)由单个结合位点产生,而DHPM由占据两个结合位点的杂合分子产生.通过CPM和DHPM筛选了1,563,764个公开可用分子的巨大文库。根据他们的Hashed二元分子指纹和使用Tanimoto的4点药效基团指纹进行比较,余弦,Dice和Tversky相似度矩阵。DHPM筛选的分子表现出显著的结构多样性,与通过CPM筛选的化合物相比,更好的结合强度和药物样性质表明DHPM探索用于抗TB药物发现的新化学空间的效率。DHPM的想法可以应用于广泛的分枝杆菌或其他病原体靶标,以进入未开发的化学空间。
    Antimicrobial resistance (AMR) is one of the most serious global public health threats as it compromises the successful treatment of deadly infectious diseases like tuberculosis. New therapeutics are constantly needed but it takes a long time and is expensive to explore new biochemical space. One way to address this issue is to repurpose the validated targets and identify novel chemotypes that can simultaneously bind to multiple binding pockets of these targets as a new lead generation strategy. This study reports such a strategy, dynamic hybrid pharmacophore model (DHPM), which represents the combined interaction features of different binding pockets contrary to the conventional approaches, where pharmacophore models are generated from single binding sites. We have considered Mtb-DapB, a validated mycobacterial drug target, as our model system to explore the effectiveness of DHPMs to screen novel unexplored compounds. Mtb-DapB has a cofactor binding site (CBS) and an adjacent substrate binding site (SBS). Four different model systems of Mtb-DapB were designed where, either NADPH/NADH occupies CBS in presence/absence of an inhibitor 2, 6-PDC in the adjacent SBS. Two more model systems were designed, where 2, 6-PDC was linked to NADPH and NADH to form hybrid molecules. The six model systems were subjected to 200 ns molecular dynamics simulations and trajectories were analyzed to identify stable ligand-receptor interaction features. Based on these interactions, conventional pharmacophore models (CPM) were generated from the individual binding sites while DHPMs were created from hybrid-molecules occupying both binding sites. A huge library of 1,563,764 publicly available molecules were screened by CPMs and DHPMs. The screened hits obtained from both types of models were compared based on their Hashed binary molecular fingerprints and 4-point pharmacophore fingerprints using Tanimoto, Cosine, Dice and Tversky similarity matrices. Molecules screened by DHPM exhibited significant structural diversity, better binding strength and drug like properties as compared to the compounds screened by CPMs indicating the efficiency of DHPM to explore new chemical space for anti-TB drug discovery. The idea of DHPM can be applied for a wide range of mycobacterial or other pathogen targets to venture into unexplored chemical space.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    最近,生物毒素由于其在饲料和食品以及环境中的高污染水平而受到越来越多的关注。然而,缺乏将数据驱动的计算方法与“湿”实验验证无缝连接的综合平台。为此,我们构建了一个新颖的平台,集成了毒素生物转化方法的技术方面。首先,称为ToxinDB的生物毒素数据库(http://www.rxnfinder.org/toxindb/),包含超过4836种毒素的多方面数据,已建成。接下来,在过去的十年中,从100多人策划的约58万篇文献报道中提取的300,000多个生化反应中,提取了8000多个生物转化反应规则。根据这些反应规则,建立了毒素生物转化预测模型。最后,构建了生物毒素的全球化学空间,包含约550,000种毒素和推定的毒素代谢物,其中94.7%的代谢物以前没有报道过。此外,我们进行了一个案例研究,以研究木霉菌中的桔霉素代谢,并在ToxinDB的生物转化预测工具的帮助下鉴定出了一种新的代谢产物。这个独特的综合平台将有助于探索毒素代谢组的“暗物质”,并促进解毒酶的发现。
    Recently, biogenic toxins have received increasing attention owing to their high contamination levels in feed and food as well as in the environment. However, there is a lack of an integrative platform for seamless linking of data-driven computational methods with \'wet\' experimental validations. To this end, we constructed a novel platform that integrates the technical aspects of toxin biotransformation methods. First, a biogenic toxin database termed ToxinDB (http://www.rxnfinder.org/toxindb/), containing multifaceted data on more than 4836 toxins, was built. Next, more than 8000 biotransformation reaction rules were extracted from over 300,000 biochemical reactions extracted from ~580,000 literature reports curated by more than 100 people over the past decade. Based on these reaction rules, a toxin biotransformation prediction model was constructed. Finally, the global chemical space of biogenic toxins was constructed, comprising ~550,000 toxins and putative toxin metabolites, of which 94.7% of the metabolites have not been previously reported. Additionally, we performed a case study to investigate citrinin metabolism in Trichoderma, and a novel metabolite was identified with the assistance of the biotransformation prediction tool of ToxinDB. This unique integrative platform will assist exploration of the \'dark matter\' of a toxin\'s metabolome and promote the discovery of detoxification enzymes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Over the last 5 years deep learning has progressed tremendously in both image recognition and natural language processing. Now it is increasingly applied to other data rich fields. In drug discovery, recurrent neural networks (RNNs) have been shown to be an effective method to generate novel chemical structures in the form of SMILES. However, ligands generated by current methods have so far provided relatively low diversity and do not fully cover the whole chemical space occupied by known ligands. Here, we propose a new method (DrugEx) to discover de novo drug-like molecules. DrugEx is an RNN model (generator) trained through reinforcement learning which was integrated with a special exploration strategy. As a case study we applied our method to design ligands against the adenosine A2A receptor. From ChEMBL data, a machine learning model (predictor) was created to predict whether generated molecules are active or not. Based on this predictor as the reward function, the generator was trained by reinforcement learning without any further data. We then compared the performance of our method with two previously published methods, REINVENT and ORGANIC. We found that candidate molecules our model designed, and predicted to be active, had a larger chemical diversity and better covered the chemical space of known ligands compared to the state-of-the-art.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    化学品负债,如不良反应和毒性,在现代药物发现过程中发挥着重要作用。化学负债的计算机评估是通过补充或替代体外和体内实验来降低成本和动物测试的重要步骤。在这里,我们提出了一种方法,结合了几种分类和化学方法,以便能够预测化学负债,并在化合物结构变化对其药理学特征的影响的背景下解释获得的结果。我们第一次认识到,生成地形图的监督扩展是一种有效的新化学方法。已经提出了使用监督Isomap映射新数据的新方法,而无需从头开始重新构建模型。在我们的研究中,首次在化学信息学中使用了两种方法来估计模型的适用性领域。作为模型解释的结果,已经发现了负责化合物药理学特征负面特征的结构警报。
    Chemical liabilities, such as adverse effects and toxicity, play a significant role in modern drug discovery process. In silico assessment of chemical liabilities is an important step aimed to reduce costs and animal testing by complementing or replacing in vitro and in vivo experiments. Herein, we propose an approach combining several classification and chemography methods to be able to predict chemical liabilities and to interpret obtained results in the context of impact of structural changes of compounds on their pharmacological profile. To our knowledge for the first time, the supervised extension of Generative Topographic Mapping is proposed as an effective new chemography method. New approach for mapping new data using supervised Isomap without re-building models from the scratch has been proposed. Two approaches for estimation of model\'s applicability domain are used in our study to our knowledge for the first time in chemoinformatics. The structural alerts responsible for the negative characteristics of pharmacological profile of chemical compounds has been found as a result of model interpretation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号