关键词: active learning ginsenoside hydrolase graph neural networks molecular dynamics regioselectivity

Mesh : Molecular Dynamics Simulation Ginsenosides / chemistry metabolism Substrate Specificity Hydrolases / chemistry metabolism Panax / chemistry enzymology

来  源:   DOI:10.3390/molecules29153614   PDF(Pubmed)

Abstract:
Identifying the catalytic regioselectivity of enzymes remains a challenge. Compared to experimental trial-and-error approaches, computational methods like molecular dynamics simulations provide valuable insights into enzyme characteristics. However, the massive data generated by these simulations hinder the extraction of knowledge about enzyme catalytic mechanisms without adequate modeling techniques. Here, we propose a computational framework utilizing graph-based active learning from molecular dynamics to identify the regioselectivity of ginsenoside hydrolases (GHs), which selectively catalyze C6 or C20 positions to obtain rare deglycosylated bioactive compounds from Panax plants. Experimental results reveal that the dynamic-aware graph model can excellently distinguish GH regioselectivity with accuracy as high as 96-98% even when different enzyme-substrate systems exhibit similar dynamic behaviors. The active learning strategy equips our model to work robustly while reducing the reliance on dynamic data, indicating its capacity to mine sufficient knowledge from short multi-replica simulations. Moreover, the model\'s interpretability identified crucial residues and features associated with regioselectivity. Our findings contribute to the understanding of GH catalytic mechanisms and provide direct assistance for rational design to improve regioselectivity. We presented a general computational framework for modeling enzyme catalytic specificity from simulation data, paving the way for further integration of experimental and computational approaches in enzyme optimization and design.
摘要:
鉴定酶的催化区域选择性仍然是一个挑战。与实验性的试错方法相比,分子动力学模拟等计算方法为酶特性提供了有价值的见解。然而,这些模拟产生的大量数据阻碍了没有足够的建模技术的酶催化机理知识的提取。这里,我们提出了一个计算框架,利用基于图的主动学习从分子动力学来识别人参皂苷水解酶(GHs)的区域选择性,它选择性地催化C6或C20位置,从人参植物中获得稀有的去糖基化生物活性化合物。实验结果表明,即使不同的酶-底物系统表现出相似的动态行为,动态感知图模型也能很好地区分GH区域选择性,准确率高达96-98%。主动学习策略使我们的模型能够稳健地工作,同时减少对动态数据的依赖,表明它有能力从短的多副本模拟中挖掘足够的知识。此外,该模型的可解释性确定了与区域选择性相关的关键残基和特征。我们的发现有助于理解GH催化机理,并为合理设计以提高区域选择性提供直接帮助。我们提出了从模拟数据中模拟酶催化特异性的通用计算框架,为酶优化和设计中实验和计算方法的进一步整合铺平了道路。
公众号