Model building

模型建筑
  • 文章类型: Journal Article
    植物科学变得越来越复杂。随着新的实验技术和技术的引入,现在可以探索植物新陈代谢的细节。除了经常应用于气体交换或代谢组学分析的稳态测量之外,新方法,例如,基于13C标签,现在可以了解在现场或实验室中波动的环境条件下代谢浓度的变化。为了探索代谢物浓度的瞬态现象,动力学模型是一个有价值的工具。在这一章中,我们描述了使用Python软件包modelbase实现和构建植物代谢动力学模型的方法。作为一个例子,我们使用光呼吸通路的一部分。此外,我们展示了modelbase的其他功能,这些功能有助于探索动力学模型,从而可以揭示实验不容易获得的生物系统的信息。此外,我们将指出有关动力学模型数学背景的额外信息,以推动进一步的自我研究。
    Plant science has become more and more complex. With the introduction of new experimental techniques and technologies, it is now possible to explore the fine details of plant metabolism. Besides steady-state measurements often applied in gas-exchange or metabolomic analyses, new approaches, e.g., based on 13C labeling, are now available to understand the changes in metabolic concentrations under fluctuating environmental conditions in the field or laboratory. To explore those transient phenomena of metabolite concentrations, kinetic models are a valuable tool. In this chapter, we describe ways to implement and build kinetic models of plant metabolism with the Python software package modelbase. As an example, we use a part of the photorespiratory pathway. Moreover, we show additional functionalities of modelbase that help to explore kinetic models and thus can reveal information about a biological system that is not easily accessible to experiments. In addition, we will point to extra information on the mathematical background of kinetic models to give an impetus for further self-study.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    机器学习在低温电子显微镜(cryoEM)数据分析中的应用为cryoEM数据处理管道增加了一套有价值的工具。随着这些工具变得更容易获得和广泛使用,应评估其使用的影响。我们注意到机器学习图修改工具可以对冷冻EM密度产生不同的影响。从这个角度来看,我们评估了这些影响,以表明机器学习工具通常会提高生物大分子的密度,同时为配体产生不可预测的结果。这种不可预测的行为既体现在地图质量的定量指标中,也体现在修改后的地图的定性调查中。这里给出的结果突出了机器学习工具在cryoEM中的力量和潜力,同时也说明了未经审查使用它们的一些风险。
    The application of machine learning to cryogenic electron microscopy (cryoEM) data analysis has added a valuable set of tools to the cryoEM data processing pipeline. As these tools become more accessible and widely available, the implications of their use should be assessed. We noticed that machine learning map modification tools can have differential effects on cryoEM densities. In this perspective, we evaluate these effects to show that machine learning tools generally improve densities for biomacromolecules while generating unpredictable results for ligands. This unpredictable behavior manifests both in quantitative metrics of map quality and in qualitative investigations of modified maps. The results presented here highlight the power and potential of machine learning tools in cryoEM, while also illustrating some of the risks of their unexamined use.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    机器学习的进步使蛋白质结构的预测足够准确,可用于晶体学和低温电子显微镜数据的大分子结构测定。Phenix软件套件将AlphaFold预测集成到自动化管道中,该管道可以从氨基酸序列和数据开始,并自动执行模型构建和细化,以返回拟合到数据中的蛋白质模型。由于高效运行AlphaFold的陡峭技术要求,我们已经实现了一个Phenix-AlphaFoldWeb服务,该服务使所有Phenix用户都可以从官方1.21版本开始,从PhenixGUI远程运行AlphaFold预测。此Web服务将根据研究社区的使用方式以及Phenix的未来研究方向进行改进。
    Advances in machine learning have enabled sufficiently accurate predictions of protein structure to be used in macromolecular structure determination with crystallography and cryo-electron microscopy data. The Phenix software suite has AlphaFold predictions integrated into an automated pipeline that can start with an amino acid sequence and data, and automatically perform model-building and refinement to return a protein model fitted into the data. Due to the steep technical requirements of running AlphaFold efficiently, we have implemented a Phenix-AlphaFold webservice that enables all Phenix users to run AlphaFold predictions remotely from the Phenix GUI starting with the official 1.21 release. This webservice will be improved based on how it is used by the research community and the future research directions for Phenix.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蛋白质是众所周知的“变形者”,可将构象改变为功能。在晶体学中,晶体和所得的电子密度图内通常存在多个构象状态。然而,明确地将替代状态纳入模型以解开多构象合奏是具有挑战性的。我们之前报道过FLEXR工具,which,几分钟之内,自动将构象信号与噪声分离,并构建相应的,经常失踪,结构特征转化为多构象模型。为了使该方法广泛用于常规多构象构建,作为大分子晶体学计算工具包的一部分,我们为FLEXR提供了一个图形用户界面(GUI),设计为Coot1的插件。GUI实现将FLEXR模型与Coot中现有的验证和建模工具套件无缝连接。我们设想FLEXR将通过增加对多构象体建模方法的访问来帮助晶体学家,这将最终导致蛋白质数据库中蛋白质构象异质性的更好表示。反过来,对蛋白质构象景观的更深入的见解可能为生物学提供信息或为配体设计提供新的机会。该代码是开源的,可在GitHub上免费获得,网址为https://github.com/TheFischerLab/FLEXR-GUI。
    Proteins are well known \'shapeshifters\' which change conformation to function. In crystallography, multiple conformational states are often present within the crystal and the resulting electron-density map. Yet, explicitly incorporating alternative states into models to disentangle multi-conformer ensembles is challenging. We previously reported the tool FLEXR, which, within a few minutes, automatically separates conformational signal from noise and builds the corresponding, often missing, structural features into a multi-conformer model. To make the method widely accessible for routine multi-conformer building as part of the computational toolkit for macromolecular crystallography, we present a graphical user interface (GUI) for FLEXR, designed as a plugin for Coot 1. The GUI implementation seamlessly connects FLEXR models with the existing suite of validation and modeling tools available in Coot. We envision that FLEXR will aid crystallographers by increasing access to a multi-conformer modeling method that will ultimately lead to a better representation of protein conformational heterogeneity in the Protein Data Bank. In turn, deeper insights into the protein conformational landscape may inform biology or provide new opportunities for ligand design. The code is open source and freely available on GitHub at https://github.com/TheFischerLab/FLEXR-GUI.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    增加的日长和温暖的条件通过直接调节核phyB而反过来影响植物的生长,ELF3和COP1水平。下胚轴长度的定量测量是深入了解这个复杂的调控网络的关键。而类似的定量数据是许多植物生物学研究的基础。这里,我们探索数学建模的应用,特别是常微分方程(ODE),了解植物对这些环境线索的反应。我们为建设提供了全面的指导,模拟,并将这些模型拟合到数据中,利用质量作用定律研究分子物种的进化。介绍了这些模型的基本原理,强调它们在破译复杂的植物生理相互作用和测试假设方面的效用。这个简短的介绍将不允许没有数学背景的实验主义者在一夜之间运行他们自己的模拟,但这将帮助他们掌握建模原则,并与更多有理论倾向的同事交流。
    Increased day lengths and warm conditions inversely affect plant growth by directly modulating nuclear phyB, ELF3, and COP1 levels. Quantitative measures of the hypocotyl length have been key to gaining a deeper understanding of this complex regulatory network, while similar quantitative data are the foundation for many studies in plant biology. Here, we explore the application of mathematical modeling, specifically ordinary differential equations (ODEs), to understand plant responses to these environmental cues. We provide a comprehensive guide to constructing, simulating, and fitting these models to data, using the law of mass action to study the evolution of molecular species. The fundamental principles of these models are introduced, highlighting their utility in deciphering complex plant physiological interactions and testing hypotheses. This brief introduction will not allow experimentalists without a mathematical background to run their own simulations overnight, but it will help them grasp modeling principles and communicate with more theory-inclined colleagues.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在这项研究中,建立了岳州龙井茶不同品种和品质等级感官评分和理化成分的近红外定量预测模型。首先,L,a,b收集每个样品的颜色因子和漫反射光谱数据。随后,原始光谱经过预处理。选择变量的三种技术,汽车,BOSS,和SPA,用于提取最佳特征带。最后,从特征带提取的光谱数据与L,a和b颜色因子构建SVR和PLSR预测模型。实现对岳州龙井茶不同品种和等级的快速无损鉴别。结果表明,BOSS是感官评分和独特的咖啡因波长的最佳变量选择技术,汽车,然而,是儿茶素独特波长的最佳变量选择技术。此外,基于中层数据融合的非线性预测模型大大优于线性预测模型。对于感官评分的预测模型,儿茶素,还有咖啡因,相对百分比偏差(RPD)值分别为2.8、1.6和2.6,表明模型具有良好的预测能力。总之,利用近红外光谱和数据融合技术对岳州龙井茶的品质进行评价是可行的。
    In this study, NIR quantitative prediction model was established for sensory score and physicochemical components of different varieties and quality grades of Yuezhou Longjing tea. Firstly, L, a, b color factors and diffuse reflection spectral data are collected for each sample. Subsequently, the original spectrum is preprocessed. Three techniques for selecting variables, CARS, BOSS, and SPA, were utilized to extract optimal feature bands. Finally, the spectral data extracted from feature bands were fused with L, a and b color factors to build SVR and PLSR prediction models. enabling the rapid non-destructive discrimination of different varieties and grades of Yuezhou Longjing tea. The outcomes demonstrated that BOSS was the best variable selection technique for sensory score and the distinctive caffeine wavelengths, CARS, however, was the best variable selection technique for catechins distinctive wavelengths. Additionally, the middle-level data fusion-based non-linear prediction models greatly outperformed the linear prediction models. For the prediction models of sensory score, catechins, and caffeine, the relative percent deviation (RPD) values were 2.8, 1.6, and 2.6, respectively, suggesting the good predictive ability of the models. In conclusion, evaluating the quality of the five Yuezhou Longjing tea varieties using near-infrared spectroscopy and data fusion have proved as feasible.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    密度改性是通过任何实验定相方法为常规结构解决方案提供途径的标准步骤,单波长或多波长异常衍射是最流行的方法,以及将片段或不完整的模型扩展为完整的解决方案。在SHELXE的情况下,说明了密度修改对任一来源的起始图的影响。审查了程序可以运行的不同模式;这些包括鲜为人知的用途,例如读取外部相位值和权重或以Hendrickson-Lattman系数编码的相位分布。通常在SHELXE,初始阶段是根据实验数据计算的,从部分模型或地图中,或来自两种来源的组合。通过密度修改和扩展初始相位集,如果数据的分辨率和结构类型允许,聚丙氨酸示踪.作为从预测模型导出的阶段中系统地消除模型偏差的功能,可以设置trace以排除起始模型占用的区域。如果提供了序列,则该迹线现在包括延伸到γ位置或疏水性和芳香族侧链中,在每个跟踪周期中执行。一旦从这种轨迹计算出的结构因子与原始数据之间的相关系数超过30%,则表明该结构已得到解决,该序列停靠在所有模型构建周期中,并且如果地图支持它,则安装侧链。讨论了为提供完整模型而引入的跟踪算法的扩展。使用一组测试来评估定相性能的改进。
    Density modification is a standard step to provide a route for routine structure solution by any experimental phasing method, with single-wavelength or multi-wavelength anomalous diffraction being the most popular methods, as well as to extend fragments or incomplete models into a full solution. The effect of density modification on the starting maps from either source is illustrated in the case of SHELXE. The different modes in which the program can run are reviewed; these include less well known uses such as reading external phase values and weights or phase distributions encoded in Hendrickson-Lattman coefficients. Typically in SHELXE, initial phases are calculated from experimental data, from a partial model or map, or from a combination of both sources. The initial phase set is improved and extended by density modification and, if the resolution of the data and the type of structure permits, polyalanine tracing. As a feature to systematically eliminate model bias from phases derived from predicted models, the trace can be set to exclude the area occupied by the starting model. The trace now includes an extension into the gamma position or hydrophobic and aromatic side chains if a sequence is provided, which is performed in every tracing cycle. Once a correlation coefficient of over 30% between the structure factors calculated from such a trace and the native data indicates that the structure has been solved, the sequence is docked in all model-building cycles and side chains are fitted if the map supports it. The extensions to the tracing algorithm brought in to provide a complete model are discussed. The improvement in phasing performance is assessed using a set of tests.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • DOI:
    文章类型: Journal Article
    肝细胞癌(HCC)与高死亡率相关,特别是在亚洲人群中,慢性HBV感染是一个主要原因。准确预测死亡率可以帮助临床决策。我们的目标是(I)比较巴塞罗那临床肝癌分类(BCLC)分期的预测能力,中性粒细胞与淋巴细胞比率(NLR),和白蛋白-胆红素(ALBI)评分可预测短期死亡率(一年和两年),以及(ii)开发了一种与传统模型相比具有更高准确性的新型模型。这项研究招募了298名来自我们肝病科的连续HCC患者。通过受试者工作特征曲线下面积(AUROC)分析评估死亡率的预后值。建立了一个新的模型,并使用5倍交叉验证进行了内部验证,随后在100名患者的队列中进行外部验证。肝硬化的主要病因是乙型肝炎病毒(HBV),与肝癌患者的81.2%保留肝功能。观察到血红蛋白(Hb)和血清白蛋白水平存在显着差异,这反映了患者的营养状况,存活一年的病人和死亡的病人。与NLR相比,BCLC表现出更高的预测准确性,但与ALBI评分相比具有边缘优势。因此,结合BCLC的新颖模型,Hb,并开发了血清白蛋白,内部和外部验证,以及亚组敏感性分析。该模型对一年和两年死亡率的预测准确性明显高于传统的预后预测因子,AUROC值分别为0.841和0.805。小说“BCLC-营养模型”,其中包含BCLC,Hb,和血清白蛋白,与常用的预后评分相比,可以提高HCC患者短期死亡率的预测准确性.这强调了营养在HCC患者管理中的重要性。
    Hepatocellular carcinoma (HCC) is associated with high mortality, especially in Asian populations where chronic HBV infection is a major cause. Accurate prediction of mortality can assist clinical decision-making. We aim to (i) compare the predicting ability of Barcelona Clinic Liver Cancer classification (BCLC) stage, neutrophil-to-lymphocyte ratio (NLR), and Albumin-Bilirubin (ALBI) score in predicting short-term mortality (one- and two-year) and (ii) develop a novel model with improved accuracy compared to the conventional models. This study enrolled 298 consecutive HCC patients from our hepatology department. The prognostic values for mortality were assessed by area under the receiver operating characteristic curve (AUROC) analysis. A novel model was established and internally validated using 5-fold cross-validation, followed by external validation in a cohort of 100 patients. The primary etiology of cirrhosis was hepatitis B virus (HBV), with 81.2% of HCC patients having preserved liver function. Significant differences were observed in hemoglobin (Hb) and serum albumin levels, which reflect patients\' nutrition status, between patients who survived for one year and those who died. BCLC exhibited superior predictive accuracy compared to NLR but had borderline superiority to the ALBI score. Therefore, a novel model incorporating BCLC, Hb, and serum albumin was developed, internally and externally validated, as well as subgroup sensitivity analysis. The model exhibited significantly higher predictive accuracy for one- and two-year mortality than conventional prognostic predictors, with AUROC values of 0.841 and 0.805, respectively. The novel \"BCLC-Nutrition Model\", which incorporates BCLC, Hb, and serum albumin, may provide improved predictive accuracy for short-term mortality in HCC patients compared to commonly used prognostic scores. This emphasizes the importance of nutrition in the management of HCC patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    乙型肝炎病毒(HBV)感染是全球公共卫生问题。在慢性感染期间,HBV小表面抗原以非感染性球形亚病毒颗粒(SVPs)的形式大量过量表达,具有很强的免疫原性。迄今为止,试图理解HBV球形SVP的结构被限制在12-30µ,关于其结构的结论相互矛盾。我们使用低温电子显微镜(cryo-EM)和3D图像重建将HBV球形SVP解决为6.3µ。这里,我们提出了一种将AlphaFold2预测与中等分辨率低温EM密度图相结合的扩展协议,以构建可靠的3D模型.该协议利用在低温EM社区中常规使用的多个软件包。工作流程包括3D模型预测,模型评估,刚体接头,柔性接头,真实空间细化,模型验证,和模型调整。最后,所描述的协议也可以应用于高分辨率的低温EM数据集(2-4µ)。
    Hepatitis B virus (HBV) infection is a global public health concern. During chronic infection, the HBV small-surface antigen is expressed in large excess as non-infectious spherical subviral particles (SVPs), which possess strong immunogenicity. To date, attempts at understanding the structure of HBV spherical SVP have been restricted to 12-30 Å with contradictory conclusions regarding its architecture. We have used cryo-electron microscopy (cryo-EM) and 3D image reconstruction to solve the HBV spherical SVP to 6.3 Å. Here, we present an extended protocol on combining AlphaFold2 prediction with a moderate-resolution cryo-EM density map to build a reliable 3D model. This protocol utilizes multiple software packages that are routinely used in the cryo-EM community. The workflow includes 3D model prediction, model evaluation, rigid-body fitting, flexible fitting, real-space refinement, model validation, and model adjustment. Finally, the described protocol can also be applied to high-resolution cryo-EM datasets (2-4 Å).
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    将准确的蛋白质模型构建为中等分辨率(3-5)的低温电子显微镜(cryo-EM)图具有挑战性且容易出错。我们已经开发了MEDIC(Cryo-EM中的模型错误检测),一个强大的统计模型,通过将局部密度拟合与深度学习衍生的结构信息相结合,识别构建到低温EM图中的蛋白质结构中的局部骨架误差。MEDIC在一组28个结构上进行了验证,这些结构随后被解决为更高的分辨率,我们以68%的精度和60%的召回率识别低分辨率和高分辨率结构之间的差异。我们还使用此模型修复12个沉积结构中的100多个错误,并以80%的精度和60%的召回率识别4个精细AlphaFold预测中的错误。随着建模者更频繁地使用深度学习预测作为改进和重建的起点,MEDIC处理手工构建和机器学习方法中结构错误的能力使其成为结构生物学家的强大工具。
    Building accurate protein models into moderate resolution (3-5 Å) cryoelectron microscopy (cryo-EM) maps is challenging and error prone. We have developed MEDIC (Model Error Detection in Cryo-EM), a robust statistical model that identifies local backbone errors in protein structures built into cryo-EM maps by combining local fit-to-density with deep-learning-derived structural information. MEDIC is validated on a set of 28 structures that were subsequently solved to higher resolutions, where we identify the differences between low- and high-resolution structures with 68% precision and 60% recall. We additionally use this model to fix over 100 errors in 12 deposited structures and to identify errors in 4 refined AlphaFold predictions with 80% precision and 60% recall. As modelers more frequently use deep learning predictions as a starting point for refinement and rebuilding, MEDIC\'s ability to handle errors in structures derived from hand-building and machine learning methods makes it a powerful tool for structural biologists.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号