Quantitative Structure-property Relationship

定量结构 - 性质关系
  • 文章类型: Journal Article
    在碳中和碳峰值的背景下,分子管理已成为石化行业关注的焦点。实现分子管理的关键是分子重建,这依赖于快速准确的石油属性计算。专注于石脑油,我们提出了一种新颖的性质预测模型构建程序(MD-NP),采用分子动力学模拟从实际分析数据中收集性质和伽马分布,以计算模拟混合物的摩尔分数。我们通过分子动力学模拟计算了在273K至300K范围内的348组混合物性质数据。分子特征提取基于分子描述符。除了基于开源工具包(RDKit和Mordred)的描述符之外,我们设计了12个石脑油知识(NK)描述符,重点是石脑油。三种机器学习算法(支持向量回归、应用极限梯度增强和人工神经网络)并进行比较,以建立用于预测石脑油密度和粘度的模型。Mordred和NK描述符+支持向量回归算法实现了密度的最佳性能。选择的RDKFp和NK描述符+人工神经网络算法实现了粘度的最佳性能。使用消融研究,T,P_w和CC(C)C是NK中的三个有效描述符,可以提高属性预测模型的性能。MDs-NP有可能扩展到更多的属性以及更复杂的石油系统。MDs-NP模型可用于快速分子重建,以促进数据驱动模型的构建和石化过程的智能转换。
    In the context of carbon neutrality and carbon peaking, molecular management has become a focus of the petrochemical industry. The key to achieving molecular management is molecular reconstruction, which relies on rapid and accurate calculation of oil properties. Focusing on naphtha, we proposed a novel property prediction model construction procedure (MDs-NP) employing molecular dynamics simulations for property collections and gamma distribution from real analytical data for calculating mole fractions of simulation mixtures. We calculated 348 sets of mixture properties data in the range of 273 K-300 K by molecular dynamics simulations. Molecular feature extraction was based on molecular descriptors. In addition to descriptors based on open-source toolkits (RDKit and Mordred), we designed 12 naphtha knowledge (NK) descriptors with a focus on naphtha. Three machine learning algorithms (support vector regression, extreme gradient boosting and artificial neural network) were applied and compared to establish models for the prediction of the density and viscosity of naphtha. Mordred and NK descriptors + support vector regression algorithm achieved the best performance for density. The selected RDKFp and NK descriptors + artificial neural network algorithm achieved the best performance for viscosity. Using ablation studies, T, P_w and CC(C)C are three effective descriptors in NK that can improve the performance of the property prediction models. MDs-NP has the potential to be extended to more properties as well as more-complex petroleum systems. The models from MDs-NP can be used for rapid molecular reconstruction to facilitate construction of data-driven models and intelligent transformation of petrochemical processes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    自燃温度(AIT)是设计火灾和爆炸安全措施的重要指标之一。因此,在这项研究中,定量结构-性质关系方法用于基于分子结构信息的三元混合液体的AIT预测。使用Mordred软件计算并过滤最佳分子描述符。提出了十二种混合规则来计算混合物的分子描述符。建立了二元液体混合物AIT值的预测模型,使用反向传播神经网络(BPNN)和一维卷积神经网络(1DCNN)进行验证和评估。使用shapley加性解释方法解释了模型中单个分子描述符与AIT之间的相对贡献以及正相关和负相关。结果表明,采用混合规则1的BPNN和1DCNN模型具有最好的拟合能力,稳定性和预测能力。训练集中BPNN和1DCNN模型的决定系数分别为0.996和0.992,均方根误差分别为3.613℃和5.284℃,平均绝对误差为2.483°C和4.144°C,纳什效率系数分别为0.996和0.992,willmott指数分别为0.999和0.998。和相对贡献的前三个分子描述符的值,SssCH2,SsOH和SsCH3与AIT值呈负相关。BPNN和1DCNN模型为预测三元混合液体AIT提供了准确可靠的方法。
    Auto-ignition temperature (AIT) is one of the crucial exponents in the design of fire and explosion safety measures. Therefore, in this study, quantitative structure-property relationship approach was used to predict the AIT of ternary hybrid liquids based on molecular structure information. The optimal molecular descriptors were calculated and filtered using Mordred software. Twelve mixing rules were proposed for calculating molecular descriptors of mixtures. A prediction model for the AIT value of binary liquid mixtures was developed, validated and evaluated using a back propagation neural network (BPNN) and a one-dimensional convolutional neural network (1DCNN). The relative contribution and positive and negative correlations between individual molecular descriptors and AIT in the model were interpreted using the shapley additive explanations method. The results show that BPNN and 1DCNN models using mixing rule 1 have the best fitting ability, stability and prediction ability. The determination coefficient of the BPNN and 1DCNN models in the training set were 0.996 and 0.992, the root mean square errors were 3.613 °C and 5.284 °C, the mean absolute errors were 2.483 °C and 4.144 °C, the nash efficiency coefficient was 0.996 and 0.992, respectively, the willmott index was 0.999 and 0.998. and the values of the top three molecular descriptors of relative contribution, SssCH2, SsOH and SsCH3, were negatively correlated with the AIT values. The BPNN and 1DCNN models provide an accurate and reliable method for predicting ternary mixing liquid AIT.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    药物在水和有机溶剂中的溶解度是许多制药过程中的关键因素。近年来,一种称为低共熔溶剂(DESs)的新型溶剂已被开发为药物的有用溶剂。氯化胆碱-甘油/尿素(ChCl-G/U)系统是公认的一类新型环保溶剂的DES。这种类型的DES在水中的一种最新应用是药物的增溶。
    本研究旨在研究某些药物在ChCl-G/U中的溶解度。此外,研究的DES的增溶机制,提出了增溶的定量结构-性质关系(QSPR)模型。
    使用摇瓶方法研究了13种药物在ChCl-G/U系统的水溶液中的溶解度。在所研究系统的10%和50%质量分数下进行研究。使用QSPR模型,使用多元线性回归模型建立在ChCl-G/U水混合物存在下研究化合物溶解之间的数学关系。
    在向水溶液中加入ChCl-G/U时,化合物的溶解度显示出显著增加。根据获得的数据,使用增溶率和结构描述符开发QSPR模型。
    实验数据证明了利用ChCl-G/U作为介质增强难溶性药物在水中的溶解度的潜力。ChCl-G/U水混合物中溶质的增溶可能与药物的结构性质相关。此外,在ChCl-U中溶液的最终pH是使用该系统进行溶解时必须考虑的关键因素。
    UNASSIGNED: The solubility of drugs in water and organic solvents is a crucial factor in numerous pharmaceutical processes. In recent years, a new type of solvent called deep eutectic solvents (DESs) has been developed as a useful solvent for drugs. Choline chloride-glycerol/urea (ChCl-G/U) systems are DESs recognized as a novel category of environmentally friendly solvents. One recent application of this type of DES in water is the solubilization of drugs.
    UNASSIGNED: This study aimed to investigate the solubility of certain drugs in ChCl-G/U. In addition, the solubilization mechanisms of the DESs studied, and quantitative structure-property relationship (QSPR) models for solubilization were proposed.
    UNASSIGNED: The solubility of 13 drugs in an aqueous solution of the ChCl-G/U system was investigated using the shake flask method. The study was conducted at 10% and 50% mass fractions of the studied systems. Multiple linear regression models were used to develop mathematical relationships between the solubilization of the studied compounds in the presence of ChCl-G/U + water mixture using QSPR models.
    UNASSIGNED: The solubility of the compounds showed a significant increase upon adding ChCl-G/U to the aqueous solutions. Based on the data obtained, QSPR models were developed using solubilization ratio and structural descriptors.
    UNASSIGNED: The experimental data demonstrates the potential of utilizing ChCl-G/U as a medium to enhance the solubility of poorly soluble drugs in water. Solubilization of solutes in ChCl-G/U + water mixtures could be correlated with the structural properties of drugs. Moreover, the final pH of the solutions in ChCl-U is a critical factor that must be considered when using this system for solubilization.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    拓扑指数将化学结构与化学反应性联系起来,物理性质,和生物活性。定量结构-活性关系(QSPR)是为各种类型的化学反应性的相关性提出的数学模型,生物活性,和具有拓扑指数/熵的物理属性。在这篇文章中,我们提出了基于边缘熵的末端顶点的ve度与苯衍生物的理化性质之间的QSPR。我们设计了一种基于Maple的算法来计算熵。使用SPSS分析两者的关系。我们已经表明,生理化学性质,如临界压力,亨利的法律,临界温度,Gibb的能量,logP,临界体积,摩尔折射率可以通过熵来预测。所有结果均为高度正和显著。兰迪奇,Balaban,重新定义的第三个萨格勒布熵显示出与理化性质的最佳关系。由RamaswamyH.Sarma沟通。
    Topological indices relate chemical structure to chemical reactivity, physical properties, and biological activity. Quantitative structure-activity relationships (QSPR) are mathematical models proposed for the correlation of various types of chemical reactivity, biological activity, and physical properties with topological indices/entropies. In this article, we have proposed the QSPR between the ve-degree of end vertices of edge based entropies and the physiochemical properties of benzene derivatives. We have designed a Maple-based algorithm for the computation of entropies. The relationship was analyzed using SPSS. We have shown that the physiochemical properties such as critical pressure, Henry\'s law, critical temperature, Gibb\'s energy, logP, critical volume, and molar refractivity can be predicted by entropies. All the results were highly positive and significant. The Randić, Balaban, and redefined third Zagreb entropies showed the best relations with physiochemical properties.Communicated by Ramaswamy H. Sarma.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:由芳族硝基化合物的结构不稳定性引起的反应的热危害是化学过程安全领域的主要关注点,也是诸如火灾和爆炸之类的重大热失控(TR)事故的主要原因之一。其中,自加速分解温度(SADT),作为一个重要的参数,已被广泛用于评估芳族硝基化合物在实际储存和运输过程中的热危害。然而,控制温度(CT)和紧急温度(ET),依赖于SADT并与之相关,在以前的研究中很少报道。在这项工作中,基于与27个单/二元芳族硝基化合物的稳定结构相对应的分子描述符,构建了CT和ET的多元线性回归(MLR)和人工神经网络(ANN)模型,结合先进的绝热加速量热实验和定量结构-性质关系(QSPR)。筛选出具有显著贡献的最优描述符子集,预测能力,并利用内部和外部验证参数评估了四类模型的稳健性,最后,选取两类参数(R2和ARE)作为主要指标进行综合比较分析。结果表明,4种模型均能较好地拟合实验数据。在此期间,人工神经网络模型的精度略高于MLR模型,两种模式(线性和非线性)下的QSPR模型在预测能力上更倾向于ET。在简化计算过程、实现参数快速预测的基础上,本研究有望为安全运行等工程应用提供技术支持,物质的安全储存和运输,和化学工业的应急响应。
    方法:在这项工作中,我们通过ARC和AKTS测试和计算了27个单/二元芳族硝基化合物的热安全参数,并进一步使用PubChem数据库和Gaussian09软件程序来获得并优化其相应的分子结构。几何优化过程在B3LYP级和6-31+G(d,P)基础设置,而相同的功能和基础集用于振动分析。OpenBabel工具箱和ChemDES平台用于转换编码和描述符计算。最后,使用IBMSPSSStatistics24和MATLAB软件构建MLR模型和ANN模型,分别。
    BACKGROUND: The thermal hazard of reactions caused by the structural instability of aromatic nitro compounds is a major concern in the field of chemical process safety and one of the main causes of major thermal runaway (TR) accidents such as fire and explosion. Among them, the self-accelerating decomposition temperature (SADT), as an important parameter, has been widely used to evaluate the thermal hazards of aromatic nitro compounds in actual storage and transportation processes. However, the control temperature (CT) and emergency temperature (ET), which depend on and are associated with SADT, have been rarely reported in previous studies. In this work, multiple linear regression (MLR) and artificial neural network (ANN) models for CT and ET were constructed based on the molecular descriptors corresponding to the stable structures of 27 monadic/binary aromatic nitro compounds, combined with advanced adiabatic accelerating calorimetric experiments and quantitative structure-property relationship (QSPR). The optimal subset of descriptors with significant contributions was screened out while the fit, predictive ability, and robustness of the four types of models were evaluated with internal and external validation parameters, and finally, two types of parameters (R2 and ARE) were selected as the main indicators for a comprehensive comparative analysis. The results show that the four models fit the experimental data well. During this period, the accuracy of ANN models is slightly higher than that of MLR models, and the QSPR models under the two modes (linear and nonlinear) are more inclined toward ET in prediction ability. Based on simplifying the calculation process and realizing rapid parameter prediction, this study is expected to provide technical support for engineering applications such as safe operation, safe storage and transportation of substances, and emergency response in the chemical industry.
    METHODS: In this work, we tested and calculated the thermal safety parameters of 27 monadic/binary aromatic nitro compounds by ARC and AKTS and further used the PubChem database and Gaussian 09 software program to obtain and optimize their corresponding molecular structures. The geometric optimization process adopts DFT on the B3LYP level and the 6-31 + G(d, p) basis set, while the same functional and basis set was used for vibration analysis. The OpenBabel toolbox and ChemDES platform were used for transformation coding and descriptor calculation. Finally, IBM SPSS Statistics 24 and MATLAB software were used to construct MLR models and ANN models, respectively.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在这项研究中,利用机器学习的力量来探索基于肽的水凝胶的分子结构与其粘塑性特性之间的联系。根据文献中显示水凝胶功能性的基于肽的材料的规定的完整列表尝试化合物的选择。在这种追求中,我们考虑了一组完整的分子描述符和指纹图谱-对于所分析的每个基于肽的结构,我们考虑了大小为17,968的条目.材料的弹性和粘性模量响应在[0.1-100](rad/s)范围内的宽频谱上映射。总的来说,结果表明,基于肽的水凝胶的频率依赖性机械响应与其(间)分子属性在统计上相关,如电荷,第一电离电势(或等效的电负性),表面积,化学底物的数量,债券类型,和分子间的相互作用。测量了几种(有监督的)软计算技术的性能,用于我们的定量结构属性关系模型。此外,测试了将我们的数据库映射到新的主成分系统的假设,通过使用无监督的方法,这提高了预测精度。就意义而言,本文提供了频率相关的弹性模量和粘性模量的第一份报告,用于具有水凝胶功能的一组70个基于肽的制剂。由RamaswamyH.Sarma沟通。
    In this study, the power of machine learning was harnessed to probe the link between molecular structures of peptide-based hydrogels and their viscoplastic properties. The selection of compounds was attempted in accordance with the prescribed full list of peptide-based materials exhibiting hydrogel functionality in the literature. In this pursuit, a complete set of molecular descriptors and fingerprints was considered - accounting for an entry of size 17,968 for each peptide-based structure analyzed. The elastic and viscous moduli response of materials were mapped over a wide frequency spectrum in the range [0.1-100] (rad/s). In general, the results indicate that the frequency-dependent mechanical response of peptide-based hydrogels is statistically correlated with its (inter)molecular attributes, such as charge, first ionization potential (or equivalently electronegativity), surface area, number of chemical substrates, bond type, and intermolecular interactions. The performance of several (supervised) soft computing techniques was measured, for our quantitative structure property relationships model. In addition, the hypothesis of mapping our databank to a new system of principal components was tested, by using an unsupervised methodology, which resulted in enhancement of the prediction accuracy. In terms of significance, the present article provides the first report of frequency-dependent elastic and viscous moduli, for a set of 70 peptide-based formulations with hydrogel functionality.Communicated by Ramaswamy H. Sarma.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    由于其高灵敏度,荧光颜料的开发是几个研究领域中感兴趣的领域。在当前的研究中-八个已知和三个新的N,使用Claisen-Schmidt反应以良好的产率合成了N-二甲基氨基-查耳酮(12a-k)。对于每个分子系统,光物理性质,包括最大吸收波长(λ吸收),摩尔吸光系数(ε),最大激发波长(λ激发),最大发射波长(λ发射),斯托克斯位移(Δλ),荧光量子产率(Φfl),荧光寿命(τfl),辐射和非辐射速率常数(kR和kNR,分别)进行了评估。根据存在于各化合物上的取代基分析这些性质中的每一个的变化。为了将合成化合物的化学结构与其光物理性质联系起来,应用Hansch分析(2D-QSPR)。根据汉施分析,我们发现了与分子轨道及其衍生物的能量有关的不同的光物理性质(最高占据分子轨道-HOMO,最低未占用分子轨道-LUMO,LUMO-HOMO-ΔLH之间的差异,化学势-µ,硬度η,柔软度-S,和亲电子全局指数-ω)以及原子C5、Cα上的原子电荷,Cβ,和CO。这种类型的分析的应用使得理解并随后设计具有确定的光物理性质的新分子成为可能。最后,这些化合物被用作荧光色素,对乳腺癌细胞进行活细胞成像,获得化合物12a作为期许替代品。
    The development of fluorescent pigments is an area of interest in several research fields due to their high sensitivity. In the current study-eight known and three new N,N-dimethylamino-chalcones (12a-k) were synthesized with good yields using the Claisen-Schmidt reaction. For each molecular system, the photophysical properties, including the maximum absorption wavelength (λAbsorption), molar absorption coefficient (ε), maximum excitation wavelength (λExcitation), maximum emission wavelength (λEmission), Stokes Shift (Δλ), fluorescence quantum yield (Φfl), fluorescence lifetime (τfl), radiative and non-radiative rate constants (kR and kNR, respectively) were evaluated. Variations in each of these properties were analyzed depending on the substituents present on each compound. To relate the chemical structures of the synthesized compounds to their photophysical properties, Hansch analysis (2D-QSPR) was applied. As a result of Hansch analysis, we found different photophysical properties related to molecular orbitals and the energy of their derivatives (Highest Occupied Molecular Orbital-HOMO, Lowest Unoccupied Molecular Orbital-LUMO, Difference between LUMO-HOMO-ΔLH, Chemical potential-µ, Hardness-η, Softness-S, and electrophilic global index-ω) as well as to the atomic charges on atoms C5, Cα, Cβ, and CO. The application of this type of analysis has made it possible to understand and subsequently design new molecules with defined photophysical properties. Finally, the compounds were use as fluorescent pigment to get living cell imaging on breast cancer cells, obtaining the compound 12a as promissory alternative.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    使用先前由B3LYP/6-311G(d,p)//DFTB用于2263个sp3C-H共价键。最好的属性集包括碳原子的114个描述符(核原子和环描述符周围的5个球体中的原子类型计数)。优化后的模型预测了由224个键组成的独立测试集的DFT计算的BDE,MAE=2.86kcal/mol。来自iBonD数据库(http://ibond。南开。edu.cn)是由RF预测的,其MAE适中(5.36kcal/mol),但相对于实验能量的R2相对较高(0.75)。探索了一种预测方案,该方案通过在实验数据的附加存储器中观察到的k个最近邻居(KNN)的平均偏差来校正RF预测。对于145个键的独立测试集和相应的实验键能,校正后的预测达到MAE=2.22kcal/mol。
    Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6-311++G(d,p)//DFTB for 2263 sp3C-H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres around the kernel atom and ring descriptors). The optimized model predicted the DFT-calculated BDE of an independent test set of 224 bonds with MAE=2.86 kcal/mol. A new data set of 409 bonds from the iBonD database (http://ibond.nankai.edu.cn) was predicted by the RF with a modest MAE (5.36 kcal/mol) but a relatively high R2 (0.75) against experimental energies. A prediction scheme was explored that corrects the RF prediction with the average deviation observed for the k nearest neighbours (KNN) in an additional memory of experimental data. The corrected predictions achieved MAE=2.22 kcal/mol for an independent test set of 145 bonds and the corresponding experimental bond energies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    鉴于实验测定的难度,定量结构-性质关系(QSPR)和深度学习(DL)是预测化合物理化性质的重要工具。在本文中,偏最小二乘(PLS),遗传函数逼近(GFA),利用深度神经网络(DNN)预测SE-52和DB-5固定相PAHs的Lee保留指数(Lee-RI)。四个分子描述符,分子量(MW),药物相似性的定量估计(QED),原子电荷加权负表面积(Jurs_PNSA_3),选择相对负电荷(Jurs_RNCG)构建基于遗传算法的回归模型。对于SE-52,PLS模型显示出最佳的预测能力,其次是DNN和GFA。相对误差(RE),均方根误差(RMSE),最佳PLS回归模型的回归系数(R2)为1.228%,5.407和0.980。对于DB-5,DNN模型显示出最佳的预测能力,其次是GFA和PLS。RE,DB-5-1和DB-5-2的最佳DNN回归模型的RMSE和R2为1.058%,4.325%,0.976%,0.821%,3.795%,和0.970%,分别。三个回归模型不仅表现出良好的预测能力,同时也突出了模型的稳定性和延展性。
    Given the difficult of experimental determination, quantitative structure-property relationship (QSPR) and deep learning (DL) provide an important tool to predict physicochemical property of chemical compounds. In this paper, partial least squares (PLS), genetic function approximation (GFA), and deep neural network (DNN) were used to predict the Lee retention index (Lee-RI) of PAHs in SE-52 and DB-5 stationary phases. Four molecular descriptors, molecular weight (MW), quantitative estimate of drug-likeness (QED), atomic charge weighted negative surface area (Jurs_PNSA_3), and relative negative charge (Jurs_RNCG) were selected to construct regression models based on genetic algorithm. For SE-52, PLS model showed best prediction power, followed by DNN and GFA. The relative error (RE), root mean square error (RMSE), and regression coefficient (R2 ) of best PLS regression model are 1.228%, 5.407, and 0.980. For DB-5, DNN model showed best prediction power, followed by GFA and PLS. The RE, RMSE and R2 of best DNN regression model for DB-5-1 and DB-5-2 are 1.058%, 4.325%, 0.976%, 0.821%, 3.795%, and 0.970%, respectively. The three regression models not only show good predictive ability, but also highlight the stability and ductility of the models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    由于水中的各种有机微污染物而导致的日益严重的环境问题导致寻找合适的其他水处理方法。获得大量和各种污染物的实验数据将消耗大量的时间以及经济和生态资源。另一种方法是预测定量结构-性质关系(QSPR)建模,它建立了分子的结构特性与生物之间的相关性,物理,或化学性质。因此,在这项研究中,已使用广泛的验证技术和统计测试进行了QSPR建模,以研究臭氧氧化对有机微污染物降解性的结构影响。与大多数其他研究相比,基础数据集——92个有机分子的速率常数——是在定义实验参数的标准化条件下获得的。QSPR建模是使用用于描述符计算的软件PaDEL和用于建模过程的QSARINS的组合来执行的,该过程符合适用的QSAR/QSPR模型的所有五个OECD要求。使用多准则决策工具选择最终模型,以根据所有计算的统计质量参数评估模型质量。该模型包括10个选定的描述符和指纹,并显示出良好的回归能力,预测能力,和稳定性(R²=0.8221,CCCtr=0.9024,Q²loo=0.7436,R²ext=0.8420,Q²F1=0.8104)。定义了QSPR模型的适用域,并将所选模型描述符的解释与先前的实验研究联系起来。将可解释描述符的显着影响放入实验环境中,并与以前的研究和模型进行比较。例如,作为分子大小和极化性的量度的摩尔折射率和重要子结构如甲酰胺基团的出现似乎降低了去除速率常数。孤立电子进入共振的贡献以及稠环的出现被确定为通过臭氧化增加微污染物降解性的影响。
    The increasing environmental problems due to various organic micropollutants in water cause the search of suitable additional water treatment methods. Gaining experimental data for the large amount and variety of pollutants would consume a lot of time as well as economic and ecologic resources. An alternative approach is predictive quantitative structure-property relationship (QSPR) modeling, which establishes a correlation between the structural properties of a molecules with a biological, physical, or chemical property. Therefore, in this study, QSPR modeling has been conducted using extensive validation techniques and statistical test to investigate the structural influence on the degradability of organic micropollutants with ozonation. In contrast to most of the other studies, the underlying dataset - rate constants for 92 organic molecules - were obtained under standardized conditions with defined experimental parameters. QSPR modeling was executed using a combination of the software PaDEL for descriptor calculation and QSARINS for the modeling process respecting all five OECD-requirements for applicable QSAR/QSPR-models. The final model was selected using a multi-criteria decision-making tool to evaluate the model quality based on all calculated statistical quality parameters. The model included 10 selected descriptors and fingerprints and showed good regression abilities, predictive power, and stability (R² = 0.8221, CCCtr = 0.9024, Q²loo = 0.7436, R²ext = 0.8420, Q²F1 = 0.8104). The applicability domain of the QSPR model was defined and an interpretation of selected model descriptors has been connected to previous experimental studies. A significant influence of the interpretable descriptors was put into experimental context and compared with previous studies and models. For example, the molar refractivity as a measure of size and polarizability of a molecule and the occurrence of important substructures such as a formamide group seem to decrease the removal rate constant. The contribution of lone electrons entering into resonance as well as the occurrence of fused rings were identified as influences for the increase of the degradability of micropollutants by ozonation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号