digital soil mapping

数字土壤制图
  • 文章类型: English Abstract
    定量分析土壤盐渍化影响因素的空间非平稳特征及其空间分布预测,对于合理利用滨海盐渍土资源和制定当地防治措施具有重要意义。在这项研究中,东营市河口区,山东省,被用作研究区域,采用经典统计方法对土壤盐渍化状况进行描述性统计。利用空间自相关理论探讨研究区土壤盐渍化的整体和局部空间结构特征。选择了与土壤盐分有关的影响因素,和多元线性回归(MLR),地理加权回归(GWR),采用多尺度地理加权回归(MGWR)方法对研究区土壤盐分空间分布进行建模和预测,分析不同影响因子对土壤盐分影响的空间异质性。结果表明:①研究区土壤盐分平均值为5.84g·kg-1,表明盐渍化严重,全局Moran\sI指数为0.19(P<0.00),具有明显的空间聚集特征。②在三个模型中,MGWR模型的建模精度最高。与MLR模型相比,GWR和MGWR的Radj2分别提高了0.05和0.07,RSS分别下降210.13和179.95。③MGWR回归结果表明,土壤盐分的空间分布主要受中部土壤盐分的影响,土壤粘粒含量,从不同影响因素的标准化回归系数的平均值和植被覆盖。不同影响因子对土壤盐渍化具有显著的空间非平稳特征。④MGWR土壤盐分空间分布预测结果表明,土壤盐分高(≥6g·kg-1)区域主要分布在研究区北部,从海岸到内部的总体空间趋势是减少的。研究结果可为利用MGWR进行县域及更大尺度土壤盐渍化影响因素的分析和预测作图提供参考。
    Quantitative analysis of the spatial non-stationary characteristics of soil salinization influencing factors and the prediction of its spatial distribution are of great significance for the rational use of coastal saline soil resources and the formulation of local prevention and control measures. In this study, the Hekou District of Dongying City, Shandong Province, was used as the study area, and the descriptive statistics of soil salinization status were conducted using classical statistical methods. Spatial autocorrelation theory was used to explore the characteristics of global and local spatial structure of soil salinization in the study area. Influential factors related to soil salinity were selected, and multivariate linear regression (MLR), geographically weighted regression (GWR), and multi-scale geographically weighted regression (MGWR) methods were used to model and predict the spatial distribution of soil salinity in the study area and to analyze the spatial heterogeneity of the effects of different influencing factors on soil salinity. The results showed that: ① The mean value of soil salinity in the study area was 5.84 g·kg-1, indicating severe salinization, with a global Moran\'s I index of 0.19 (P<0.00) and obvious spatial aggregation characteristics. ② Among the three models, the MGWR model had the highest modeling accuracy. Compared with that of the MLR model, the Radj2 of GWR and MGWR improved by 0.05 and 0.07, respectively, and the RSS decreased by 210.13 and 179.95, respectively. ③ The results of MGWR regression showed that the spatial distribution of soil salinity appeared to be mainly affected by the middle soil salinity, soil clay content, and vegetation cover from the mean values of standardized regression coefficients of different influencing factors. Different influencing factors had significant spatial non-stationary characteristics on soil salinization. ④ The results of the spatial distribution prediction of soil salinity in MGWR showed that the areas of high soil salinity (≥6 g·kg-1) were mainly distributed in the northern part of the study area, with an overall spatial trend of decreasing from the coast to the interior. The results of the study can be used as a reference for the analysis and predictive mapping of factors affecting soil salinization in the county and on a larger scale using MGWR.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    土壤有机碳(SOC)在全球碳循环和碳固存中起着至关重要的作用。支持需要全面了解其分布和控制。本研究使用深度学习方法探讨了各种协变量对局部(高达1.25km)和大陆(美国)尺度上SOC空间分布的重要性。我们的发现强调了地形属性在预测地形SOC浓度分布中的重要作用,在局部尺度上贡献了大约三分之一的整体预测。在大陆尺度上,在预测SOC分布方面,气候仅比地形重要1.2倍,而在局部尺度上,地形的结构模式比气候和植被重要14倍和2倍,分别。我们强调地形属性,虽然在所有尺度上都是SOC分布的组成部分,是具有明确空间排列信息的局部尺度上更强的预测因子。虽然这项观察性研究没有评估因果机制,尽管如此,我们的分析还是提出了一个关于SOC空间分布的细微差别的观点,这表明了当地和大陆尺度上SOC的不同预测因子。从这项研究中获得的见解对改进的SOC映射具有重要意义,决策支持工具,和土地管理策略,协助制定有效的碳封存计划并加强缓解气候变化的努力。
    Soil organic carbon (SOC) plays a vital role in global carbon cycling and sequestration, underpinning the need for a comprehensive understanding of its distribution and controls. This study explores the importance of various covariates on SOC spatial distribution at both local (up to 1.25 km) and continental (USA) scales using a deep learning approach. Our findings highlight the significant role of terrain attributes in predicting SOC concentration distribution with terrain, contributing approximately one-third of the overall prediction at the local scale. At the continental scale, climate is only 1.2 times more important than terrain in predicting SOC distribution, whereas at the local scale, the structural pattern of terrain is 14 and 2 times more important than climate and vegetation, respectively. We underscore that terrain attributes, while being integral to the SOC distribution at all scales, are stronger predictors at the local scale with explicit spatial arrangement information. While this observational study does not assess causal mechanisms, our analysis nonetheless presents a nuanced perspective about SOC spatial distribution, which suggests disparate predictors of SOC at local and continental scales. The insights gained from this study have implications for improved SOC mapping, decision support tools, and land management strategies, aiding in the development of effective carbon sequestration initiatives and enhancing climate mitigation efforts.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    可持续的土壤资源管理依赖于可靠的土壤信息,通常来自“传统土壤数据”或新旧土壤数据的组合。然而,协调不同时期收集的土壤数据的任务在文献中仍未得到很大程度的探索。应对这一挑战需要将时间维度纳入时空土壤研究的数学和统计模型。这项研究旨在创建一个全面的框架,以协调不同时间的土壤数据。我们评估了历史和最近土壤数据的整合,从4到48岁的数据,利用土壤数据最近度分析。为了实现这一点,我们引入了一个“数据年龄”属性,计算土壤调查年份与现在之间的时间差(例如,2022年)。我们应用了三种机器学习模型-决策树(DT),随机森林(RF),梯度提升(GBM)-包含6339个站点和28,149个深度协调层的数据集。结果一致证明了跨模型的强劲性能,RF表现优于R平方值为0.99,RMSE为1.41,一致性为0.97。同样,DT和GBM也显示出强大的预测能力。地形衍生的环境协变量在预测土壤数据新近度中比土地利用和土地覆盖(LULC)变化起着更重要的作用。虽然LULC的变化显示了不同深度的土壤有机碳浓度变化,这是一个不太重要的因素。人为因素,如LULC变化和归一化植被指数(NDVI),不是土壤数据新近度的主要决定因素。土壤深度的变化对预测土壤数据的近期性没有影响。这项研究验证了地形派生的协变量,尤其是海拔因素,在使用土壤数据新近度概念预测当前土壤属性时,有效地解释了旧土壤数据的质量。这种方法有可能增强实时估计,比如碳预算,我们强调它在全球地球系统模型中的重要性。
    Sustainable soil resource management depends on reliable soil information, often derived from \'legacy soil data\' or a combination of old and new soil data. However, the task of harmonizing soil data collected at different times remains a largely unexplored in the literature. Addressing this challenge requires incorporating the temporal dimension into mathematical and statistical models for spatio-temporal soil studies. This study aimed to create a comprehensive framework for harmonizing soil data across various time. We assessed the integration of historical and recent soil data, ranging from 4 to 48 years old data, using soil data recency analysis. To achieve this, we introduced an \'age of data\' attribute, calculating the time difference between soil survey years and the present (e.g., 2022). We applied three machine learning models - Decision Trees (DT), Random Forest (RF), Gradient Boosting (GBM) - to a dataset containing 6339 sites and 28,149 depth-harmonized layers. The results consistently demonstrated robust performance across models, RF outperforming with an R-squared value of 0.99, RMSE of 1.41, and a concordance of 0.97. Similarly, DT and GBM also showed strong predictive power. Terrain-derived environmental covariates played a more important role than land use and land cover (LULC) change in predicting soil data recency. While LULC change showed soil organic carbon concentration variability across the different depths, it was a less important factor. Anthropogenic factors, such as LULC change and normalized difference vegetation index (NDVI), were not primary determinants of soil data recency. Variations in soil depth had no impact on predicting soil data recency. This study validated that terrain-derived covariates, especially elevation factors, effectively explain the quality of older soil data when predicting current soil attributes using the soil data recency concept. This approach has the potential to enhance real-time estimates, such as carbon budgets, and we emphasize its importance in global earth system models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    土壤盐渍化和碱化,干旱和半干旱地区土地退化和荒漠化的主要原因,需要有效的监测,以实现可持续的土地管理。这项研究探讨了从可见和近红外(Vis-NIR)光谱得出的偏最小二乘(PLS)潜在变量(LV)的效用,结合遥感(RS)和辅助变量,预测新疆北部地区的电导率(EC)和钠吸收率(SAR),中国。使用克拉玛依地区的90个土壤样本,机器学习模型(随机森林,支持向量回归,立体派)在四个场景中进行了测试。建模结果表明,RS和土地利用是不可靠的预测因子,但是地形属性的添加显着提高了EC和SAR的预测精度。引入来自Vis-NIR光谱的PLSLV导致随机森林模型对EC(CCC=0.83,R2=0.80,nRMSE=0.48,RPD=2.12)和SAR(CCC=0.78,R2=0.74,nRMSE=0.58,RPD=2.25)的性能最高。变量重要性分析确定了PLSLV,某些地形属性(例如,谷深,高程,信道网络基础级别,弥漫性日照),和特定的RS数据(即,VV+VH的极化指数)是研究区域最有影响力的预测因子。这项研究肯定了Vis-NIR数据用于数字土壤制图的效率,提供具有成本效益的解决方案。总之,近端的土壤传感技术和高度相关的地形属性与RF模型的集成有可能产生一个可靠的空间模型来绘制土壤EC和SAR。这种综合方法允许划定危险区域,这反过来使人们能够考虑最佳管理做法,并有助于减少受盐影响和受土壤影响的土壤中退化的风险。
    Soil salinization and sodification, the primary causes of land degradation and desertification in arid and semi-arid regions, demand effective monitoring for sustainable land management. This study explores the utility of partial least square (PLS) latent variables (LVs) derived from visible and near-infrared (Vis-NIR) spectroscopy, combined with remote sensing (RS) and auxiliary variables, to predict electrical conductivity (EC) and sodium absorption ratio (SAR) in northern Xinjiang, China. Using 90 soil samples from the Karamay district, machine learning models (Random Forest, Support Vector Regression, Cubist) were tested in four scenarios. Modeling results showed that RS and Land use alone were unreliable predictors, but the addition of topographic attributes significantly improved the prediction accuracy for both EC and SAR. The incorporation of PLS LVs derived from Vis-NIR spectroscopy led to the highest performance by the Random Forest model for EC (CCC = 0.83, R2 = 0.80, nRMSE = 0.48, RPD = 2.12) and SAR (CCC = 0.78, R2 = 0.74, nRMSE = 0.58, RPD = 2.25). The variable importance analysis identified PLS LVs, certain topographic attributes (e.g., valley depth, elevation, channel network base level, diffuse insolation), and specific RS data (i.e., polarization index of VV + VH) as the most influential predictors in the study area. This study affirms the efficiency of Vis-NIR data for digital soil mapping, offering a cost-effective solution. In conclusion, the integration of proximal soil sensing techniques and highly relevant topographic attributes with the RF model has the potential to yield a reliable spatial model for mapping soil EC and SAR. This integrated approach allows for the delineation of hazardous zones, which in turn enables the consideration of best management practices and contributes to the reduction of the risk of degradation in salt-affected and sodicity-affected soils.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    人工神经网络(ANN)已被证明是涉及大量数据的复杂问题的有用工具。我们用人工神经网络预测土壤图的用例受到政府机构的高度需求,建筑公司,或者农民,考虑到成本和时间密集型的野外工作。然而,在应用人工神经网络时,有两个主要挑战。在他们最常见的形式中,深度学习算法不提供可解释的预测性不确定性。这意味着神经网络的属性,如预测变量的确定性和合理性,依靠专家的解释,而不是通过验证ANN的评估指标来量化。Further,这些算法在地理上远离训练区域或训练数据稀疏覆盖的区域的预测中显示出很高的置信度。为了应对这些挑战,我们使用贝叶斯深度学习方法“最后一层拉普拉斯近似”,它专门设计用于将不确定性量化到深度网络中,在我们对土壤分类的探索性研究中。它纠正了过度自信的区域,而不降低预测的准确性,为我们提供了一个更现实的模型预测的不确定性表达。在我们德国南部的研究区,我们将土壤细分为土壤区域,作为测试案例,我们明确排除了训练区域中的两个土壤区域,但在预测中包括了这些区域。我们的结果强调了不确定度测量的必要性,以获得更可靠和可解释的人工神经网络结果,特别是对于远离训练区的地区。此外,从这项研究中获得的知识解决了神经网络的过度自信问题,并提供了有关土壤类型的可预测性和知识差距识别的有价值的信息。通过分析模型数据支持有限的区域,因此,高度不确定性,利益相关者可以认识到需要更多数据收集工作的领域。
    Artificial neural networks (ANNs) have proven to be a useful tool for complex questions that involve large amounts of data. Our use case of predicting soil maps with ANNs is in high demand by government agencies, construction companies, or farmers, given cost and time intensive field work. However, there are two main challenges when applying ANNs. In their most common form, deep learning algorithms do not provide interpretable predictive uncertainty. This means that properties of an ANN such as the certainty and plausibility of the predicted variables, rely on the interpretation by experts rather than being quantified by evaluation metrics validating the ANNs. Further, these algorithms have shown a high confidence in their predictions in areas geographically distant from the training area or areas sparsely covered by training data. To tackle these challenges, we use the Bayesian deep learning approach \"last-layer Laplace approximation\", which is specifically designed to quantify uncertainty into deep networks, in our explorative study on soil classification. It corrects the overconfident areas without reducing the accuracy of the predictions, giving us a more realistic uncertainty expression of the model\'s prediction. In our study area in southern Germany, we subdivide the soils into soil regions and as a test case we explicitly exclude two soil regions in the training area but include these regions in the prediction. Our results emphasize the need for uncertainty measurement to obtain more reliable and interpretable results of ANNs, especially for regions far away from the training area. Moreover, the knowledge gained from this research addresses the problem of overconfidence of ANNs and provides valuable information on the predictability of soil types and the identification of knowledge gaps. By analyzing regions where the model has limited data support and, consequently, high uncertainty, stakeholders can recognize the areas that require more data collection efforts.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    土壤特性影响植物生理和生长,在温带森林生态系统中塑造物种生态位方面发挥着重要作用。这里,我们调查了土壤数据质量对瑞士森林中41种木本植物物种的物种分布模型(SDMs)性能的影响。我们比较了基于测得的土壤特性的模型与基于区域(瑞士森林土壤图)和全球尺度(土壤网格)上数字映射的土壤特性的模型。我们首先使用测得的土壤数据以及成熟的温带林分中的植物物种存在和缺失来校准顶形气候SDM。我们使用相同的土壤预测因子开发了进一步的模型,但是从瑞士国家林业清单最近的相邻地块的数字土壤地图中提取的值。与没有土壤信息的SDM相比,没有土壤信息的SDM的预测能力,以及实测土壤信息与数字地图,用模型性能和变量贡献的指标进行了评估。平均而言,具有测量和数字映射土壤特性的模型的性能明显优于没有土壤信息的模型。基于实测和瑞士森林土壤图的SDM显示出更高的性能,尤其是对于具有“极端”生态位的物种(例如,偏好高或低pH),与使用土壤网格的人相比。然而,如果没有区域土壤地图,应测试土壤网格改善SDM的潜力。此外,在测试的土壤预测因子中,pH值,表土层的粘土含量最大程度地提高了SDM对森林木本植物的预测能力。总之,我们证明了区域土壤图对于预测温带森林中强烈环境梯度中木本物种分布的价值。温带森林SDM的准确性提高和对分布驱动因素的见解可能会支持森林管理者制定支持生物多样性保护等战略,或气候适应规划。
    Soil properties influence plant physiology and growth, playing a fundamental role in shaping species niches in temperate forest ecosystems. Here, we investigated the impact of soil data quality on the performance of species distribution models (SDMs) of 41 woody plant species in Swiss forests. We compared models based on measured soil properties with those based on digitally mapped soil properties on regional (Swiss Forest Soil Maps) and global scales (SoilGrids). We first calibrated topo-climatic SDMs with measured soil data and plant species presences and absences from mature temperate forest stand plots. We developed further models using the same soil predictors, but with values extracted from digital soil maps at the nearest neighbouring plots of the Swiss National Forestry Inventory. The predictive power of SDMs without soil information compared to those with soil information, as well as measured soil information vs digitally mapped, was evaluated with metrics of model performance and variable contribution. On average, models with measured and digitally mapped soil properties performed significantly better than those without soil information. SDMs based on measured and Swiss Forest Soil Maps showed higher performance, especially for species with an \'extreme\' niche position (e.g., preference for high or low pH), compared to those using SoilGrids. Nevertheless, if no regional soil maps are available, SoilGrids should be tested for their potential to improve SDMs. Moreover, among the tested soil predictors, pH, and clay content of the topsoil layers most improved the predictive power of SDMs for forest woody plants. In conclusion, we demonstrate the value of regional soil maps for predicting the distribution of woody species across strong environmental gradients in temperate forests. The improved accuracy of SDMs and insights into drivers of distribution may support forest managers in strategies supporting e.g. biodiversity conservation, or climate adaptation planning.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    磷(P)是陆地和水生生态系统中初级生产的关键营养素。由于P矿产储量有限且不可再生,关于可持续利用其保护后代粮食安全的讨论越来越多。了解土壤磷的空间分布对于推进有效的磷管理和促进可持续农业实践至关重要。这项研究旨在以良好的分辨率(30m)数字绘制巴西可用P(AP)和总P(TP)的库存。使用随机森林机器学习算法和表土数据库(0-20厘米),AP有28,572个样本,TP有3154个样本,我们根据与土壤形成过程相关的环境协变量预测了磷储量。通过将巴西分为两个次区域,代表具有本地覆盖范围和人为覆盖范围的区域,我们为每个子区域建立了独立的预测模型。我们的结果表明,巴西的TP库存为531Tg,AP库存为17.4Tg。最大的土壤TP存量在大西洋森林生物群落中(73.8克。m2),可能是由于这个生物群落中有机碳储量较高。最大的AP库存在Caatinga生物群落中(2.51g。m2),因为较年轻的土壤具有较低的P吸附能力。我们还发现,与本地地区相比,农业地区的肥料使用显着增加了AP库存。我们的结果表明,AP库存强烈影响巴西的农业生产,相关系数范围从0.20的咖啡作物到0.46的大豆。这项研究中生成的地图有望为农业和环境系统中P的可持续利用做出贡献。
    Phosphorus (P) is a critical nutrient for primary production in terrestrial and aquatic ecosystems. As P mineral reserves are finite and non-renewable, there is an increasing discussion on its sustainable utilization to safeguard food security for future generations. Understanding the spatial distribution of soil P is central in advancing effective phosphorus management and fostering sustainable agricultural practices. This study aims to digitally map the stocks of available P (AP) and total P (TP) in Brazil at a fine resolution (30 m). Using the Random Forest machine learning algorithm and a database of topsoil (0-20 cm) with 28,572 samples for AP and 3154 for TP, we predicted P stocks based on environmental covariates related to soil formation processes. By dividing Brazil into two sub-regions, representing areas with native coverage and anthropogenic ones, we built independent predictive models for each sub-region. Our results show that Brazil has a TP stock of 531 Tg and an AP stock of 17.4 Tg. The largest soil TP stocks are in the Atlantic Forest biome (73.8 g.m2), likely due to higher organic carbon stocks in this biome. The largest AP stocks were in the Caatinga biome (2.51 g.m2) because of younger soils with low P adsorption capacity. We also found that fertilizer use significantly increased AP stocks in agricultural areas compared to native ones. Our results indicated that AP stocks strongly influenced Brazil\'s agricultural production, with a correlation coefficient ranging from 0.20 for coffee crops to 0.46 for soybean. The maps generated in this study are expected to contribute to the sustainable use of P in agriculture and environmental systems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    与大多数欧洲国家不同,西班牙南部的安达卢西亚作为地中海地区仍然缺乏机器学习算法提供的SOC内容的数字地图。气候的多样性,地质学,水文学,景观,地形,植被,和微起伏数据作为易于获取的协变量促进了数字土壤制图(DSM)的发展。这项研究的目的是对三个深度的SOC的空间分布进行建模和映射,位于塞维利亚和科尔多瓦两省约10000公里2的区域,并使用R编程来比较两种机器学习技术(立方和随机森林),以开发多个深度的SOC图。
    本研究中使用的环境协变量包括来自数字高程模型(DEM)的九个导数,三个气候变量,最后是18个遥感光谱数据(由2019年7月获得的Landsat-8OLI和Sentinel-2AMSI计算的波段比率)。总的来说,从100个点采集300个土壤样品(0-25厘米)。本研究的目的是对SOC的空间分布进行建模和映射,位于塞维利亚和科尔多瓦两省,面积约10000平方公里,并通过R编程比较两种机器学习技术(立方体和随机森林)。
    研究结果表明,使用Landsat-8OLI和Sentinel-2AMSI卫星数据整合指数的新颖方法具有更好的结果。
    最后,我们获得的证据表明,卫星图像的分辨率在建模和数字制图中更为重要。
    UNASSIGNED: Unlike most of Europe, Andalucía in southern Spain as a Mediterranean area still lacks digital maps of SOC content provided by machine learning algorithms. The wide diversity of climate, geology, hydrology, landscape, topography, vegetation, and micro-relief data as easy-to-obtain covariates facilitated the development of digital soil mapping (DSM). The purpose of this research is to model and map the spatial distribution of SOC at three depths, in an area of approximately 10000 km 2 located in Seville and Cordoba Provinces, and to use R programming to compare two machine learning techniques (cubist and random forest) for developing SOC maps at multiple depths.
    UNASSIGNED: Environmental covariates used in this research include nine derivatives from digital elevation models (DEM), three climatic variables and finally eighteen remotely-sensed spectral data (band ratios calculated by the acquired Landsat-8 OLI and Sentinel-2A MSI in July 2019). In total, 300 soil samples from 100 points were taken (0-25 cm). The purpose of this research is to model and map the spatial distribution of SOC, in an area with approximately 10000 km2 located in Seville and Cordoba Provinces, and to compare two machine learning techniques (cubist and random forest) by R programming.
    UNASSIGNED: The findings showed that the novel approach for integrating the indices using Landsat-8 OLI and Sentinel-2A MSI satellite data had a better result.
    UNASSIGNED: Finally, we obtained evidence that the resolution of satellite images is more important in modelling and digital mapping.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: English Abstract
    土壤有机质是衡量土壤肥力的重要指标,有必要提高区域有机质空间分布预测的精度。在这项研究中,我们分析了黄河流域威宁平原1690个土壤表层(0-20cm)的有机质含量,并收集了有关自然环境和人类活动的数据。利用经典统计学建立了1348个点的SOM空间分布预测模型,确定性插值,地统计插值,和机器学习,分别,和342个样本点数据作为测试集,对不同模型的预测精度进行检验和分析。结果表明,威宁平原表层土壤SOM平均含量为14.34g·kg-1,1690个采样点土壤有机质平均变异为34.81%,表明中等程度的变异性。结果还揭示了空间分布趋势,东北和西南土壤有机质含量低,黄河中游左岸和右岸土壤有机质含量高,在威宁平原的倾斜地形中,土壤有机质相对较高。预测精度从高到低依次为机器学习方法,地统计学插值法,确定性插值法,和经典的统计方法。通过比较,基于优化的麻雀搜索算法改进的BP神经网络具有最佳的预测精度,优化后的麻雀搜索算法具有更好的收敛精度,避免陷入局部优化,防止数据过度拟合,具有较好的预测能力。该优化算法提高了SOM预测的精度,在土壤属性预测中具有良好的应用前景。
    Soil organic matter is an important indicator of soil fertility, and it is necessary to improve the accuracy of regional organic matter spatial distribution prediction. In this study, we analyzed the organic matter content of 1 690 soil surface layers (0-20 cm) and collected data on the natural environment and human activities in the Weining Plain of the Yellow River Basin. The SOM spatial distribution prediction model was established with 1 348 points using classical statistics, deterministic interpolation, geostatistical interpolation, and machine learning, respectively, and 342 sample points data were used as the test set to test and analyze the prediction accuracy of different models. The results showed that the average SOM content of the surface soil of the Weining Plain was 14.34 g·kg-1, and the average soil organic matter variation across 1 690 sampling points was 34.81%, indicating a medium degree of variability. The results also revealed a spatial distribution trend, with low soil organic matter content in the northeast and southwest, high soil organic matter on the left and right banks of the Yellow River in the middle, and relatively high soil organic matter in the sloping terrain of the Weining Plain. The four types of methods in order of high to low prediction accuracy were the machine learning method, geostatistical interpolation method, deterministic interpolation method, and classical statistical method. Through comparison, the BP neural network that was improved based on the optimized sparrow search algorithm had the best prediction accuracy, and the optimized sparrow search algorithm had better convergence accuracy, avoided falling into local optimization, prevented data overfitting, and had better prediction ability. This optimization algorithm can improve the accuracy of SOM prediction and has good application prospects in soil attribute prediction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    对被重金属污染的地点进行评估和适当管理需要有关这些金属的空间分布的准确信息。本研究旨在预测和绘制Cd的分布,Cu,Ni,Pb,使用点观测,环境变量,和基于直方图的梯度提升(HGB)建模。来自土壤地球化学空间数据库(SGSD)的9180多个表层土壤观测值(n=1150),土壤地球化学和矿物学调查(GMSS)(n=4857),和霍姆格伦数据集(HD)(n=3400),和28个协变量(100米×100米网格)代表气候,地形,植被,土壤,并编制了人类活动。使用决定系数(R2)对20%的校准中未使用的数据进行了模型性能评估,一致性相关系数(ρc),和均方根误差(RMSE)指数。预测的不确定性计算为HGB提供的估计的95%和5%分位数之间的差异。该模型解释了数据中多达50%的方差,其中Cu的RMSE介于0.16(mgkg-1)之间,Zn的RMSE介于23.4(mgkg-1)之间。分别。同样,ρc介于0.55(Cu)和0.68(Zn)之间,分别,在所有预测中,锌的R2最高(0.50)。我们在城市地区附近观察到高Pb浓度。在密西西比河下游河谷发现了所有研究金属的峰值浓度。Cu,Ni,西海岸的锌浓度较高;美国中部的镉浓度较高。粘土,pH值,潜在蒸散,温度,和降水是模型中重金属空间预测的五大重要协变量之一。点观测值和环境协变量的结合使用以及机器学习提供了对美国土壤中重金属分布的可靠预测。更新后的地图可以支持环境评估,监测,并利用适用于其他土壤数据库的这种方法进行决策,全世界。
    Assessment and proper management of sites contaminated with heavy metals require precise information on the spatial distribution of these metals. This study aimed to predict and map the distribution of Cd, Cu, Ni, Pb, and Zn across the conterminous USA using point observations, environmental variables, and Histogram-based Gradient Boosting (HGB) modeling. Over 9180 surficial soil observations from the Soil Geochemistry Spatial Database (SGSD) (n = 1150), the Geochemical and Mineralogical Survey of Soils (GMSS) (n = 4857), and the Holmgren Dataset (HD) (n = 3400), and 28 covariates (100 m × 100 m grid) representing climate, topography, vegetation, soils, and anthropic activity were compiled. Model performance was evaluated on 20 % of the data not used in calibration using the coefficient of determination (R2), concordance correlation coefficient (ρc), and root mean square error (RMSE) indices. Uncertainty of predictions was calculated as the difference between the estimated 95 and 5 % quantiles provided by HGB. The model explained up to 50 % of the variance in the data with RMSE ranging between 0.16 (mg kg-1) for Cu and 23.4 (mg kg-1) for Zn, respectively. Likewise, ρc ranged between 0.55 (Cu) and 0.68 (Zn), respectively, and Zn had the highest R2 (0.50) among all predictions. We observed high Pb concentrations near urban areas. Peak concentrations of all studied metals were found in the Lower Mississippi River Valley. Cu, Ni, and Zn concentrations were higher on the West Coast; Cd concentrations were higher in the central USA. Clay, pH, potential evapotranspiration, temperature, and precipitation were among the model\'s top five important covariates for spatial predictions of heavy metals. The combined use of point observations and environmental covariates coupled with machine learning provided a reliable prediction of heavy metals distribution in the soils of the conterminous USA. The updated maps could support environmental assessments, monitoring, and decision-making with this methodology applicable to other soil databases, worldwide.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号