compositional data

  • Article type: Journal Article
    Soil contamination in outdoor shooting ranges (OSRs) is a major threat to human health, particularly when the land is converted to recreational or agricultural use after shooting activities end. The land degradation status of an OSR in southern Italy was assessed using a multisensor approach based on: i) proximal sensors, including electromagnetic induction (EMI) for measuring soil electrical conductivity (ECa) and magnetic susceptibility (MSa), γ-ray spectrometry for K, eU and eTh, and ultrasonic penetrometry for cone index (CI) data representing soil strength; ii) field surveys of soil thickness (ST); and iii) laboratory analyses of potentially toxic elements (PTEs) by portable X-ray fluorescence spectrometry and of polycyclic aromatic hydrocarbons (PAHs) by gas chromatography. The spatial variability of the measurements was modelled and mapped using geostatistical methods. The most densely measured covariate (the ECa of the topsoil) was used in kriging with external drift to improve the PTE predictions, and the PTE maps were complemented by maps of spatial uncertainty. A robust principal component analysis (rPCA) applied to the proximal sensor and laboratory data identified an association of PAHs, lead and CI with the topsoil ECa along the first component (PC1), highlighting the correlation between anthropogenic effects on the land and the EMI measurements. Along the second component (PC2), the association of ST (which estimates the depth of underground travertine hard layers) with the bottom-soil ECa and MSa showed the influence of soil stratigraphy on the EMI measurements. This study demonstrates that combining different proximal sensors with laboratory analysis makes it possible to assess and model the spatial variability of the land degradation status of an OSR, including soil compaction and organic and inorganic contamination. The correlation between the EMI data and the PTE content highlights the potential of this technique in the field of soil contamination.

  • Article type: Journal Article
    The hydrogeological processes driving the hydrochemical composition of groundwater in the pristine alpine aquifer system of the Port del Comte Massif (PCM) are characterized with two multivariate statistical techniques, Principal Component Analysis (PCA) and Gaussian Mixture Models (GMM), within the framework of Compositional Data (CoDa) analysis. The groundwater Natural Background Levels (NBLs) for NO3, SO4 and Cl are also evaluated; these are especially important for indicating groundwater contamination derived from the anthropic activities conducted in the PCM. The hydrogeochemical facies found in the aquifer system of the PCM comprise low-mineralized Ca-HCO3 water in the main Eocene karst aquifer, and Ca-SO4 and highly mineralized NaCl water types in the minor aquifers discharging from the PCM. The NBL values of SO4, Cl and NO3 obtained for the main karst aquifer are 14.33, 4.06 and 6.55 mg/L, respectively. These values are 35, 3 and 1.2 times lower than the respective official NBL values set by the water administration as the reference for pollution assessments in the main karst aquifer. Official overestimation of NBLs can put important groundwater resources in the PCM at risk.
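
    As a rough illustration of what working "in the framework of CoDa analysis" can involve, the sketch below applies a centred logratio (clr) transform to a vector of concentrations before any PCA or mixture modelling. The abstract does not say which logratio transform the authors used, so clr is an assumption here, and the three NBL values quoted above serve only as example numbers.

```python
import math

def clr(parts):
    """Centred logratio transform of strictly positive parts.

    Each part is expressed as the log of its ratio to the geometric
    mean of all parts, so the result sums to zero and does not depend
    on the units or the total of the input.
    """
    logs = [math.log(p) for p in parts]
    mean_log = sum(logs) / len(logs)
    return [v - mean_log for v in logs]

# Example numbers only: SO4, Cl and NO3 in mg/L, taken from the abstract.
concentrations = [14.33, 4.06, 6.55]
coords = clr(concentrations)

# Rescaling the input (for example a change of units) leaves clr unchanged.
rescaled = clr([1000.0 * c for c in concentrations])
```

    Because the clr coordinates depend only on ratios between parts, a PCA fitted on them is not affected by the closure or the measurement units of the original data.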

  • Article type: Journal Article
    Two-way contingency tables arise in many fields, such as medical studies, where the relation between two discrete random variables or responses is to be assessed. We propose to analyze and visualize a sample of 2 × 2 tables in the context of a single-subject repeated-measurements design by means of compositional data (CoDa) methods. First, we propose to visualize the tables in a quaternary diagram. Second, we show how to represent these tables by means of logratios that indicate the relationship between the two variables as well as the strength and direction of their dependency. Finally, we describe a technique to model those tables with a simplicial regression model. Data from a real-world study of self-prediction of migraine attack onset are used to illustrate this methodology. For each individual, the 2 × 2 table of their migraine expectation vs. next-day migraine occurrence is computed, generating a sample of tables. We then visualize and interpret the prediction ability of individuals both in the simplex and in terms of logratios of components. Finally, we model the self-prediction ability with respect to demographic variables, days tracked and disease characteristics. Our application demonstrates that CoDa can be a useful tool for visualizing, modeling, and interpreting the components of 2 × 2 tables.
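
    A minimal sketch of the idea of treating a 2 × 2 table as a four-part composition and summarizing dependency with a logratio. The abstract does not give the authors' exact coordinates, so the log odds ratio is used here as one familiar logratio; the cell counts, the function names and the 0.5 pseudo-count for empty cells are all illustrative assumptions.

```python
import math

def table_to_composition(a, b, c, d, pseudo=0.5):
    """Turn the four cell counts of a 2 x 2 table into proportions.

    Assumed cell order: a = expected and occurred, b = expected and
    did not occur, c = not expected and occurred, d = not expected
    and did not occur. The pseudo-count (an assumption) keeps every
    part strictly positive so that logratios are defined.
    """
    cells = [a + pseudo, b + pseudo, c + pseudo, d + pseudo]
    total = sum(cells)
    return [x / total for x in cells]

def log_odds_ratio(comp):
    """Logratio of concordant to discordant cells.

    Positive values mean expectation and occurrence tend to agree,
    negative values mean they tend to disagree, and zero corresponds
    to independence.
    """
    p11, p10, p01, p00 = comp
    return math.log((p11 * p00) / (p10 * p01))

# Hypothetical counts for one tracked individual.
good_predictor = table_to_composition(20, 5, 5, 20)
no_association = table_to_composition(10, 10, 10, 10)
strength = log_odds_ratio(good_predictor)
```

    The sign of the logratio gives the direction of the dependency and its magnitude the strength, which is the kind of reading the abstract describes.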

  • Article type: Journal Article
    The specific objectives of this study were: (1) to establish background content levels of potentially harmful elements in the topsoil of the Quintero and Puchuncavi districts; (2) to assess the degree of topsoil pollution and identify the local sources of pollution; and (3) to assess the health risk of metal exposure to the population. Data from a Chilean Environmental Ministry report were analyzed. Multivariate analysis suggests that the potentially harmful elements split into two main groups, which can be explained by controlling geogenic (Ni, V and Zn) and anthropogenic (As, Cd, Cu and Pb) factors. The distribution of As, Cd, Cu and Pb displays a spatial pattern around the Puchuncavi-Ventanas industrial park. The background values determined with the MAD (median absolute deviation) method are As 31.6 mg kg⁻¹, Cd 0.7 mg kg⁻¹, Cu 296 mg kg⁻¹, Ni 16.9 mg kg⁻¹, Pb 48.6 mg kg⁻¹, V 199 mg kg⁻¹ and Zn 192 mg kg⁻¹. The pollution indices indicated considerable to very high contamination in some soil samples. Hazard quotient values were higher for children than for adults, due to ingestion and dermal absorption, while the cancer risk index values were in the acceptable range.
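
    The MAD-based background estimate mentioned above can be sketched as follows. The abstract does not state the exact rule, so the common convention of median + 2 × scaled MAD is assumed here; the multiplier `k`, the 1.4826 scaling constant and the sample concentrations are illustrative assumptions, not values from the study.

```python
import statistics

def mad_upper_threshold(values, k=2.0, scale=1.4826):
    """Upper background threshold as median + k * scaled MAD.

    The MAD is the median of absolute deviations from the median;
    scaling by 1.4826 makes it comparable to a standard deviation
    for normally distributed data. Both k and scale are assumptions.
    """
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    return med + k * scale * mad

# Hypothetical topsoil concentrations (mg/kg) with one polluted sample.
samples = [10, 12, 11, 13, 12, 95]
threshold = mad_upper_threshold(samples)

# Samples above the threshold would be flagged as exceeding background.
flagged = [v for v in samples if v > threshold]
```

    Because both the median and the MAD are robust to outliers, the single high value barely moves the threshold, which is why this kind of estimator is attractive for background levels in polluted areas.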

  • Article type: Journal Article
    BACKGROUND: At a global level, access to safe drinking water and sanitation is monitored by the Joint Monitoring Programme (JMP) of WHO and UNICEF. The methods employed are based on analysis of data from household surveys and linear regression modelling of these results over time. However, there is evidence of non-linearity in the JMP data. In addition, the compositional nature of these data is not taken into consideration. This article seeks to address these two shortcomings in order to produce more accurate estimates.
    METHODS: We employed an isometric log-ratio transformation designed for compositional data. We applied linear and non-linear time regressions to both the original and the transformed data. Specifically, different modelling alternatives for non-linear trajectories were analysed, all of which are based on a generalized additive model (GAM).
    CONCLUSIONS: Non-linear methods, such as GAM, may be used for modelling non-linear trajectories in the JMP data. This projection method is particularly suited to data-rich countries. Moreover, the ilr transformation of compositional data is conceptually sound and fairly simple to implement. It helps improve the performance of both linear and non-linear regression models, particularly when extreme data points occur, i.e. when coverage rates are near 0% or 100%.
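
    To make the ilr step concrete, here is a minimal sketch of an isometric logratio transform and its inverse. The abstract does not specify which ilr basis was used, so pivot coordinates are assumed; the three-part coverage split and the function names are hypothetical.

```python
import math

def closure(parts):
    """Rescale positive parts so that they sum to 1."""
    total = sum(parts)
    return [p / total for p in parts]

def ilr(parts):
    """Pivot-coordinate ilr of a composition with strictly positive parts."""
    logs = [math.log(p) for p in parts]
    d = len(logs)
    coords = []
    for i in range(d - 1):
        r = d - i - 1  # number of parts after part i
        mean_rest = sum(logs[i + 1:]) / r
        coords.append(math.sqrt(r / (r + 1)) * (logs[i] - mean_rest))
    return coords

def ilr_inverse(coords):
    """Map ilr coordinates back to a composition that sums to 1."""
    d = len(coords) + 1
    clr = [0.0] * d
    for i, z in enumerate(coords):
        r = d - i - 1
        clr[i] += z * math.sqrt(r / (r + 1))
        for j in range(i + 1, d):
            clr[j] -= z / math.sqrt(r * (r + 1))
    return closure([math.exp(v) for v in clr])

# Hypothetical coverage split: improved, unimproved, no facility.
coverage = [0.7, 0.2, 0.1]
z = ilr(coverage)
back = ilr_inverse(z)

# Any real-valued prediction made in ilr space maps to valid proportions.
projected = ilr_inverse([5.0, -3.0])
```

    A regression fitted on the unbounded ilr coordinates and then back-transformed cannot produce coverage below 0% or above 100%, which is consistent with the improvement the abstract reports near those extremes.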