数据融合 data fusion-医云文献数字医云科研云海量医学决策数据服务

data fusion 关注

数据融合

文献(827篇)

百科

视频

1 Long-term annual mapping and spatial-temporal dynamic analysis of winter wheat in Shandong Province based on spatial-temporal data fusion (2000-2022).

基于时空数据融合 (2000 - 2022) 的山东省冬小麦长期年作图与时空动态分析 [J]. 影响指数 : 3.307
发表时间：Aug 2024 20
来源期刊：Environ Monit Assess PMID：39162856

DOI：10.1007/s10661-024-12971-x
文章类型： Journal Article

冬小麦,作为世界上主要的主食作物之一，在确保粮食安全和制定国际粮食贸易政策方面发挥着至关重要的作用。然而,高分辨率相对稀缺，过去几十年的长时间系列冬小麦地图。这项研究利用Landsat和Sentinel-2数据制作了描绘GoogleEarthEngine（GEE）中冬小麦分布的地图。进一步分析了山东省冬小麦栽培的综合时空动态,中国。应用间隙填充和Savitzky-Golay滤波方法（GF-SG）来解决LandsatNDVI（归一化植被指数）时间序列中的时间不连续性。基于物候特征的六个特征用于区分冬小麦与其他土地覆盖类型。由此产生的地图从2000年到2022年，从2000年到2017年分辨率为30米，从2018年到2022年分辨率提高了10米。这些地图的总体准确率为80.5%至93.3%,Kappa系数为71.3%至909%，F1评分为84.2%至96.9%。在分析期间，从2000年到2011年，冬小麦种植面积有所下降。然而,发生了明显的变化，从2014年到2017年观察到冬小麦种植面积增加，从2018年到2022年随后增加。这项研究强调了使用卫星观测数据进行冬小麦长期测绘和监测的可行性。拟议的方法对将这种绘图和监测方法扩展到其他类似领域具有长期影响。
Winter wheat, as one of the world\'s key staple crops, plays a crucial role in ensuring food security and shaping international food trade policies. However, there has been a relative scarcity of high-resolution, long time-series winter wheat maps over the past few decades. This study utilized Landsat and Sentinel-2 data to produce maps depicting winter wheat distribution in Google Earth Engine (GEE). We further analyzed the comprehensive spatial-temporal dynamics of winter wheat cultivation in Shandong Province, China. The gap filling and Savitzky-Golay filter method (GF-SG) was applied to address temporal discontinuities in the Landsat NDVI (Normalized Difference Vegetation Index) time series. Six features based on phenological characteristics were used to distinguish winter wheat from other land cover types. The resulting maps spanned from 2000 to 2022, featuring a 30-m resolution from 2000 to 2017 and an improved 10-m resolution from 2018 to 2022. The overall accuracy of these maps ranged from 80.5 to 93.3%, with Kappa coefficients ranging from 71.3 to 909% and F1 scores from 84.2 to 96.9%. Over the analyzed period, the area dedicated to winter wheat cultivation experienced a decline from 2000 to 2011. However, a notable shift occurred with an increase in winter wheat acreage observed from 2014 to 2017 and a subsequent rise from 2018 to 2022. This research highlights the viability of using satellite observation data for the long-term mapping and monitoring of winter wheat. The proposed methodology has long-term implications for extending this mapping and monitoring approach to other similar areas.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
2 The development and application of a novel E-commerce recommendation system used in electric power B2B sector.

电力 B2B 行业新型电子商务推荐系统的开发与应用. 影响指数 : 暂无
发表时间：2024
来源期刊：Front Big Data PMID：39144817

DOI：10.3389/fdata.2024.1374980
文章类型： Journal Article

数字时代的到来已经将电子商务平台转变为行业的关键工具，然而，在电力行业的专门背景下，传统的推荐系统往往不足。这些系统通常与行业的独特挑战作斗争，例如不频繁和高风险的交易，长期的决策过程，和稀疏数据。这项研究开发了一种针对这些特定条件的新颖推荐引擎，例如处理企业对企业(B2B)交易的低频率和长周期性质。这种方法包括算法增强，以更好地处理和解释有限的可用数据，和数据预处理技术，旨在丰富该行业的稀疏数据集特征。本研究还引入了整合多维数据的方法创新，结合用户电子商务活动，产品细节，和必要的非招标信息。所提出的引擎采用先进的机器学习技术来提供更准确和相关的建议。结果表明，与传统模型相比，有了明显的改进，为电力行业的B2B交易提供更强大和有效的工具。这项研究不仅解决了该行业的独特挑战，而且还提供了一个蓝图，使推荐系统适应具有类似B2B特征的其他行业。
The advent of the digital era has transformed E-commerce platforms into critical tools for industry, yet traditional recommendation systems often fall short in the specialized context of the electric power industry. These systems typically struggle with the industry\'s unique challenges, such as infrequent and high-stakes transactions, prolonged decision-making processes, and sparse data. This research has developed a novel recommendation engine tailored to these specific conditions, such as to handle the low frequency and long cycle nature of Business-to-Business (B2B) transactions. This approach includes algorithmic enhancements to better process and interpret the limited data available, and data pre-processing techniques designed to enrich the sparse datasets characteristic of this industry. This research also introduces a methodological innovation that integrates multi-dimensional data, combining user E-commerce activities, product specifics, and essential non-tendering information. The proposed engine employs advanced machine learning techniques to provide more accurate and relevant recommendations. The results demonstrate a marked improvement over traditional models, offering a more robust and effective tool for facilitating B2B transactions in the electric power industry. This research not only addresses the sector\'s unique challenges but also provides a blueprint for adapting recommendation systems to other industries with similar B2B characteristics.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
3 Monitoring of meat quality and change-point detection by a sensor array and profiling of bacterial communities.

通过传感器阵列监测肉质和变化点检测并分析细菌群落。影响指数 : 6.911
发表时间：Sep 2024 1
来源期刊：Anal Chim Acta PMID：39142773

DOI：10.1016/j.aca.2024.343022
文章类型： Journal Article

背景：由于食品基质中发生了多种生化过程，因此对食品消费者质量的实时监控仍然具有挑战性。因此它需要精确的分析方法。确定变质食物的阈值通常很难设定。现有的分析方法对于食品的快速原位筛选来说过于复杂。
结果：我们通过电子鼻（e-nose）研究了肉类腐败的动力学，以将与肉类挥发性腐败标记相关的气味数字化，将结果与腐败肉样品的微生物组组成变化进行比较。我们应用时间序列分析来跟踪从电子鼻响应中提取的气体轮廓的动态变化，并确定肉状态的变化点窗口。获得的电子鼻特征与微生物组组成的变化相关，例如Brochothrix和假单胞菌属的比例增加。和支原体的消失。,具有代表性的氢气气体传感器，氨,和醇蒸气的R2值分别为0.98、0.93和0.91。将电子鼻和计算机视觉集成到单个分析面板中，可将肉类状态识别精度提高到0.85，从而实现更可靠的肉类状态评估。
结论：通过数字化电子鼻装置中的挥发性腐败标记物而实现的肉类状态变化点的准确识别有望在食品工业中应用智能小型化设备。
BACKGROUND: Real-time monitoring of food consumer quality remains challenging due to diverse bio-chemical processes taking place in the food matrices, and hence it requires accurate analytical methods. Thresholds to determine spoiled food are often difficult to set. The existing analytical methods are too complicated for rapid in situ screening of foodstuff.
RESULTS: We have studied the dynamics of meat spoilage by electronic nose (e-nose) for digitizing the smell associated with volatile spoilage markers of meat, comparing the results with changes in the microbiome composition of the spoiling meat samples. We apply the time series analysis to follow dynamic changes in the gas profile extracted from the e-nose responses and to identify the change-point window of the meat state. The obtained e-nose features correlate with changes in the microbiome composition such as increase in the proportion of Brochothrix and Pseudomonas spp. and disappearance of Mycoplasma spp., and with representative gas sensors towards hydrogen, ammonia, and alcohol vapors with R2 values of 0.98, 0.93, and 0.91, respectively. Integration of e-nose and computer vision into a single analytical panel improved the meat state identification accuracy up to 0.85, allowing for more reliable meat state assessment.
CONCLUSIONS: Accurate identification of the change-point in the meat state achieved by digitalizing volatile spoilage markers from the e-nose unit holds promises for application of smart miniaturized devices in food industry.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
4 Digital modelling method of coal-mine roadway based on millimeter-wave radar array.

基于毫米波雷达阵列的煤矿巷道数字化建模方法 [J]. 影响指数 : 4.996
发表时间：Aug 2024 10
来源期刊：Sci Rep PMID：39127781

DOI：10.1038/s41598-024-69547-5
文章类型： Journal Article

煤矿巷道空间狭窄,照明度低且不均匀。动态采矿伴随着灰尘和水雾。视觉和激光传感器采集的巷道数据的准确性和可靠性较差。基于此,提出了一种基于毫米波雷达阵列的煤矿巷道数字化建模方法。首先,针对复杂的环境干扰问题，结合毫米波点云单帧数据量小的特点,提出了一种毫米波点云的多层滤波降噪处理和动态子图配准方法，以滤除干扰点，实现单雷达点云配准。其次,针对单个毫米波雷达无法一次扫描完整的道路信息的问题，结合毫米波雷达小仰角视场的特点，建立了毫米波雷达阵列采集系统，建立了基于点云特征的改进迭代最近点(ICP)配准算法,构建了道路点云融合模型。最后,针对阵列融合后点云不均匀稀疏的问题,提出了一种基于点云密度加权插值的泊松曲面重构方法,以细化巷道结构,实现数字巷道模型的精确重构。实验结果表明,毫米波雷达阵列数字化建模方法能够准确获取巷道围岩信息,道路点云密度提高22.4%,重建的巷道模型宽度和高度的平均绝对误差百分比仅为0.82%和0.72%,为井下巷道改造提供了一种新的研究方法,也为煤矿巷道数字化建模方法提供了重要依据。
The roadway space of coal mine is narrow, and the illumination is low and uneven. Dynamic mining is accompanied by dust and water mist. The accuracy and reliability of roadway data collected by vision and laser sensors are poor. Based on this, a digital modeling method of coal mine roadway based on millimeter-wave radar array is proposed. Firstly, aiming at the problem of complex environmental interference, combined with the characteristics of small amount of single frame data of millimeter-wave point cloud, a multi-layer filtering noise reduction processing and dynamic subgraph registration method of millimeter-wave point cloud is proposed to filter out interference points and realize single radar point cloud registration. Secondly, aiming at the problem that a single millimeter-wave radar cannot scan the complete roadway information at one time, combined with the characteristics of small elevation field of view of millimeter-wave radar, a millimeter-wave radar array acquisition system is built, and an improved iterative closest point (ICP) registration algorithm based on point cloud features is established to construct the roadway point cloud fusion model. Finally, aiming at the problem of uneven and sparse point cloud after array fusion, a Poisson surface reconstruction method based on point cloud density weighted interpolation is proposed to refine the roadway structure and realize the accurate reconstruction of digital roadway model. The experimental results show that the digital modeling method of millimeter-wave radar array can accurately obtain the information of roadway surrounding rock, the density of roadway point cloud is increased by 22.4%, and the average absolute error percentage of the width and height of the reconstructed roadway model is only 0.82% and 0.72%, which provides a new research method for the reconstruction of underground roadway and an important basis for the digital modeling method of coal mine roadway.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
5 A Rotating Machinery Fault Diagnosis Method Based on Dynamic Graph Convolution Network and Hard Threshold Denoising.

基于动态图卷积网络和硬阈值去噪的旋转机械故障诊断方法 [J]. 影响指数 : 3.847
发表时间：Jul 2024 27
来源期刊：Sensors (Basel) PMID：39123933

DOI：10.3390/s24154887
文章类型： Journal Article

随着精密传感仪器和数据存储设备的发展,多传感器数据的融合在齿轮箱故障诊断中备受关注。然而,现有方法难以捕获多传感器监测信息的局部时间依赖性,不可避免的噪声严重降低了多传感器信息融合诊断的准确性。为了解决这些问题,提出了一种基于动态图卷积神经网络和硬阈值去噪的故障诊断方法。首先,考虑到来自不同传感器的监测数据之间的关系随着时间的推移而变化，采用动态图结构对多传感器数据的时间依赖性进行建模，and,进一步,构建图卷积神经网络，实现多传感器数据中时间信息的交互和特征提取。其次,为了避免实际工程中噪声的影响，设计了一个硬阈值去噪策略，并将可学习的硬阈值去噪层嵌入到图神经网络中。利用环境噪声下两个典型齿轮箱故障试验台的实验故障数据集,验证了该方法在齿轮箱故障诊断中的有效性。实验结果表明,所提出的DDGCN方法在不同环境噪声水平下的平均诊断准确率高达99.7%,表现出良好的抗噪声性能。
With the development of precision sensing instruments and data storage devices, the fusion of multi-sensor data in gearbox fault diagnosis has attracted much attention. However, existing methods have difficulty in capturing the local temporal dependencies of multi-sensor monitoring information, and the inescapable noise severely decreases the accuracy of multi-sensor information fusion diagnosis. To address these issues, this paper proposes a fault diagnosis method based on dynamic graph convolutional neural networks and hard threshold denoising. Firstly, considering that the relationships between monitoring data from different sensors change over time, a dynamic graph structure is adopted to model the temporal dependencies of multi-sensor data, and, further, a graph convolutional neural network is constructed to achieve the interaction and feature extraction of temporal information from multi-sensor data. Secondly, to avoid the influence of noise in practical engineering, a hard threshold denoising strategy is designed, and a learnable hard threshold denoising layer is embedded into the graph neural network. Experimental fault datasets from two typical gearbox fault test benches under environmental noise are used to verify the effectiveness of the proposed method in gearbox fault diagnosis. The experimental results show that the proposed DDGCN method achieves an average diagnostic accuracy of up to 99.7% under different levels of environmental noise, demonstrating good noise resistance.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
6 PROSAC as a selection tool for SO-PLS regression: A strategy for multi-block data fusion.

PROSAC 作为 SO - PLS 回归的选择工具：一种多块数据融合策略。影响指数 : 6.911
发表时间：Aug 2024 29
来源期刊：Anal Chim Acta PMID：39122277

DOI：10.1016/j.aca.2024.342965
文章类型： Journal Article

背景：来自多个来源的光谱数据可以集成到多块融合化学计量模型中，例如顺序正交化偏最小二乘(SO-PLS)，改进样本质量特征的预测。预处理技术通常用于减轻无关的可变性，与响应变量无关。然而,当处理大量块时,选择合适的预处理方法和识别信息数据块变得越来越复杂和耗时。在这项工作中解决的问题是有效的预处理，选择,以及SO-PLS中目标应用程序的数据块排序。
结果：我们介绍PROSAC-SO-PLS方法，它采用预处理集成与面向响应的顺序交替校准（PROSAC）。该方法识别最佳预处理数据块及其用于特定SO-PLS应用的顺序次序。该方法采用逐步前向选择策略，在快速革兰氏施密特过程的推动下，根据块在最小化预测误差方面的有效性来确定块的优先级，如最低预测残差所示。为了验证我们方法的有效性，我们展示了三个经验近红外(NIR)数据集的结果。对单块预处理数据集和仅依赖于PROSAC的方法进行了偏最小二乘（PLS）回归的比较分析。PROSAC-SO-PLS方法始终优于这些方法，产生显著较低的预测误差。在所分析的8个响应变量中的7个中,预测的均方根误差(RMSEP)的降低范围从5%至25%证明了这一点。
结论：PROSAC-SO-PLS方法为NIR数据建模中的集成预处理提供了一种通用且有效的技术。它使SO-PLS的使用最小化对预处理序列或块顺序的关注,并且有效地管理大量数据块。这一创新显著简化了数据预处理和模型构建过程，提高化学计量模型的准确性和效率。
BACKGROUND: Spectral data from multiple sources can be integrated into multi-block fusion chemometric models, such as sequentially orthogonalized partial-least squares (SO-PLS), to improve the prediction of sample quality features. Pre-processing techniques are often applied to mitigate extraneous variability, unrelated to the response variables. However, the selection of suitable pre-processing methods and identification of informative data blocks becomes increasingly complex and time-consuming when dealing with a large number of blocks. The problem addressed in this work is the efficient pre-processing, selection, and ordering of data blocks for targeted applications in SO-PLS.
RESULTS: We introduce the PROSAC-SO-PLS methodology, which employs pre-processing ensembles with response-oriented sequential alternation calibration (PROSAC). This approach identifies the best pre-processed data blocks and their sequential order for specific SO-PLS applications. The method uses a stepwise forward selection strategy, facilitated by the rapid Gram-Schmidt process, to prioritize blocks based on their effectiveness in minimizing prediction error, as indicated by the lowest prediction residuals. To validate the efficacy of our approach, we showcase the outcomes of three empirical near-infrared (NIR) datasets. Comparative analyses were performed against partial-least-squares (PLS) regressions on single-block pre-processed datasets and a methodology relying solely on PROSAC. The PROSAC-SO-PLS approach consistently outperformed these methods, yielding significantly lower prediction errors. This has been evidenced by a reduction in the root-mean-squared error of prediction (RMSEP) ranging from 5 to 25 % across seven out of the eight response variables analyzed.
CONCLUSIONS: The PROSAC-SO-PLS methodology offers a versatile and efficient technique for ensemble pre-processing in NIR data modeling. It enables the use of SO-PLS minimizing concerns about pre-processing sequence or block order and effectively manages a large number of data blocks. This innovation significantly streamlines the data pre-processing and model-building processes, enhancing the accuracy and efficiency of chemometric models.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
7 Comparison of data fusion strategies for automated prostate lesion detection using mpMRI correlated with whole mount histology.

使用与整体组织学相关的 mpMRI 进行自动前列腺病变检测的数据融合策略的比较。影响指数 : 4.309
发表时间：Jul 2024 29
来源期刊：Radiat Oncol PMID：39080735

DOI：10.1186/s13014-024-02471-0
文章类型： Journal Article

背景：在这项工作中，我们比较输入水平，特征级和决策级数据融合技术，用于自动检测有临床意义的前列腺病变（csPCa）。
方法：使用Unet作为基线开发了多种深度学习CNN架构。CNN使用两种多参数MRI图像(T2W，ADC,和高b值)和定量临床数据(前列腺特异性抗原(PSA)，PSA密度(PSAD),前列腺体积和总肿瘤体积（GTV）），只有MP-MRI图像（n=118），作为输入。此外,来自整个坐骑组织病理学图像(n=22)的共同配准的地面实况数据被用作评估的测试集。
结果：早期/中期/晚期融合的CNN精度为0.41/0.51/0.61，召回值为0.18/0.22/0.25，平均精度为0.13/0.19/0.27，F评分为0.55/0.67/0.76。DiceSorensen系数（DSC）用于评估将mpMRI与参数临床数据相结合以检测csPCa的影响。我们比较了用mpMRI和参数化临床训练的CNN的预测与仅用mpMRI图像作为输入的CNN的预测之间的DSC。我们获得的DSC数据分别为0.30/0.34/0.36和0.26/0.33/0.34。此外,我们评估了每个mpMRI输入通道对csPCa检测任务的影响，并获得了0.14/0.25/0.28的DSC。
结论：结果表明，决策级融合网络在前列腺病变检测任务中表现更好。将mpMRI数据与定量临床数据相结合并没有显示出这些网络之间的显着差异（p=0.26/0.62/0.85）。结果表明，用所有mpMRI数据训练的CNN优于具有较少输入通道的CNN，这与当前的临床协议一致，其中相同的输入用于PI-RADS病变评分。
背景：该试验在德国临床研究注册中心（DRKS）以提案编号Nr进行回顾性注册。476/14&476/19。
BACKGROUND: In this work, we compare input level, feature level and decision level data fusion techniques for automatic detection of clinically significant prostate lesions (csPCa).
METHODS: Multiple deep learning CNN architectures were developed using the Unet as the baseline. The CNNs use both multiparametric MRI images (T2W, ADC, and High b-value) and quantitative clinical data (prostate specific antigen (PSA), PSA density (PSAD), prostate gland volume & gross tumor volume (GTV)), and only mp-MRI images (n = 118), as input. In addition, co-registered ground truth data from whole mount histopathology images (n = 22) were used as a test set for evaluation.
RESULTS: The CNNs achieved for early/intermediate / late level fusion a precision of 0.41/0.51/0.61, recall value of 0.18/0.22/0.25, an average precision of 0.13 / 0.19 / 0.27, and F scores of 0.55/0.67/ 0.76. Dice Sorensen Coefficient (DSC) was used to evaluate the influence of combining mpMRI with parametric clinical data for the detection of csPCa. We compared the DSC between the predictions of CNN\'s trained with mpMRI and parametric clinical and the CNN\'s trained with only mpMRI images as input with the ground truth. We obtained a DSC of data 0.30/0.34/0.36 and 0.26/0.33/0.34 respectively. Additionally, we evaluated the influence of each mpMRI input channel for the task of csPCa detection and obtained a DSC of 0.14 / 0.25 / 0.28.
CONCLUSIONS: The results show that the decision level fusion network performs better for the task of prostate lesion detection. Combining mpMRI data with quantitative clinical data does not show significant differences between these networks (p = 0.26/0.62/0.85). The results show that CNNs trained with all mpMRI data outperform CNNs with less input channels which is consistent with current clinical protocols where the same input is used for PI-RADS lesion scoring.
BACKGROUND: The trial was registered retrospectively at the German Register for Clinical Studies (DRKS) under proposal number Nr. 476/14 & 476/19.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
8 Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files.

回归辅助的贝叶斯记录链接用于在两个文件上传播协变量的观察研究中的因果关系。影响指数 : 1.095
发表时间：Mar 2024
来源期刊：J Stat Plan Inference PMID：39076728

DOI：10.1016/j.jspi.2023.07.004
文章类型： Journal Article

我们考虑将数据分散在两个文件中的观察性研究的因果推断。一个文件包括治疗，结果,和一些在一组个体上测量的协变量，另一个文件包括在部分重叠的一组个体上测量的其他因果相关协变量。通过链接两个数据库中的记录，分析师可以控制更多的协变量，因此，与仅使用一个文件相比，降低了偏差的风险。当分析师无法访问启用完美的唯一标识符时，无差错链接，它们通常依赖于概率记录链接来构建单个链接的数据集，并使用这些关联数据估计因果效应。这种典型的做法不会将不确定性从不完美的联系传播到因果推论。Further,它没有利用变量之间的关系来提高连锁质量。我们通过融合回归辅助，具有因果推断的贝叶斯概率记录链接。马尔可夫链蒙特卡洛采样器生成多个合理的链接数据文件作为副产品，分析师可以将其用于多个归因推断。这里,我们显示了基于倾向得分重叠权重的两个因果估计的结果.使用意大利家庭收入和财富调查的模拟和数据，我们表明，我们的方法可以提高估计治疗效果的准确性。
We consider causal inference for observational studies with data spread over two files. One file includes the treatment, outcome, and some covariates measured on a set of individuals, and the other file includes additional causally-relevant covariates measured on a partially overlapping set of individuals. By linking records in the two databases, the analyst can control for more covariates, thereby reducing the risk of bias compared to using only one file alone. When analysts do not have access to a unique identifier that enables perfect, error-free linkages, they typically rely on probabilistic record linkage to construct a single linked data set, and estimate causal effects using these linked data. This typical practice does not propagate uncertainty from imperfect linkages to the causal inferences. Further, it does not take advantage of relationships among the variables to improve the linkage quality. We address these shortcomings by fusing regression-assisted, Bayesian probabilistic record linkage with causal inference. The Markov chain Monte Carlo sampler generates multiple plausible linked data files as byproducts that analysts can use for multiple imputation inferences. Here, we show results for two causal estimators based on propensity score overlap weights. Using simulations and data from the Italy Survey on Household Income and Wealth, we show that our approach can improve the accuracy of estimated treatment effects.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
9 Integration of LC-HRMS and 1H NMR metabolomics data fusion approaches for classification of Amarone wine based on withering time and yeast strain.

LC - HRMS 和 1H NMR 代谢组学数据融合方法的整合，用于基于枯萎时间和酵母菌株的 Amarone 葡萄酒分类。影响指数 : 6.443
发表时间：Oct 2024 30
来源期刊：Food Chem X PMID：39071933

DOI：10.1016/j.fochx.2024.101607
文章类型： Journal Article

结合两种非靶向代谢组学方法（LC-HRMS和1HNMR），根据葡萄枯萎时间和酵母菌株对Amarone葡萄酒进行分类。这项研究采用了多组学数据整合方法，结合无监督数据探索(MCIA)和监督统计分析(SPLS-DA)。结果显示，多组学伪特征值空间突出了数据集之间的有限相关性（RV分数=16.4%），表明分析的互补性。此外，SPLS-DA模型根据枯萎时间和酵母菌株对葡萄酒样品进行了正确分类，提供了更广泛的葡萄酒代谢组的表征，相对于从个别技术获得的东西。在氨基酸的积累中观察到显着的变化，单糖，和多酚化合物在整个枯萎过程中，样本分类错误率较低(7.52%)。总之,该策略展示了整合大型组学数据集和识别关键代谢物的高度能力，这些代谢物能够根据葡萄酒样品的特征进行区分。
Two untargeted metabolomics approaches (LC-HRMS and 1H NMR) were combined to classify Amarone wines based on grape withering time and yeast strain. The study employed a multi-omics data integration approach, combining unsupervised data exploration (MCIA) and supervised statistical analysis (sPLS-DA). The results revealed that the multi-omics pseudo-eigenvalue space highlighted a limited correlation between the datasets (RV-score = 16.4%), suggesting the complementarity of the assays. Furthermore, the sPLS-DA models correctly classified wine samples according to both withering time and yeast strains, providing a much broader characterization of wine metabolome with respect to what was obtained from the individual techniques. Significant variations were notably observed in the accumulation of amino acids, monosaccharides, and polyphenolic compounds throughout the withering process, with a lower error rate in sample classification (7.52%). In conclusion, this strategy demonstrated a high capability to integrate large omics datasets and identify key metabolites able to discriminate wine samples based on their characteristics.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
10 Revealing static and dynamic biomarkers from postprandial metabolomics data through coupled matrix and tensor factorizations.

通过耦合矩阵和张量分解从餐后代谢组学数据中显示静态和动态生物标志物。影响指数 : 4.747
发表时间：Jul 2024 27
来源期刊：Metabolomics PMID：39066850

DOI：10.1007/s11306-024-02128-9
文章类型： Journal Article

背景：来自膳食挑战测试的纵向代谢组学数据包含空腹和动态信号，这可能与代谢健康和疾病有关。最近的工作探索了时间分辨代谢组学数据的多路结构，方法是将其排列为具有模式的三路数组：受试者，代谢物，和时间。对这些动态数据的分析（其中从餐后状态中减去空腹数据）揭示了各种表型的动态标记，以及禁食状态和动态状态之间的差异。然而,在提取相同受试者分层的静态和动态生物标志物方面仍然取得了有限的成功.
目的：通过对空腹和动态代谢组学数据的联合分析，我们的目标是捕获同一主题分层的静态和动态生物标志物，提供完整的图片，这将对精确健康更有效。
方法：我们使用耦合矩阵和张量分解（CMTF）联合分析了COPSAC2000队列在进餐挑战测试期间收集的空腹和动态代谢组学数据，其中动态数据（受试者按时间的代谢物）与受试者模式中的空腹数据（受试者按代谢物）耦合。
结果：提出的数据融合方法从空腹和动态信号中提取BMI（体重指数）方面的共享受试者分层，以及对应于这些分层的静态和动态代谢生物标志物模式。具体来说,我们观察到受试者分层显示与所有空腹VLDLs和较高BMI呈正相关.对于同一主题分层，动态VLDLs的子集(主要是较小的大小)与较高的BMI呈负相关.与空腹和餐后状态的个体分析相比,使用这种数据融合方法观察到受试者定量与感兴趣的表型的更高相关性。
结论：基于CMTF的方法提供了相同主题分层的静态和动态生物标志物的完整图片-当标志物同时存在于空腹和动态状态时。
BACKGROUND: Longitudinal metabolomics data from a meal challenge test contains both fasting and dynamic signals, that may be related to metabolic health and diseases. Recent work has explored the multiway structure of time-resolved metabolomics data by arranging it as a three-way array with modes: subjects, metabolites, and time. The analysis of such dynamic data (where the fasting data is subtracted from postprandial states) reveals dynamic markers of various phenotypes, and differences between fasting and dynamic states. However, there is still limited success in terms of extracting static and dynamic biomarkers for the same subject stratifications.
OBJECTIVE: Through joint analysis of fasting and dynamic metabolomics data, our goal is to capture static and dynamic biomarkers of a phenotype for the same subject stratifications providing a complete picture, that will be more effective for precision health.
METHODS: We jointly analyze fasting and dynamic metabolomics data collected during a meal challenge test from the COPSAC 2000 cohort using coupled matrix and tensor factorizations (CMTF), where the dynamic data (subjects by metabolites by time) is coupled with the fasting data (subjects by metabolites) in the subjects mode.
RESULTS: The proposed data fusion approach extracts shared subject stratifications in terms of BMI (body mass index) from fasting and dynamic signals as well as the static and dynamic metabolic biomarker patterns corresponding to those stratifications. Specifically, we observe a subject stratification showing the positive association with all fasting VLDLs and higher BMI. For the same subject stratification, a subset of dynamic VLDLs (mainly the smaller sizes) correlates negatively with higher BMI. Higher correlations of the subject quantifications with the phenotype of interest are observed using such a data fusion approach compared to individual analyses of the fasting and postprandial state.
CONCLUSIONS: The CMTF-based approach provides a complete picture of static and dynamic biomarkers for the same subject stratifications-when markers are present in both fasting and dynamic states.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文

data fusion 关注

1 Long-term annual mapping and spatial-temporal dynamic analysis of winter wheat in Shandong Province based on spatial-temporal data fusion (2000-2022).

2 The development and application of a novel E-commerce recommendation system used in electric power B2B sector.

3 Monitoring of meat quality and change-point detection by a sensor array and profiling of bacterial communities.

4 Digital modelling method of coal-mine roadway based on millimeter-wave radar array.

5 A Rotating Machinery Fault Diagnosis Method Based on Dynamic Graph Convolution Network and Hard Threshold Denoising.

6 PROSAC as a selection tool for SO-PLS regression: A strategy for multi-block data fusion.

7 Comparison of data fusion strategies for automated prostate lesion detection using mpMRI correlated with whole mount histology.

8 Regression-Assisted Bayesian Record Linkage for Causal Inference in Observational Studies with Covariates Spread Over Two Files.

9 Integration of LC-HRMS and 1H NMR metabolomics data fusion approaches for classification of Amarone wine based on withering time and yeast strain.

10 Revealing static and dynamic biomarkers from postprandial metabolomics data through coupled matrix and tensor factorizations.