PanFP

  • 文章类型: Journal Article
    分子谱分析技术,如宏基因组学,代谢组学或代谢组学为微生物组的功能多样性提供了重要的见解。相比之下,16SrRNA基因测序,一种广泛且具有成本效益的测量微生物多样性的技术,只允许间接估计微生物的功能。为了缓解这种情况,PICRUSt2,Tax4Fun2,PanFP和MetGEM等工具使用不同的算法从16SrRNA基因测序数据推断功能概况。先前的研究对这些预测的质量产生了怀疑,激励我们使用匹配的16SrRNA基因测序系统评估这些工具,宏基因组数据集,和模拟数据。我们的贡献有三个方面:(I)使用模拟数据,我们调查技术偏差是否可以解释推断和预期结果之间的不一致;(ii)考虑人类队列2型糖尿病,结直肠癌和肥胖症,我们测试功能类别的健康相关差异丰度测量是否在16SrRNA基因推断和宏基因组来源的谱之间一致;(iii)由于16SrRNA基因拷贝数是功能谱推断中的重要混淆者,我们调查使用rrnDB数据库定制的拷贝数标准化是否可以改善结果.我们的结果表明,基于16SrRNA基因的功能推断工具通常没有必要的敏感性来描绘微生物组中与健康相关的功能变化,因此应谨慎使用。此外,我们概述了所测试的各个工具的重要差异,并为工具选择提供了建议。
    Molecular profiling techniques such as metagenomics, metatranscriptomics or metabolomics offer important insights into the functional diversity of the microbiome. In contrast, 16S rRNA gene sequencing, a widespread and cost-effective technique to measure microbial diversity, only allows for indirect estimation of microbial function. To mitigate this, tools such as PICRUSt2, Tax4Fun2, PanFP and MetGEM infer functional profiles from 16S rRNA gene sequencing data using different algorithms. Prior studies have cast doubts on the quality of these predictions, motivating us to systematically evaluate these tools using matched 16S rRNA gene sequencing, metagenomic datasets, and simulated data. Our contribution is threefold: (i) using simulated data, we investigate if technical biases could explain the discordance between inferred and expected results; (ii) considering human cohorts for type two diabetes, colorectal cancer and obesity, we test if health-related differential abundance measures of functional categories are concordant between 16S rRNA gene-inferred and metagenome-derived profiles and; (iii) since 16S rRNA gene copy number is an important confounder in functional profiles inference, we investigate if a customised copy number normalisation with the rrnDB database could improve the results. Our results show that 16S rRNA gene-based functional inference tools generally do not have the necessary sensitivity to delineate health-related functional changes in the microbiome and should thus be used with care. Furthermore, we outline important differences in the individual tools tested and offer recommendations for tool selection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    已经出现了将细菌16SrRNA基因扩增子数据与基于原核参考基因组的功能基因联系起来的计算方法。本研究旨在验证或反驳功能基因预测工具在评估和比较实验治疗之间的群落功能的适用性。诱导根际微生物群落组成和功能的快速或慢速响应。小麦和大麦的根际样本是连续两年在活跃和成熟生长阶段从有机和常规农田中收集的,并采用全球变化实验设施的环境或未来气候处理。通过16SrRNA基因扩增子测序确定细菌群落组成,以及与碳有关的五种胞外酶的活性(β-葡萄糖苷酶,纤维二糖水解酶,和木糖苷酶),氮(N-乙酰氨基葡萄糖苷酶),和磷(酸性磷酸酶)循环测定。结构群落数据用于使用Tax4Fun和PanFP预测根际群落的功能模式。随后,将预测与测量的活动进行比较。尽管不同的处理主要驱动群落组成(植物生长阶段)或测量的酶活性(农业系统),这些预测以定性而非定量的方式反映了治疗中的模式。测量值和预测值之间的大多数差异来自植物生长阶段(快速群落响应),其次是农业管理和气候(社区反应较慢)。因此,我们的结果表明,预测工具适用于动态较小的环境系统中土壤群落功能的比较调查。重要性将土壤微生物群落结构与其功能联系起来,这对维持生态系统的健康和服务很重要,仍然具有挑战性。除了结构群落分析的巨大进步,功能等价物,例如宏基因组学和超转录组学,仍然是时间和成本密集型的。最近的计算方法(Tax4Fun和PanFP)旨在根据参考基因组从结构社区数据预测功能。尽管这些工具的可用性已经得到宏基因组数据的证实,到目前为止,缺少预测函数和测量函数之间的比较。因此,这项研究包括对这些工具在不同环境条件下的性能的广泛现实测试,包括相关的全球变化因素(土地利用和气候)。这项工作为预测工具在不同已建立的土壤生态系统中比较土壤群落功能的适用性提供了有价值的验证,并提出了它们在解开给定群落结构所提供的广泛功能方面的可用性。
    Computational approaches that link bacterial 16S rRNA gene amplicon data to functional genes based on prokaryotic reference genomes have emerged. This study aims to validate or refute the applicability of the functional gene prediction tools for assessment and comparison of community functionality among experimental treatments, inducing either fast or slow responses in rhizosphere microbial community composition and function. Rhizosphere samples of wheat and barley were collected in two consecutive years at active and mature growth phases from organic and conventional farming plots with ambient or future-climate treatments of the Global Change Experimental Facility. Bacterial community composition was determined by 16S rRNA gene amplicon sequencing, and the activities of five extracellular enzymes involved in carbon (β-glucosidases, cellobiohydrolase, and xylosidase), nitrogen (N-acetylglucosaminidase), and phosphorus (acid phosphatase) cycles were determined. Structural community data were used to predict functional patterns of the rhizosphere communities using Tax4Fun and PanFP. Subsequently, the predictions were compared with the measured activities. Despite the fact that different treatments mainly drove either community composition (plant growth phase) or measured enzyme activities (farming system), the predictions mirrored patterns in the treatments in a qualitative but not quantitative way. Most of the discrepancies between measured and predicted values resulted from plant growth stages (fast community response), followed by farming management and climate (slower community response). Thus, our results suggest the applicability of the prediction tools for comparative investigations of soil community functionality in less-dynamic environmental systems. IMPORTANCE Linking soil microbial community structure to its functionality, which is important for maintaining health and services of an ecosystem, is still challenging. Besides great advances in structural community analysis, functional equivalents, such as metagenomics and metatranscriptomics, are still time and cost intensive. Recent computational approaches (Tax4Fun and PanFP) aim to predict functions from structural community data based on reference genomes. Although the usability of these tools has been confirmed with metagenomic data, a comparison between predicted and measured functions is so far missing. Thus, this study comprises an expansive reality test on the performance of these tools under different environmental conditions, including relevant global change factors (land use and climate). The work provides a valuable validation of the applicability of the prediction tools for comparison of soil community functions across different sufficiently established soil ecosystems and suggest their usability to unravel the broad spectrum of functions provided by a given community structure.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号