functional data

功能数据
  • 文章类型: Journal Article
    我们将纵向全基因组关联研究中不规则时间点观察到的阿尔茨海默病相关表型响应变量建模为稀疏功能数据,并提出非参数检验程序来检测功能基因型效应,同时控制环境协变量的混杂效应。我们对协方差测试的新功能分析基于看似无关的内核平滑器,它考虑了受试者内部的时间相关性,从而享受比现有功能测试更好的能力。我们证明了所提出的测试与一致一致的非参数协方差函数估计器相结合,具有Wilks现象,并且是minimax最强大的。本文所用的数据来自阿尔茨海默病神经影像学倡议(ADNI)数据库,其中提出的测试的应用导致发现了可能与阿尔茨海默病有关的新基因。
    We model the Alzheimer\'s Disease-related phenotype response variables observed on irregular time points in longitudinal Genome-Wide Association Studies as sparse functional data and propose nonparametric test procedures to detect functional genotype effects while controlling the confounding effects of environmental covariates. Our new functional analysis of covariance tests are based on a seemingly unrelated kernel smoother, which takes into account the within-subject temporal correlations, and thus enjoy improved power over existing functional tests. We show that the proposed test combined with a uniformly consistent nonparametric covariance function estimator enjoys the Wilks phenomenon and is minimax most powerful. Data used in the preparation of this article were obtained from the Alzheimer\'s Disease Neuroimaging Initiative (ADNI) database, where an application of the proposed test lead to the discovery of new genes that may be related to Alzheimer\'s Disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    最近,双向或纵向功能数据分析在许多领域备受关注。然而,关于如何适当地表征双向功能预测因子和标量响应之间的关联知之甚少。以死亡率研究为动机,在本文中,我们提出了一种新的双向泛函线性模型,其中响应是标量,函数预测是双向轨迹。该模型是直观的,可解释并自然地捕获双向功能预测器和标量型响应的每种方式之间的关系。Further,我们开发了一种新的估计方法来估计弱可分性框架下的回归函数。构造回归函数的主要技术工具是乘积函数主成分分析和迭代最小二乘法。在广泛的仿真研究中证明了我们方法的可靠性能。我们还分析了死亡率数据集,以说明所提出的程序的有用性。
    Recently, two-way or longitudinal functional data analysis has attracted much attention in many fields. However, little is known on how to appropriately characterize the association between two-way functional predictor and scalar response. Motivated by a mortality study, in this paper, we propose a novel two-way functional linear model, where the response is a scalar and functional predictor is two-way trajectory. The model is intuitive, interpretable and naturally captures relationship between each way of two-way functional predictor and scalar-type response. Further, we develop a new estimation method to estimate the regression functions in the framework of weak separability. The main technical tools for the construction of the regression functions are product functional principal component analysis and iterative least square procedure. The solid performance of our method is demonstrated in extensive simulation studies. We also analyze the mortality dataset to illustrate the usefulness of the proposed procedure.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在本文中,我们提出了一种新的具有时间动态单指数相互作用的半参数函数对分位数回归模型。我们的模型非常灵活地考虑了多元纵向/功能协变量对纵向响应的非线性时间动态相互作用的影响,大多数现有的纵向数据分位数回归模型是我们提出的模型的特例。我们建议通过张量积B样条逼近双变量非参数系数函数,并采用检查损失最小化方法来估计双变量系数函数和索引参数向量。在一些温和的条件下,我们使用投影正交化技术建立了估计的单指数系数的渐近正态,并获得估计的二元系数函数的收敛速度。此外,我们提出了一个分数检验来检验协变量之间是否存在交互效应。蒙特卡罗模拟和经验数据分析说明了该方法的有限样本性能。
    In this paper we propose a new semiparametric function-on-function quantile regression model with time-dynamic single-index interactions. Our model is very flexible in taking into account of the nonlinear time-dynamic interaction effects of the multivariate longitudinal/functional covariates on the longitudinal response, that most existing quantile regression models for longitudinal data are special cases of our proposed model. We propose to approximate the bivariate nonparametric coefficient functions by tensor product B-splines, and employ a check loss minimization approach to estimate the bivariate coefficient functions and the index parameter vector. Under some mild conditions, we establish the asymptotic normality of the estimated single-index coefficients using projection orthogonalization technique, and obtain the convergence rates of the estimated bivariate coefficient functions. Furthermore, we propose a score test to examine whether there exist interaction effects between the covariates. The finite sample performance of the proposed method is illustrated by Monte Carlo simulations and an empirical data analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    宿主微生物生物学(HMB)站在重新定义的风口浪尖上,挑战传统范式,转而接受对微生物科学的更全面的理解。美国微生物学会(ASM)微生物科学理事会于2023年举办了一次虚拟务虚会,以确定HMB领域的未来以及推进微生物科学所需的创新。务虚会的演讲和讨论共同强调了微生物的相互联系及其对人类的深刻影响,动物,和环境健康,以及需要拓宽视野以充分接受这些互动的复杂性。为了推进HMB研究,微生物科学家将受益于加强跨学科和跨学科研究,以利用不同领域的专业知识,整合不同的学科,并促进HMB内部的公平和可及性。通过汇集各种科学观点,数据集成对于塑造HMB研究的未来至关重要。新的和创新的技术,和“组学”方法。ASM可以授权资源不足的群体,以确保尖端研究的好处达到科学界的每个角落。因此,ASM将准备引导HMB走向一个倡导包容性的未来,创新,和可获得的科学进步。
    Host-microbe biology (HMB) stands on the cusp of redefinition, challenging conventional paradigms to instead embrace a more holistic understanding of the microbial sciences. The American Society for Microbiology (ASM) Council on Microbial Sciences hosted a virtual retreat in 2023 to identify the future of the HMB field and innovations needed to advance the microbial sciences. The retreat presentations and discussions collectively emphasized the interconnectedness of microbes and their profound influence on humans, animals, and environmental health, as well as the need to broaden perspectives to fully embrace the complexity of these interactions. To advance HMB research, microbial scientists would benefit from enhancing interdisciplinary and transdisciplinary research to utilize expertise in diverse fields, integrate different disciplines, and promote equity and accessibility within HMB. Data integration will be pivotal in shaping the future of HMB research by bringing together varied scientific perspectives, new and innovative techniques, and \'omics approaches. ASM can empower under-resourced groups with the goal of ensuring that the benefits of cutting-edge research reach every corner of the scientific community. Thus, ASM will be poised to steer HMB toward a future that champions inclusivity, innovation, and accessible scientific progress.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    与基于条件均值回归的标量函数回归模型相比,标量对函数分位数回归对响应变量中的异常值是稳健的。然而,它容易受到功能预测器中的异常值(称为杠杆点)的影响。这是因为回归分位数的影响函数在响应变量中有界,但在预测空间中无界。杠杆点可能会改变预测矩阵的特征结构,导致估计和预测结果不佳。本研究提出了一种稳健的程序,可以在函数标量分位数回归方法中估计模型参数,并在存在异常值和杠杆点的情况下产生可靠的预测。所提出的方法基于函数偏分位数回归过程。我们提出了一个加权的部分分位数协方差来获得标量对函数分位数回归模型的函数部分分位数分量。分解后,模型参数通过加权损失函数估计,其中,通过迭代重新加权部分分位数分量来获得鲁棒性。通过一系列Monte-Carlo实验和经验数据示例评估了该方法的估计和预测性能。将结果与几种现有方法进行了比较。该方法在R包robfpqr中实现。
    Compared with the conditional mean regression-based scalar-on-function regression model, the scalar-on-function quantile regression is robust to outliers in the response variable. However, it is susceptible to outliers in the functional predictor (called leverage points). This is because the influence function of the regression quantiles is bounded in the response variable but unbounded in the predictor space. The leverage points may alter the eigenstructure of the predictor matrix, leading to poor estimation and prediction results. This study proposes a robust procedure to estimate the model parameters in the scalar-on-function quantile regression method and produce reliable predictions in the presence of both outliers and leverage points. The proposed method is based on a functional partial quantile regression procedure. We propose a weighted partial quantile covariance to obtain functional partial quantile components of the scalar-on-function quantile regression model. After the decomposition, the model parameters are estimated via a weighted loss function, where the robustness is obtained by iteratively reweighting the partial quantile components. The estimation and prediction performance of the proposed method is evaluated by a series of Monte-Carlo experiments and an empirical data example. The results are compared favorably with several existing methods. The method is implemented in an R package robfpqr.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    我们提出了一种新的非参数条件独立性检验,用于连续分位数水平上的标量响应和功能协变量。我们建立了一个Cramer-vonMises型检验统计量,该统计量基于由函数协变量的随机投影索引的经验过程,有效避免了预测假设下的“维度诅咒”,这几乎肯定等同于零假设。在一些温和的假设下获得了所提出的检验统计量的渐近零分布。然后研究了我们的检验统计量的渐近全局和局部幂性质。我们特别证明了该统计数据能够检测到以参数速率收敛到null的广泛类型的局部替代方案。此外,我们建议使用一种简单的乘数引导方法来估计临界值。通过几个蒙特卡罗模拟实验检查了我们统计量的有限样本性能。最后,对EEG数据集的分析用于显示我们提出的测试统计量的实用性和多功能性。
    We propose a new non-parametric conditional independence test for a scalar response and a functional covariate over a continuum of quantile levels. We build a Cramer-von Mises type test statistic based on an empirical process indexed by random projections of the functional covariate, effectively avoiding the \"curse of dimensionality\" under the projected hypothesis, which is almost surely equivalent to the null hypothesis. The asymptotic null distribution of the proposed test statistic is obtained under some mild assumptions. The asymptotic global and local power properties of our test statistic are then investigated. We specifically demonstrate that the statistic is able to detect a broad class of local alternatives converging to the null at the parametric rate. Additionally, we recommend a simple multiplier bootstrap approach for estimating the critical values. The finite-sample performance of our statistic is examined through several Monte Carlo simulation experiments. Finally, an analysis of an EEG data set is used to show the utility and versatility of our proposed test statistic.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    考虑到功能数据分析的背景,我们通过Gibbs采样器开发并应用了一种新的贝叶斯方法,以选择用于有限表示函数数据的基函数。所提出的方法使用伯努利潜在变量将具有正概率的某些基函数系数分配为零。该过程允许自适应基础选择,因为它可以确定基础的数量以及应该选择哪些来表示功能数据。此外,所提出的程序测量选择过程的不确定性,可以同时应用于多条曲线。开发的方法可以处理由于实验误差和受试者之间的随机个体差异而可能不同的观察曲线,可以在涉及巴西每日COVID-19病例数的真实数据集应用程序中观察到。仿真研究表明了所提出方法的主要性质,例如,它在估计系数方面的准确性以及找到真正的基函数集的过程的强度。尽管是在功能数据分析的背景下开发的,我们还通过仿真将提出的模型与完善的LASSO和贝叶斯LASSO进行了比较,这是针对非功能性数据开发的方法。
    Considering the context of functional data analysis, we developed and applied a new Bayesian approach via the Gibbs sampler to select basis functions for a finite representation of functional data. The proposed methodology uses Bernoulli latent variables to assign zero to some of the basis function coefficients with a positive probability. This procedure allows for an adaptive basis selection since it can determine the number of bases and which ones should be selected to represent functional data. Moreover, the proposed procedure measures the uncertainty of the selection process and can be applied to multiple curves simultaneously. The methodology developed can deal with observed curves that may differ due to experimental error and random individual differences between subjects, which one can observe in a real dataset application involving daily numbers of COVID-19 cases in Brazil. Simulation studies show the main properties of the proposed method, such as its accuracy in estimating the coefficients and the strength of the procedure to find the true set of basis functions. Despite having been developed in the context of functional data analysis, we also compared the proposed model via simulation with the well-established LASSO and Bayesian LASSO, which are methods developed for non-functional data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    肠道微生物组的改变与阿尔茨海默病(AD)的发病机制有关,可用作诊断措施。然而,肠道微生物组的纵向数据及其对AD发生和进展的预后意义的认识有限.本研究的目的是开发基于肠道微生物组数据的AD发展的可靠预测模型。在这项纵向研究中,我们调查了49名轻度认知障碍(MCI)患者的肠道微生物组,平均(SD)随访3.7(0.6)年,使用猎枪宏基因组学。在4年随访(4yFU)结束时,27名MCI患者转化为AD痴呆,22名MCI患者保持稳定。从稳定的MCI患者中区分AD痴呆转化者的最佳分类模型包括24属,在BL处产生0.87的接受者工作特征曲线下面积(AUROC),1yFU为0.92,4yFU为0.95。通过分析25个GO(基因本体论)特征获得了具有功能数据的最佳模型,在BL时AUROC为0.87,1yFU时为0.85,4yFU时为0.81,33KO[京都基因和基因组百科全书(KEGG)直系同源]特征,BL时AUROC为0.79,1yFU为0.88,4yFU为0.82。对这三个模型使用集成学习,包括具有四个年龄参数的临床模型,性别,体重指数(BMI)和载脂蛋白E(ApoE)基因型,在BL时产生0.96的AUROC,1yFU为0.96,4yFU为0.97。总之,我们确定了新颖且及时稳定的肠道微生物组算法,该算法可准确预测MCI患者在4yFU期间进展为AD痴呆.
    Alterations in the gut microbiome are associated with the pathogenesis of Alzheimer\'s disease (AD) and can be used as a diagnostic measure. However, longitudinal data of the gut microbiome and knowledge about its prognostic significance for the development and progression of AD are limited. The aim of the present study was to develop a reliable predictive model based on gut microbiome data for AD development. In this longitudinal study, we investigated the intestinal microbiome in 49 mild cognitive impairment (MCI) patients over a mean (SD) follow-up of 3.7 (0.6) years, using shotgun metagenomics. At the end of the 4-year follow-up (4yFU), 27 MCI patients converted to AD dementia and 22 MCI patients remained stable. The best taxonomic model for the discrimination of AD dementia converters from stable MCI patients included 24 genera, yielding an area under the receiver operating characteristic curve (AUROC) of 0.87 at BL, 0.92 at 1yFU and 0.95 at 4yFU. The best models with functional data were obtained via analyzing 25 GO (Gene Ontology) features with an AUROC of 0.87 at BL, 0.85 at 1yFU and 0.81 at 4yFU and 33 KO [Kyoto Encyclopedia of Genes and Genomes (KEGG) ortholog] features with an AUROC of 0.79 at BL, 0.88 at 1yFU and 0.82 at 4yFU. Using ensemble learning for these three models, including a clinical model with the four parameters of age, gender, body mass index (BMI) and Apolipoprotein E (ApoE) genotype, yielded an AUROC of 0.96 at BL, 0.96 at 1yFU and 0.97 at 4yFU. In conclusion, we identified novel and timely stable gut microbiome algorithms that accurately predict progression to AD dementia in individuals with MCI over a 4yFU period.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    In this article, we propose a novel estimator of extreme conditional quantiles in partial functional linear regression models with heavy-tailed distributions. The conventional quantile regression estimators are often unstable at the extreme tails due to data sparsity, especially for heavy-tailed distributions. We first estimate the slope function and the partially linear coefficient using a functional quantile regression based on functional principal component analysis, which is a robust alternative to the ordinary least squares regression. The extreme conditional quantiles are then estimated by using a new extrapolation technique from extreme value theory. We establish the asymptotic normality of the proposed estimator and illustrate its finite sample performance by simulation studies and an empirical analysis of diffusion tensor imaging data from a cognitive disorder study.
    Dans cet article, un nouvel estimateur de quantiles conditionnels extrêmes est élaboré dans le cadre de modèles de régression linéaire fonctionnelle partielle avec des distributions à queues lourdes. Il est bien connu que la rareté des observations dans les ailes extrêmes de distributions à queues lourdes rend souvent les estimateurs de régression quantile usuels instables. Pour parer à la non robustesse des moindres carrés classiques, les auteurs ont commencé par estimer la fonction de pente et le coefficient partiellement linéaire d’une régression quantile en ayant recours à une approche basée sur l’analyse en composantes principales fonctionnelles. Ensuite, ils ont estimé les quantiles conditionnels extrêmes à l’aide d’une nouvelle technique d’extrapolation issue de la théorie des valeurs extrêmes. En plus d’établir la normalité asymptotique de l’estimateur proposé, les auteurs illustrent ses bonnes performances à distance finie par le biais d’une étude de simulation et une mise en oeuvre pratique sur les données d’imagerie de diffusion par tenseurs provenant d’une étude portant sur des troubles cognitifs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    现代医疗设备越来越多地产生复杂的数据,这些数据可以为潜在疾病的生理机制提供更深入的见解。医学成像研究中经常出现的一种复杂数据是功能数据,其采样单位是平滑连续函数。在这项工作中,为了建立涉及现代医学成像设备的实验的科学有效性,我们专注于评估通过不同方法(即不同技术或评估者)在同一主题上测量的多功能数据的可靠性和可重复性的问题。具体来说,我们开发了一系列可以评估方法内的组内相关系数和一致性相关系数指标,间方法,和基于由每种不同方法产生的复制功能数据测量结果组成的多变量多级功能数据的总(内部+内部)协议。为了有效估计,所提出的指数使用多变量多级函数混合效应模型的方差分量表示,可以通过功能主成分分析平滑估计。进行了广泛的模拟研究,以评估估计器的有限样本属性。所提出的方法用于评估由高科技放射性核素图像扫描产生的肾图曲线的可靠性和可重复性,该放射性核素图像扫描用于无创检测肾梗阻。
    Modern medical devices are increasingly producing complex data that could offer deeper insights into physiological mechanisms of underlying diseases. One type of complex data that arises frequently in medical imaging studies is functional data, whose sampling unit is a smooth continuous function. In this work, with the goal of establishing the scientific validity of experiments involving modern medical imaging devices, we focus on the problem of evaluating reliability and reproducibility of multiple functional data that are measured on the same subjects by different methods (i.e. different technologies or raters). Specifically, we develop a series of intraclass correlation coefficient and concordance correlation coefficient indices that can assess intra-method, inter-method, and total (intra + inter) agreement based on multivariate multilevel functional data consisting of replicated functional data measurements produced by each of the different methods. For efficient estimation, the proposed indices are expressed using variance components of a multivariate multilevel functional mixed effect model, which can be smoothly estimated by functional principal component analysis. Extensive simulation studies are performed to assess the finite-sample properties of the estimators. The proposed method is applied to evaluate the reliability and reproducibility of renogram curves produced by a high-tech radionuclide image scan used to non-invasively detect kidney obstruction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号