hypothesis testing

假设检验
  • 文章类型: Journal Article
    提高传染病传播预报的准确性,最近引入了一种混合模型,在该模型中,通过机器学习(ML)模型从强制缓解政策数据中积极估计通常假定的恒定疾病传播率,然后将其馈送到扩展的易感感染-恢复模型,以预测感染病例数.只测试一个ML模型,也就是说,梯度增强模型(GBM),这项工作还没有完成,其他ML模型是否会表现得更好。这里,我们比较了GBM,线性回归,k-最近的邻居,和贝叶斯网络(BNs)根据未来35天的政策指数预测美国和加拿大各省的COVID-19感染病例数。这些ML模型在组合数据集上的平均绝对百分比误差没有显著差异[H(3)=3.10,p=0.38]。在两个省,观察到显著差异[H(3)=8.77,H(3)=8.07,p<0.05],然而,posthoc测试显示,在成对比较中没有显着差异。然而,在大多数训练数据集中,BNs的表现明显优于其他模型。结果表明,ML模型总体上具有相等的预测能力,和BNs最适合数据拟合应用。
    To improve the forecasting accuracy of the spread of infectious diseases, a hybrid model was recently introduced where the commonly assumed constant disease transmission rate was actively estimated from enforced mitigating policy data by a machine learning (ML) model and then fed to an extended susceptible-infected-recovered model to forecast the number of infected cases. Testing only one ML model, that is, gradient boosting model (GBM), the work left open whether other ML models would perform better. Here, we compared GBMs, linear regressions, k-nearest neighbors, and Bayesian networks (BNs) in forecasting the number of COVID-19-infected cases in the United States and Canadian provinces based on policy indices of future 35 days. There was no significant difference in the mean absolute percentage errors of these ML models over the combined dataset [H(3)=3.10,p=0.38]. In two provinces, a significant difference was observed [H(3)=8.77,H(3)=8.07,p<0.05], yet posthoc tests revealed no significant difference in pairwise comparisons. Nevertheless, BNs significantly outperformed the other models in most of the training datasets. The results put forward that the ML models have equal forecasting power overall, and BNs are best for data-fitting applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    我们将纵向全基因组关联研究中不规则时间点观察到的阿尔茨海默病相关表型响应变量建模为稀疏功能数据,并提出非参数检验程序来检测功能基因型效应,同时控制环境协变量的混杂效应。我们对协方差测试的新功能分析基于看似无关的内核平滑器,它考虑了受试者内部的时间相关性,从而享受比现有功能测试更好的能力。我们证明了所提出的测试与一致一致的非参数协方差函数估计器相结合,具有Wilks现象,并且是minimax最强大的。本文所用的数据来自阿尔茨海默病神经影像学倡议(ADNI)数据库,其中提出的测试的应用导致发现了可能与阿尔茨海默病有关的新基因。
    We model the Alzheimer\'s Disease-related phenotype response variables observed on irregular time points in longitudinal Genome-Wide Association Studies as sparse functional data and propose nonparametric test procedures to detect functional genotype effects while controlling the confounding effects of environmental covariates. Our new functional analysis of covariance tests are based on a seemingly unrelated kernel smoother, which takes into account the within-subject temporal correlations, and thus enjoy improved power over existing functional tests. We show that the proposed test combined with a uniformly consistent nonparametric covariance function estimator enjoys the Wilks phenomenon and is minimax most powerful. Data used in the preparation of this article were obtained from the Alzheimer\'s Disease Neuroimaging Initiative (ADNI) database, where an application of the proposed test lead to the discovery of new genes that may be related to Alzheimer\'s Disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    神经创伤领域正在努力应对最近发现的复制危机的影响。因此,必须注意识别和执行最合适的统计分析。这将防止滥用研究资源,并确保结论合理且在数据范围内。我们预计,贝叶斯统计方法将在未来几年看到越来越多的使用。贝叶斯方法将先验信念(或先验数据)集成到统计模型中,以合并历史信息和当前实验数据。这些方法可以提高检测实验组之间差异的能力(即,统计能力)在适当使用时。然而,如果要实施或评估这些分析,研究人员需要意识到这些方法的优势和局限性。最终,使用贝叶斯方法的方法可能对统计能力有实质性的好处,但是在识别和定义先前的信念时需要谨慎。
    The field of neurotrauma is grappling with the effects of the recently identified replication crisis. As such, care must be taken to identify and perform the most appropriate statistical analyses. This will prevent misuse of research resources and ensure that conclusions are reasonable and within the scope of the data. We anticipate that Bayesian statistical methods will see increasing use in the coming years. Bayesian methods integrate prior beliefs (or prior data) into a statistical model to merge historical information and current experimental data. These methods may improve the ability to detect differences between experimental groups (i.e., statistical power) when used appropriately. However, researchers need to be aware of the strengths and limitations of such approaches if they are to implement or evaluate these analyses. Ultimately, an approach using Bayesian methodologies may have substantial benefits to statistical power, but caution needs to be taken when identifying and defining prior beliefs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    假设检验(HT)的主要挑战在于确定I型(假阳性)和II型(未检测或假阴性)错误概率之间的最佳平衡。分析这些误差的指数收敛速度,被称为误差指数,提供对系统性能的重要见解。误差指数提供了一个镜头,通过它我们可以理解操作限制,例如资源限制和通信障碍,影响网络系统中分布式推理的准确性。这项调查全面回顾了HT的关键结果,从基础斯坦的引理到分布式HT的最新进展,都是通过误差指数的框架统一起来的。我们探索渐近和非渐近结果,强调它们对设计健壮高效的网络系统的影响,例如通过有损无线传感器监控网络进行事件检测,车辆环境中基于集体感知的目标检测,和分布式环境中的时钟同步,在其他人中。我们表明,理解错误指数的作用为优化决策和提高联网系统的可靠性提供了有价值的工具。
    A central challenge in hypothesis testing (HT) lies in determining the optimal balance between Type I (false positive) and Type II (non-detection or false negative) error probabilities. Analyzing these errors\' exponential rate of convergence, known as error exponents, provides crucial insights into system performance. Error exponents offer a lens through which we can understand how operational restrictions, such as resource constraints and impairments in communications, affect the accuracy of distributed inference in networked systems. This survey presents a comprehensive review of key results in HT, from the foundational Stein\'s Lemma to recent advancements in distributed HT, all unified through the framework of error exponents. We explore asymptotic and non-asymptotic results, highlighting their implications for designing robust and efficient networked systems, such as event detection through lossy wireless sensor monitoring networks, collective perception-based object detection in vehicular environments, and clock synchronization in distributed environments, among others. We show that understanding the role of error exponents provides a valuable tool for optimizing decision-making and improving the reliability of networked systems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    从很小的时候,儿童以某种方式探索他们的环境,表明他们推理因果变量并寻求因果解释。的确,经过对黑猩猩解决问题能力的广泛研究,波维内利(猿类民间物理学,牛津大学出版社,2000)提出,这种推理不可观察变量的能力是人类独有的。从这个开始,波维内利和邓菲-莱利(加拿大实验心理学杂志,55(2)、187-195,2001)解决了黑猩猩是否会探索物体的问题,目的是阐明不可观察和令人惊讶的物体属性。黑猩猩,与学龄前儿童不同,在对象的不可观察属性发生变化后,没有显示对象探索的增加。我们批判性地讨论了这些发现,并认为需要使用更多种类的方法和更多物种的更多研究来支持只有人类参与解释寻求的假设。最后,我们强调了自Povinelli和Dunphy-Lelii进行原始调查以来,基于针对对象探索和信息寻求的发展和比较研究的未来研究途径。
    From an early age, children explore their environment in a way suggesting that they reason about causal variables and seek causal explanations. Indeed, following extensive studies of problem-solving abilities in chimpanzees, Povinelli (Folk Physics for Apes, Oxford University Press, 2000) proposed that this ability to reason about unobservable variables is unique to humans. Following on from this, Povinelli and Dunphy-Lelii (Canadian Journal of Experimental Psychology, 55(2), 187-195, 2001) addressed the question whether chimpanzees would explore objects with the aim of elucidating unobservable and surprising object properties. Chimpanzees, unlike preschool children, did not show increased object exploration following a change in the unobservable properties of an object. We critically discuss these findings and argue that more research using a greater variety of methods and with a larger number of species is required to support the hypothesis that only humans engage in explanation seeking. We conclude by highlighting avenues for future research based on developmental and comparative research aimed at object exploration and information seeking conducted since the original investigation by Povinelli and Dunphy-Lelii.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    学习新类别时,在两种不同的多模式类别学习任务中,儿童是否从与成人相同类型的培训中受益?我们比较了基于反馈的培训与观察性培训在年轻人(18-25岁)和早期学龄儿童(6-7岁)中的效果:基于联合规则和信息整合。我们使用了在视觉特征(“行星”刺激的旋转速度)和听觉特征(纯音刺激的音调频率)之间变化的多模态刺激。我们发现了基于规则的类别任务的年龄和训练类型之间的相互作用,这样成年人在反馈训练中比在观察训练中表现更好,而训练类型对儿童的类别学习表现没有显著影响。在学习基于规则的类别结构和信息集成类别结构方面,总体上成年人的表现优于儿童。在信息集成类别学习中,反馈训练与观察训练对成人或儿童类别学习均无显著影响.计算模型显示,孩子在这两项任务中都默认了单变量规则。发现儿童无法从反馈培训中受益,并且可以通过观察学习成功学习,这对设计适合儿童的教育干预措施具有启示意义。
    When learning new categories, do children benefit from the same types of training as adults? We compared the effects of feedback-based training with observational training in young adults (ages 18-25) and early school aged children (ages 6-7) across two different multimodal category learning tasks: conjunctive rule based and information integration. We used multimodal stimuli that varied across a visual feature (rotation speed of the \"planet\" stimulus) and an auditory feature (pitch frequency of a pure tone stimulus). We found an interaction between age and training type for the rule-based category task, such that adults performed better in feedback training than in observational training, whereas training type had no significant effect on children\'s category learning performance. Overall adults performed better than children in learning both the rule based and information integration category structures. In information integration category learning, feedback versus observational training did not have a significant effect on either adults\' or children\'s category learning. Computational modelling revealed that children defaulted to univariate rules in both tasks. The finding that children do not benefit from feedback training and can learn successfully via observational learning has implications for the design of educational interventions appropriate for children.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    大脑结构和功能测量值之间的个体内部耦合在发育中发展,可能是神经精神疾病风险的基础。尽管人们对结构-功能关系的发展越来越感兴趣,量化和测试耦合个体差异的严格方法仍处于起步阶段。在这篇文章中,我们探索并解决了测试和空间定位模态间耦合中个体差异的方法差距。我们提出了一种新的方法,称为CIDeR,它旨在以限制假阳性结果并提高对真阳性结果的检测的方式同时进行假设检验。通过比较不同的方法来测试模态间耦合的个体差异,我们描绘了他们测试的假设中的细微差异,这可能最终导致研究人员得出不同的结果。最后,我们使用来自费城神经发育队列的数据说明CIDeR在大脑发育的两个应用中的实用性。
    Within-individual coupling between measures of brain structure and function evolves in development and may underlie differential risk for neuropsychiatric disorders. Despite increasing interest in the development of structure-function relationships, rigorous methods to quantify and test individual differences in coupling remain nascent. In this article, we explore and address gaps in approaches for testing and spatially localizing individual differences in intermodal coupling. We propose a new method, called CIDeR, which is designed to simultaneously perform hypothesis testing in a way that limits false positive results and improve detection of true positive results. Through a comparison across different approaches to testing individual differences in intermodal coupling, we delineate subtle differences in the hypotheses they test, which may ultimately lead researchers to arrive at different results. Finally, we illustrate the utility of CIDeR in two applications to brain development using data from the Philadelphia Neurodevelopmental Cohort.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分析崩溃数据是一个复杂且劳动密集型的过程,需要仔细考虑多个相互依赖的建模方面。如功能形式,转换,可能的促成因素,相关性,和未观察到的异质性。有限的时间,知识,经验可能会导致过度简化,过度装配,或忽略重要见解的错误指定模型。本文提出了一个广泛的假设检验框架,包括多目标数学规划公式和求解算法,以估计同时考虑可能的影响因素的碰撞频率模型。转换,非线性,和相关的随机参数。数学规划公式最小化样本内拟合和样本外预测。为了解决数学程序的复杂性和非凸性,所提出的解决方案框架利用各种元启发式解决方案算法。具体来说,和声搜索对超参数的敏感性最小,实现对解决方案的有效搜索,而不受超参数选择的影响。使用两个真实世界的数据集和一个合成数据集来评估框架的有效性。使用两个真实世界的数据集和独立团队在文献中发布的相应模型进行比较分析。所提出的框架显示了其查明有效模型规格的能力,产生准确的估计,并为研究人员和从业人员提供有价值的见解。所提出的方法可以发现许多见解,同时最大程度地减少模型开发所花费的时间。通过考虑更广泛的因素,可以生成具有不同质量的模型。例如,当应用于昆士兰州的崩溃数据时,拟议的方法表明,在急剧弯曲的道路上加入中间分隔可以有效减少撞车的发生,当应用于华盛顿的崩溃数据时,同时考虑交通量和道路曲率,导致碰撞差异显着减少,但碰撞手段增加。
    Analyzing crash data is a complex and labor-intensive process that requires careful consideration of multiple interdependent modeling aspects, such as functional forms, transformations, likely contributing factors, correlations, and unobserved heterogeneity. Limited time, knowledge, and experience may lead to over-simplified, over-fitted, or misspecified models overlooking important insights. This paper proposes an extensive hypothesis testing framework including a multi-objective mathematical programming formulation and solution algorithms to estimate crash frequency models considering simultaneously likely contributing factors, transformations, non-linearities, and correlated random parameters. The mathematical programming formulation minimizes both in-sample fit and out-of-sample prediction. To address the complexity and non-convexity of the mathematical program, the proposed solution framework utilizes a variety of metaheuristic solution algorithms. Specifically, Harmony Search demonstrated minimal sensitivity to hyperparameters, enabling an efficient search for solutions without being influenced by the choice of hyperparameters. The effectiveness of the framework was evaluated using two real-world datasets and one synthetic dataset. Comparative analyses were performed using the two real-world datasets and the corresponding models published in literature by independent teams. The proposed framework showed its capability to pinpoint efficient model specifications, produce accurate estimates, and provide valuable insights for both researchers and practitioners. The proposed approach allows for the discovery of numerous insights while minimizing the time spent on model development. By considering a broader set of contributing factors, models with varied qualities can be generated. For instance, when applied to crash data from Queensland, the proposed approach revealed that the inclusion of medians on sharp curved roads can effectively reduce the occurrence of crashes, when applied to crash data from Washington, the simultaneous consideration of traffic volume and road curvature resulted in a notable reduction in crash variances but an increase in crash means.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    分位数回归已成为分析竞争风险数据的广泛使用的工具。然而,具有连续标记的竞争风险数据的分位数回归仍然很少。标记变量是经典竞争风险模型中失败原因的扩展,其中失败原因被仅在未经审查的失败时间观察到的连续标记代替。连续标记变量的实例是测量感染病毒与疫苗构建体中包含的病毒之间的不相似性的遗传距离。在这篇文章中,我们提出了一种新的标记特定分位数回归模型。所提出的估计方法从标记附近的数据中借用强度,并基于诱导的平滑估计方程,这与现有的具有离散原因的竞争风险数据的方法有很大不同。所得估计量的渐近性质是在标记和分位数连续体中建立的。此外,提出了一种标记特异性分位数型疫苗效力,并开发了其统计推断程序。进行了模拟研究,以评估所提出的估计和假设检验程序的有限样本性能。提供了第一个HIV疫苗效力试验的应用。
    Quantile regression has become a widely used tool for analysing competing risk data. However, quantile regression for competing risk data with a continuous mark is still scarce. The mark variable is an extension of cause of failure in a classical competing risk model where cause of failure is replaced by a continuous mark only observed at uncensored failure times. An example of the continuous mark variable is the genetic distance that measures dissimilarity between the infecting virus and the virus contained in the vaccine construct. In this article, we propose a novel mark-specific quantile regression model. The proposed estimation method borrows strength from data in a neighbourhood of a mark and is based on an induced smoothed estimation equation, which is very different from the existing methods for competing risk data with discrete causes. The asymptotic properties of the resulting estimators are established across mark and quantile continuums. In addition, a mark-specific quantile-type vaccine efficacy is proposed and its statistical inference procedures are developed. Simulation studies are conducted to evaluate the finite sample performances of the proposed estimation and hypothesis testing procedures. An application to the first HIV vaccine efficacy trial is provided.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    功能网络通常指导我们对脑表型关联的空间图的解释。然而,评估感兴趣的网络中关联的丰富程度的方法在科学严谨和基本假设方面都有所不同。虽然有些方法依赖于主观解释,其他人对成像数据的空间属性做出了不切实际的假设,导致假阳性率膨胀。我们寻求通过借鉴遗传学研究中广泛使用的方法来测试一组基因与感兴趣的表型之间关联的富集,从而解决现有方法中的这一差距。我们提出了网络富集显著性测试(NEST),一个灵活的框架,用于测试脑表型关联与功能网络或大脑其他子区域的特异性。我们应用NEST来研究来自大规模神经发育队列研究的结构和功能脑成像数据的关联。
    Functional networks often guide our interpretation of spatial maps of brain-phenotype associations. However, methods for assessing enrichment of associations within networks of interest have varied in terms of both scientific rigor and underlying assumptions. While some approaches have relied on subjective interpretations, others have made unrealistic assumptions about spatial properties of imaging data, leading to inflated false positive rates. We seek to address this gap in existing methodology by borrowing insight from a method widely used in genetics research for testing enrichment of associations between a set of genes and a phenotype of interest. We propose network enrichment significance testing (NEST), a flexible framework for testing the specificity of brain-phenotype associations to functional networks or other sub-regions of the brain. We apply NEST to study enrichment of associations with structural and functional brain imaging data from a large-scale neurodevelopmental cohort study.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号