function approximation

  • 文章类型: Journal Article
    背景:样本信息的期望值(EVSI)衡量了通过收集其他数据可以获得的预期收益。使用传统的嵌套蒙特卡罗方法估计EVSI在计算上是昂贵的,但是最近开发的高斯近似(GA)方法可以有效地估计不同样本量的EVSI。然而,如果决策模型是高度非线性的,传统的遗传算法可能会导致有偏差的EVSI估计。当使用GA优化不同研究的价值时,这种偏差可能导致次优的研究设计。因此,我们扩展了传统的遗传算法,以提高其在非线性决策模型中的性能。
    方法:我们的方法通过基于2个步骤近似收益的条件期望来提供准确的EVSI估计。首先,应用泰勒级数逼近来估计收益的条件期望,作为使用样条的感兴趣参数的条件矩的函数,对样本的参数和相应的收益进行拟合。接下来,参数的条件矩通过常规GA和Fisher信息近似。所提出的方法适用于涉及非高斯参数和非线性决策模型的多个数据收集练习。将其性能与嵌套蒙特卡罗方法进行了比较,传统的GA方法,以及基于非参数回归的EVSI计算方法。
    结果:当感兴趣的参数为非高斯且决策模型为非线性时,所提出的方法可跨不同样本量提供准确的EVSI估计。所提出的方法的计算成本与其他新颖方法的计算成本相似。
    结论:所提出的方法可以准确有效地估计跨样本量的EVSI,这可以支持研究人员使用EVSI确定经济上最佳的研究设计。
    结论:高斯近似方法有效地估计了不同样本量的临床试验的样本信息(EVSI)的期望值,但是当健康经济模型具有非线性结构时,它可能会引入偏差。我们引入了基于样条的泰勒级数逼近方法,并将其与原始高斯逼近相结合,以校正EVSI估计中的非线性引起的偏差。我们的方法可以在不牺牲计算效率的情况下为复杂的决策模型提供更精确的EVSI估计,这可以从成本效益的角度增强资源分配策略。
    BACKGROUND: The expected value of sample information (EVSI) measures the expected benefits that could be obtained by collecting additional data. Estimating EVSI using the traditional nested Monte Carlo method is computationally expensive, but the recently developed Gaussian approximation (GA) approach can efficiently estimate EVSI across different sample sizes. However, the conventional GA may result in biased EVSI estimates if the decision models are highly nonlinear. This bias may lead to suboptimal study designs when GA is used to optimize the value of different studies. Therefore, we extend the conventional GA approach to improve its performance for nonlinear decision models.
    METHODS: Our method provides accurate EVSI estimates by approximating the conditional expectation of the benefit based on 2 steps. First, a Taylor series approximation is applied to estimate the conditional expectation of the benefit as a function of the conditional moments of the parameters of interest using a spline, which is fitted to the samples of the parameters and the corresponding benefits. Next, the conditional moments of parameters are approximated by the conventional GA and Fisher information. The proposed approach is applied to several data collection exercises involving non-Gaussian parameters and nonlinear decision models. Its performance is compared with the nested Monte Carlo method, the conventional GA approach, and the nonparametric regression-based method for EVSI calculation.
    RESULTS: The proposed approach provides accurate EVSI estimates across different sample sizes when the parameters of interest are non-Gaussian and the decision models are nonlinear. The computational cost of the proposed method is similar to that of other novel methods.
    CONCLUSIONS: The proposed approach can estimate EVSI across sample sizes accurately and efficiently, which may support researchers in determining an economically optimal study design using EVSI.
    CONCLUSIONS: The Gaussian approximation method efficiently estimates the expected value of sample information (EVSI) for clinical trials with varying sample sizes, but it may introduce bias when health economic models have a nonlinear structure.We introduce the spline-based Taylor series approximation method and combine it with the original Gaussian approximation to correct the nonlinearity-induced bias in EVSI estimation.Our approach can provide more precise EVSI estimates for complex decision models without sacrificing computational efficiency, which can enhance the resource allocation strategies from the cost-effective perspective.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在本文中,提出了利用约束最小二乘将非线性函数和/或离散数据简化为分段多项式的最优逼近算法。在对时间敏感的应用程序或资源有限的嵌入式系统中,近似函数的运行时间与其准确性一样重要。所提出的算法在确保误差低于指定阈值的同时,以最小的计算成本搜索最佳分段多项式(OPP)。这是通过使用具有最佳顺序和间隔数的平滑分段多项式来实现的。计算成本仅取决于多项式复杂度,即,运行时函数调用的顺序和间隔数。在以往的研究中,用户必须决定一个或所有的订单和间隔的数量。相比之下,OPP近似算法决定了两者。对于最佳逼近,对于特定的目标CPU离线,按升序计算了所有可能的分段多项式组合的计算成本,并将其制成表格。对于给定的样本点,通过约束最小二乘法和随机选择方法对每个组合进行优化。之后,检查近似误差是否低于预定值。如果错误是允许的,该组合被选择为最优近似,或者检查了下一个组合。要验证性能,对几个具有代表性的函数进行了检查和分析。
    In this paper, the optimal approximation algorithm is proposed to simplify non-linear functions and/or discrete data as piecewise polynomials by using the constrained least squares. In time-sensitive applications or in embedded systems with limited resources, the runtime of the approximate function is as crucial as its accuracy. The proposed algorithm searches for the optimal piecewise polynomial (OPP) with the minimum computational cost while ensuring that the error is below a specified threshold. This was accomplished by using smooth piecewise polynomials with optimal order and numbers of intervals. The computational cost only depended on polynomial complexity, i.e., the order and the number of intervals at runtime function call. In previous studies, the user had to decide one or all of the orders and the number of intervals. In contrast, the OPP approximation algorithm determines both of them. For the optimal approximation, computational costs for all the possible combinations of piecewise polynomials were calculated and tabulated in ascending order for the specific target CPU off-line. Each combination was optimized through constrained least squares and the random selection method for the given sample points. Afterward, whether the approximation error was below the predetermined value was examined. When the error was permissible, the combination was selected as the optimal approximation, or the next combination was examined. To verify the performance, several representative functions were examined and analyzed.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在具有局部调整的神经元模型的神经网络中学习,例如径向基函数(RBF)网络通常被认为是不稳定的,特别是当使用多层体系结构时。此外,单层RBF网络的普遍逼近定理是非常成熟的;因此,更深层次的架构在理论上是不需要的。因此,RBF主要以单层方式使用。然而,深度神经网络已经证明了它们在许多不同任务上的有效性。在本文中,我们表明,具有多个径向基函数层的更深的RBF架构可以与有效的学习方案一起设计。我们介绍了一种基于k均值聚类和协方差估计的深度RBF网络初始化方案。我们进一步展示了如何利用卷积以部分连接的方式加快马氏距离的计算,这类似于卷积神经网络(CNN)。最后,我们评估了我们在图像分类和语音情感识别任务上的方法。我们的结果表明,深度RBF网络表现非常好,具有与其他深度神经网络类型相当的结果,比如CNN。
    Learning in neural networks with locally-tuned neuron models such as radial Basis Function (RBF) networks is often seen as instable, in particular when multi-layered architectures are used. Furthermore, universal approximation theorems for single-layered RBF networks are very well established; therefore, deeper architectures are theoretically not required. Consequently, RBFs are mostly used in a single-layered manner. However, deep neural networks have proven their effectiveness on many different tasks. In this paper, we show that deeper RBF architectures with multiple radial basis function layers can be designed together with efficient learning schemes. We introduce an initialization scheme for deep RBF networks based on k-means clustering and covariance estimation. We further show how to make use of convolutions to speed up the calculation of the Mahalanobis distance in a partially connected way, which is similar to the convolutional neural networks (CNNs). Finally, we evaluate our approach on image classification as well as speech emotion recognition tasks. Our results show that deep RBF networks perform very well, with comparable results to other deep neural network types, such as CNNs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    使用基于模拟的敏感性分析是评估和比较未来临床试验的候选设计的基础。在这种情况下,灵敏度分析对于评估重要的设计操作特性对各种未知参数的依赖性特别有用。操作特征的典型示例包括检测治疗效果的可能性和平均研究持续时间,这取决于在临床研究开始之前未知的参数,例如主要结局和患者概况的分布。敏感性分析的两个关键组成部分是(i)选择一组合理的模拟方案和(ii)感兴趣的操作特征列表。我们提出了一种新的方法来选择要包括在敏感性分析中的一组方案。我们最大化了一个效用标准,该标准形式化了一组特定的敏感性方案是否足以总结试验设计的运行特性在未知参数的合理值之间的变化。然后,我们使用优化技术选择一组最佳的模拟方案(根据研究者指定的标准)来举例说明试验设计的操作特征.我们在三个试验设计中说明了我们的建议。
    The use of simulation-based sensitivity analyses is fundamental for evaluating and comparing candidate designs of future clinical trials. In this context, sensitivity analyses are especially useful to assess the dependence of important design operating characteristics with respect to various unknown parameters. Typical examples of operating characteristics include the likelihood of detecting treatment effects and the average study duration, which depend on parameters that are unknown until after the onset of the clinical study, such as the distributions of the primary outcomes and patient profiles. Two crucial components of sensitivity analyses are (i) the choice of a set of plausible simulation scenarios and (ii) the list of operating characteristics of interest. We propose a new approach for choosing the set of scenarios to be included in a sensitivity analysis. We maximize a utility criterion that formalizes whether a specific set of sensitivity scenarios is adequate to summarize how the operating characteristics of the trial design vary across plausible values of the unknown parameters. Then, we use optimization techniques to select the best set of simulation scenarios (according to the criteria specified by the investigator) to exemplify the operating characteristics of the trial design. We illustrate our proposal in three trial designs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Multiagent coordination is highly desirable with many uses in a variety of tasks. In nature, the phenomenon of coordinated flocking is highly common with applications related to defending or escaping from predators. In this article, a hybrid multiagent system that integrates consensus, cooperative learning, and flocking control to determine the direction of attacking predators and learns to flock away from them in a coordinated manner is proposed. This system is entirely distributed requiring only communication between neighboring agents. The fusion of consensus and collaborative reinforcement learning allows agents to cooperatively learn in a variety of multiagent coordination tasks, but this article focuses on flocking away from attacking predators. The results of the flocking show that the agents are able to effectively flock to a target without collision with each other or obstacles. Multiple reinforcement learning methods are evaluated for the task with cooperative learning utilizing function approximation for state-space reduction performing the best. The results of the proposed consensus algorithm show that it provides quick and accurate transmission of information between agents in the flock. Simulations are conducted to show and validate the proposed hybrid system in both one and two predator environments, resulting in an efficient cooperative learning behavior. In the future, the system of using consensus to determine the state and reinforcement learning to learn the states can be applied to additional multiagent tasks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    The mathematical foundation of deep learning is the theorem that any continuous function can be approximated within any specified accuracy by using a neural network with certain non-linear activation functions. However, this theorem does not tell us what the network architecture should be and what the values of the weights are. One must train the network to estimate the weights. There is no guarantee that the optimal weights can be reached after training. This paper develops an explicit architecture of a universal deep network by using the Gray code order and develops an explicit formula for the weights of this deep network. This architecture is target function independent. Once the target function is known, the weights are calculated by the proposed formula, and no training is required. There is no concern whether the training may or may not reach the optimal weights. This deep network gives the same result as the shallow piecewise linear interpolation function for an arbitrary target function.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    We derive bounds on the error, in high-order Sobolev norms, incurred in the approximation of Sobolev-regular as well as analytic functions by neural networks with the hyperbolic tangent activation function. These bounds provide explicit estimates on the approximation error with respect to the size of the neural networks. We show that tanh neural networks with only two hidden layers suffice to approximate functions at comparable or better rates than much deeper ReLU neural networks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    忆阻器作为神经形态计算元件引起了人们的兴趣,因为它们在实现人工神经元和突触的有效硬件实现方面显示出希望。我们对接口型忆阻器进行了测量,以验证它们在神经形态硬件中的使用。具体来说,我们利用Nb掺杂的SrTiO3忆阻器作为模拟神经网络的突触,将它们排列成差分突触对,连接的权重由两个配对忆阻器之间的归一化电导值的差异给出。该网络通过基于新颖的监督学习算法的训练过程学习来表示函数,在此期间,离散电压脉冲被施加到每对中的两个忆阻器之一。为了模拟物理忆阻器件的初始状态和每个电压脉冲的影响都是未知的事实,我们将噪声注入到模拟中。然而,基于局部知识的离散更新被证明可以产生鲁棒的学习性能。使用这类忆阻器件作为尖峰神经网络中的突触权重元素,根据我们的知识,这类最早的模型之一,能够学习成为通用函数逼近器,并强烈建议这些忆阻器适用于未来的计算平台。
    Memristors have attracted interest as neuromorphic computation elements because they show promise in enabling efficient hardware implementations of artificial neurons and synapses. We performed measurements on interface-type memristors to validate their use in neuromorphic hardware. Specifically, we utilized Nb-doped SrTiO3 memristors as synapses in a simulated neural network by arranging them into differential synaptic pairs, with the weight of the connection given by the difference in normalized conductance values between the two paired memristors. This network learned to represent functions through a training process based on a novel supervised learning algorithm, during which discrete voltage pulses were applied to one of the two memristors in each pair. To simulate the fact that both the initial state of the physical memristive devices and the impact of each voltage pulse are unknown we injected noise into the simulation. Nevertheless, discrete updates based on local knowledge were shown to result in robust learning performance. Using this class of memristive devices as the synaptic weight element in a spiking neural network yields, to our knowledge, one of the first models of this kind, capable of learning to be a universal function approximator, and strongly suggests the suitability of these memristors for usage in future computing platforms.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    There is a longstanding debate whether the Kolmogorov-Arnold representation theorem can explain the use of more than one hidden layer in neural networks. The Kolmogorov-Arnold representation decomposes a multivariate function into an interior and an outer function and therefore has indeed a similar structure as a neural network with two hidden layers. But there are distinctive differences. One of the main obstacles is that the outer function depends on the represented function and can be wildly varying even if the represented function is smooth. We derive modifications of the Kolmogorov-Arnold representation that transfer smoothness properties of the represented function to the outer function and can be well approximated by ReLU networks. It appears that instead of two hidden layers, a more natural interpretation of the Kolmogorov-Arnold representation is that of a deep neural network where most of the layers are required to approximate the interior function.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    人们对深度神经网络的表达能力越来越感兴趣。然而,有关此主题的大多数现有工作仅关注特定的激活函数,例如ReLU或sigmoid。在本文中,我们研究了具有广泛激活函数的深度神经网络的逼近能力。这类激活函数包括大多数经常使用的激活函数。我们得出所需的深度,深度神经网络的宽度和稀疏性,以逼近任何Hölder平滑函数,直到大型激活函数的给定逼近误差。根据我们的近似误差分析,在回归和分类问题中,我们用一般激活函数推导了深度神经网络估计器的极小极大最优性。
    There has been a growing interest in expressivity of deep neural networks. However, most of the existing work about this topic focuses only on the specific activation function such as ReLU or sigmoid. In this paper, we investigate the approximation ability of deep neural networks with a broad class of activation functions. This class of activation functions includes most of frequently used activation functions. We derive the required depth, width and sparsity of a deep neural network to approximate any Hölder smooth function upto a given approximation error for the large class of activation functions. Based on our approximation error analysis, we derive the minimax optimality of the deep neural network estimators with the general activation functions in both regression and classification problems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号