Hyperparameter

超参数
  • 文章类型: Journal Article
    脑肿瘤是全球死亡的主要原因,恶性肿瘤有多种类型,只有12%的被诊断患有脑癌的成年人存活超过五年。这项研究引入了一种超参数卷积神经网络(CNN)模型来识别脑肿瘤,具有重大的实际意义。通过微调CNN模型的超参数,优化特征提取,系统地降低模型复杂度,从而提高脑肿瘤诊断的准确性。关键超参数包括批量大小,层计数,学习率,激活函数,汇集策略,填充,和过滤器尺寸。超参数调整的CNN模型在Kaggle提供的三个不同的脑MRI数据集上进行了训练,产生优异的性能分数,准确度的平均值为97%,精度,召回,和F1得分。我们的优化模型是有效的,正如我们与最先进的方法进行有条理的比较所证明的那样。我们的超参数修改增强了模型性能并增强了其泛化能力,给医生一个更准确和有效的工具来做出关于脑肿瘤诊断的关键判断。我们的模型是朝着值得信赖和准确的医疗诊断的正确方向迈出的重要一步,对改善患者预后具有实际意义。
    Brain tumors are a leading cause of death globally, with numerous types varying in malignancy, and only 12% of adults diagnosed with brain cancer survive beyond five years. This research introduces a hyperparametric convolutional neural network (CNN) model to identify brain tumors, with significant practical implications. By fine-tuning the hyperparameters of the CNN model, we optimize feature extraction and systematically reduce model complexity, thereby enhancing the accuracy of brain tumor diagnosis. The critical hyperparameters include batch size, layer counts, learning rate, activation functions, pooling strategies, padding, and filter size. The hyperparameter-tuned CNN model was trained on three different brain MRI datasets available at Kaggle, producing outstanding performance scores, with an average value of 97% for accuracy, precision, recall, and F1-score. Our optimized model is effective, as demonstrated by our methodical comparisons with state-of-the-art approaches. Our hyperparameter modifications enhanced the model performance and strengthened its capacity for generalization, giving medical practitioners a more accurate and effective tool for making crucial judgments regarding brain tumor diagnosis. Our model is a significant step in the right direction toward trustworthy and accurate medical diagnosis, with practical implications for improving patient outcomes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    肌肉减少性肥胖(SO)的特征是伴随的肌肉减少症和肥胖,并存在残疾的高风险,发病率,和老年人的死亡率。然而,缺乏基于序贯神经网络SO研究的预测以及体能因子与SO之间的关系。本研究旨在通过关注身体健康因素来开发老年人SO的预测模型。使用顺序神经网络分析了参加全民健身计划的韩国老年人的综合数据集。使用人体测量方程将阑尾骨骼肌/体重定义为SO。独立变量包括身体脂肪(BF,%),腰围,收缩压和舒张压,和各种体能因素。因变量是二元结果(可能的SO与正常)。我们分析了超参数调整和分层K倍验证,以优化预测模型。女性的SO患病率(13.81%)明显高于男性,突出性别差异。优化后的神经网络模型和Shapley加法解释分析表明,验证准确率高达93.1%,BF%和绝对握力成为SO最有影响力的预测因子。这项研究为老年人的SO提供了一个高度准确的预测模型,强调BF%和绝对握力的关键作用。我们确认了BF,绝对握力,作为关键的SO预测因子。我们的发现强调了SO的性别特异性以及身体健康因素在预测中的重要性。
    Sarcopenic obesity (SO) is characterized by concomitant sarcopenia and obesity and presents a high risk of disability, morbidity, and mortality among older adults. However, predictions based on sequential neural network SO studies and the relationship between physical fitness factors and SO are lacking. This study aimed to develop a predictive model for SO in older adults by focusing on physical fitness factors. A comprehensive dataset of older Korean adults participating in national fitness programs was analyzed using sequential neural networks. Appendicular skeletal muscle/body weight was defined as SO using an anthropometric equation. Independent variables included body fat (BF, %), waist circumference, systolic and diastolic blood pressure, and various physical fitness factors. The dependent variable was a binary outcome (possible SO vs normal). We analyzed hyperparameter tuning and stratified K-fold validation to optimize a predictive model. The prevalence of SO was significantly higher in women (13.81%) than in men, highlighting sex-specific differences. The optimized neural network model and Shapley Additive Explanations analysis demonstrated a high validation accuracy of 93.1%, with BF% and absolute grip strength emerging as the most influential predictors of SO. This study presents a highly accurate predictive model for SO in older adults, emphasizing the critical roles of BF% and absolute grip strength. We identified BF, absolute grip strength, and sit-and-reach as key SO predictors. Our findings underscore the sex-specific nature of SO and the importance of physical fitness factors in its prediction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    深度强化学习(DRL)在不同的领域和应用中获得了广泛的采用,主要是由于其在具有高维状态和动作的空间中解决复杂的决策问题的能力。深度确定性策略梯度(DDPG)是一种众所周知的DRL算法,采用演员-批评方法,综合基于价值和基于策略的强化学习方法的优势。这项研究的目的是全面研究最新发展,模式,障碍,以及与DDPG相关的潜在机会。使用相关的学术数据库进行了系统的搜索(Scopus,WebofScience,和ScienceDirect)确定过去五年(2018-2023年)发表的85项相关研究。我们全面概述了DDPG的关键概念和组件,包括它的配方,实施,和训练。然后,我们重点介绍了DDPG的各种应用和领域,包括自动驾驶,无人机,资源分配,通信和物联网,机器人,和金融。此外,我们提供了DDPG与其他DRL算法和传统RL方法的深入比较,突出它的优点和缺点。我们相信,这次审查将是研究人员的重要资源,为他们提供有关DRL和DDPG领域使用的方法和技术的宝贵见解。
    Deep Reinforcement Learning (DRL) has gained significant adoption in diverse fields and applications, mainly due to its proficiency in resolving complicated decision-making problems in spaces with high-dimensional states and actions. Deep Deterministic Policy Gradient (DDPG) is a well-known DRL algorithm that adopts an actor-critic approach, synthesizing the advantages of value-based and policy-based reinforcement learning methods. The aim of this study is to provide a thorough examination of the latest developments, patterns, obstacles, and potential opportunities related to DDPG. A systematic search was conducted using relevant academic databases (Scopus, Web of Science, and ScienceDirect) to identify 85 relevant studies published in the last five years (2018-2023). We provide a comprehensive overview of the key concepts and components of DDPG, including its formulation, implementation, and training. Then, we highlight the various applications and domains of DDPG, including Autonomous Driving, Unmanned Aerial Vehicles, Resource Allocation, Communications and the Internet of Things, Robotics, and Finance. Additionally, we provide an in-depth comparison of DDPG with other DRL algorithms and traditional RL methods, highlighting its strengths and weaknesses. We believe that this review will be an essential resource for researchers, offering them valuable insights into the methods and techniques utilized in the field of DRL and DDPG.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    单细胞分辨系统生物学方法,包括基于物质和成像的测量模式,生成大量表征细胞群体异质性的高维数据。表示学习方法通常用于分析这些复杂的,通过将高维数据投影到低维嵌入中。这有助于结构的解释和询问,动力学,和细胞异质性的调节。反映了它们在分析不同单细胞数据类型中的核心作用,存在无数的表征学习方法,新方法不断涌现。这里,我们对比了跨越统计的表示学习方法的一般特征,流形学习,和神经网络方法。我们考虑使用单细胞数据表示学习中涉及的关键步骤,包括数据预处理,超参数优化,下游分析,和生物验证。还强调了将这些步骤联系起来的相互依存关系和突发事件。此概述旨在指导研究人员进行选择,应用程序,以及当前和未来单细胞研究应用的表征学习策略的优化。
    Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data by projecting them into lower-dimensional embeddings. This facilitates the interpretation and interrogation of the structures, dynamics, and regulation of cell heterogeneity. Reflecting their central role in analyzing diverse single-cell data types, a myriad of representation learning methods exist, with new approaches continually emerging. Here, we contrast general features of representation learning methods spanning statistical, manifold learning, and neural network approaches. We consider key steps involved in representation learning with single-cell data, including data pre-processing, hyperparameter optimization, downstream analysis, and biological validation. Interdependencies and contingencies linking these steps are also highlighted. This overview is intended to guide researchers in the selection, application, and optimization of representation learning strategies for current and future single-cell research applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    统计欺诈检测包括建立一个系统,自动选择所有案件的子集(保险索赔,金融交易,等。),这是最有趣的进一步调查。需要这种系统的原因是案件总数通常比实际可以手动调查的案件总数高得多,并且欺诈往往非常罕见。Further,研究者通常仅限于控制有限数量的k个病例,由于资源有限。分配这些资源的最有效方式是尝试选择具有最高欺诈概率的k个案例。通常必须对用于此目的的预测模型进行正则化,以避免过度拟合并因此避免不良的预测性能。损失函数,表示欺诈损失,提出了通过调整参数选择模型复杂度的方法。进行模拟研究以找到用于验证的最佳设置。Further,将拟议程序的性能与最相关的竞争程序进行比较,根据接受者工作特征曲线下面积(AUC),在一组模拟中,以及信用卡默认数据集。与根据AUC选择模型相比,通过欺诈损失选择模型的复杂性在欺诈损失方面产生了可比或更好的结果。
    Statistical fraud detection consists in making a system that automatically selects a subset of all cases (insurance claims, financial transactions, etc.) that are the most interesting for further investigation. The reason why such a system is needed is that the total number of cases typically is much higher than one realistically could investigate manually and that fraud tends to be quite rare. Further, the investigator is typically limited to controlling a restricted number k of cases, due to limited resources. The most efficient manner of allocating these resources is then to try selecting the k cases with the highest probability of being fraudulent. The prediction model used for this purpose must normally be regularised to avoid overfitting and consequently bad prediction performance. A loss function, denoted the fraud loss, is proposed for selecting the model complexity via a tuning parameter. A simulation study is performed to find the optimal settings for validation. Further, the performance of the proposed procedure is compared to the most relevant competing procedure, based on the area under the receiver operating characteristic curve (AUC), in a set of simulations, as well as on a credit card default dataset. Choosing the complexity of the model by the fraud loss resulted in either comparable or better results in terms of the fraud loss than choosing it according to the AUC.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    利用深度学习网络进行语义分割已成为从高分辨率遥感图像中提取目标的重要方法。与传统的卷积神经网络(CNN)相比,VisionTransformer网络在语义分割方面的性能显着提高。视觉转换器网络具有与CNN不同的架构。映像修补程序,线性嵌入,多头自我注意力(MHSA)是几个主要的超参数。我们应该如何配置它们以提取VHR图像中的对象以及它们如何影响网络的准确性是尚未得到充分研究的主题。本文探讨了视觉变压器网络在从极高分辨率(VHR)图像中提取建筑物足迹中的作用。设计并比较了具有不同超参数值的变压器模型,并分析了它们对准确性的影响。结果表明,较小的图像块和较高维嵌入可以获得更好的精度。此外,基于Transformer的网络被证明是可扩展的,并且可以使用具有与卷积神经网络相当的模型大小和训练时间的通用规模图形处理单元(GPU)进行训练,同时实现更高的精度。该研究为视觉变压器网络在使用VHR图像进行对象提取中的潜力提供了有价值的见解。
    Semantic segmentation with deep learning networks has become an important approach to the extraction of objects from very high-resolution remote sensing images. Vision Transformer networks have shown significant improvements in performance compared to traditional convolutional neural networks (CNNs) in semantic segmentation. Vision Transformer networks have different architectures to CNNs. Image patches, linear embedding, and multi-head self-attention (MHSA) are several of the main hyperparameters. How we should configure them for the extraction of objects in VHR images and how they affect the accuracy of networks are topics that have not been sufficiently investigated. This article explores the role of vision Transformer networks in the extraction of building footprints from very-high-resolution (VHR) images. Transformer-based models with different hyperparameter values were designed and compared, and their impact on accuracy was analyzed. The results show that smaller image patches and higher-dimension embeddings result in better accuracy. In addition, the Transformer-based network is shown to be scalable and can be trained with general-scale graphics processing units (GPUs) with comparable model sizes and training times to convolutional neural networks while achieving higher accuracy. The study provides valuable insights into the potential of vision Transformer networks in object extraction using VHR images.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    COVID-19,不断发展和提出越来越重要的问题,影响了人类健康,造成了无数人死亡。它是一种发病率和死亡率都很高的传染病。这种疾病的传播也是对人类健康的重大威胁,尤其是在发展中国家。这项研究提出了一种称为基于混洗牧羊人优化的广义深度卷积模糊网络(SSO-GDCFN)的方法来诊断COVID-19疾病状态,类型,和恢复的类别。结果表明,该方法的准确率高达99.99%;同样,精确度为99.98%;灵敏度/召回率为100%;特异性为95%;κ为0.965%;AUC为0.88%;MSE小于0.07%以及25秒。通过将所提出的方法的仿真结果与几种传统技术的仿真结果进行比较,证实了所提出方法的性能。实验结果表明,与传统方法相比,COVID-19阶段的分类具有很强的性能和很高的准确性,并且重分类最少。
    COVID-19, continually developing and raising increasingly significant issues, has impacted human health and caused countless deaths. It is an infectious disease with a high incidence and mortality rate. The spread of the disease is also a significant threat to human health, especially in the developing world. This study suggests a method called shuffle shepherd optimization-based generalized deep convolutional fuzzy network (SSO-GDCFN) to diagnose the COVID-19 disease state, types, and recovered categories. The results show that the accuracy of the proposed method is as high as 99.99%; similarly, precision is 99.98%; sensitivity/recall is 100%; specificity is 95%; kappa is 0.965%; AUC is 0.88%; and MSE is less than 0.07% as well as 25 s. Moreover, the performance of the suggested method has been confirmed by comparison of the simulation results from the proposed approach with those from several traditional techniques. The experimental findings demonstrate strong performance and high accuracy for categorizing COVID-19 stages with minimal reclassifications over the conventional methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:基于卷积神经网络(CNNSCA)的侧通道密码分析方法可以有效地进行密码攻击。实现密码分析的CNNSCA网络模型主要包括基于VGG变体的CNNSCA(VGG-CNNSCA)和基于Alexnet变体的CNNSCA(Alex-CNNSCA)。这些CNNSCA模型的学习能力和密码分析性能并不理想,并且训练后的模型精度较低,训练时间太长,并占用更多的计算资源。为了提高CNNSCA的整体性能,本文将改进CNNSCA模型设计和超参数优化。
    方法:本文首先研究了SCA应用场景中的CNN架构组成,并推导了CNN核心算法对一维数据侧信道泄漏的计算过程。其次,综合运用VGG-CNNSCA模型分类、拟合效率和Alex-CNNSCA模型占用计算资源少的优势,设计了一种新的CNNSCA基础模型,为了更好地减少深度网络中误差反向传播的梯度色散问题,SE(挤压和激励)模块新嵌入到这个基本模型中,该模块首次在CNNSCA模型中使用,形成了CNNSCA模型设计的新思路。然后将此基本模型应用于来自边信道泄漏公共数据库(ASCAD)的已知一阶屏蔽数据集。在此应用场景中,根据模型设计规则和实际实验结果,排除非必要的实验参数。在最客观的实验参数区间内优化基本模型的各种超参数,提高其密码分析性能,这导致了超参数优化方案和确定超参数的最终基准。
    结果:最后,获得了一种新的CNNSCA模型优化架构,用于攻击未受保护的加密设备-CNNSCAnew。通过对比实验,CNNSCAnew的猜测熵评价结果收敛到61。从模型训练到成功恢复的关键,总时间缩短到30分钟左右,我们获得了比其他CNNSCA模型更好的性能。
    BACKGROUND: The side-channel cryptanalysis method based on convolutional neural network (CNNSCA) can effectively carry out cryptographic attacks. The CNNSCA network models that achieve cryptanalysis mainly include CNNSCA based on the VGG variant (VGG-CNNSCA) and CNNSCA based on the Alexnet variant (Alex-CNNSCA). The learning ability and cryptanalysis performance of these CNNSCA models are not optimal, and the trained model has low accuracy, too long training time, and takes up more computing resources. In order to improve the overall performance of CNNSCA, the paper will improve CNNSCA model design and hyperparameter optimization.
    METHODS: The paper first studied the CNN architecture composition in the SCA application scenario, and derives the calculation process of the CNN core algorithm for side-channel leakage of one-dimensional data. Secondly, a new basic model of CNNSCA was designed by comprehensively using the advantages of VGG-CNNSCA model classification and fitting efficiency and Alex-CNNSCA model occupying less computing resources, in order to better reduce the gradient dispersion problem of error back propagation in deep networks, the SE (Squeeze-and-Excitation) module is newly embedded in this basic model, this module is used for the first time in the CNNSCA model, which forms a new idea for the design of the CNNSCA model. Then apply this basic model to a known first-order masked dataset from the side-channel leak public database (ASCAD). In this application scenario, according to the model design rules and actual experimental results, exclude non-essential experimental parameters. Optimize the various hyperparameters of the basic model in the most objective experimental parameter interval to improve its cryptanalysis performance, which results in a hyper-parameter optimization scheme and a final benchmark for the determination of hyper-parameters.
    RESULTS: Finally, a new CNNSCA model optimized architecture for attacking unprotected encryption devices is obtained-CNNSCAnew. Through comparative experiments, CNNSCAnew\'s guessing entropy evaluation results converged to 61. From model training to successful recovery of the key, the total time spent was shortened to about 30 min, and we obtained better performance than other CNNSCA models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    BACKGROUND: To diagnose key pathologies of age-related macular degeneration (AMD) and diabetic macular edema (DME) quickly and accurately, researchers attempted to develop effective artificial intelligence methods by using medical images.
    RESULTS: A convolutional neural network (CNN) with transfer learning capability is proposed and appropriate hyperparameters are selected for classifying optical coherence tomography (OCT) images of AMD and DME. To perform transfer learning, a pre-trained CNN model is used as the starting point for a new CNN model for solving related problems. The hyperparameters (parameters that have set values before the learning process begins) in this study were algorithm hyperparameters that affect learning speed and quality. During training, different CNN-based models require different algorithm hyperparameters (e.g., optimizer, learning rate, and mini-batch size). Experiments showed that, after transfer learning, the CNN models (8-layer Alexnet, 22-layer Googlenet, 16-layer VGG, 19-layer VGG, 18-layer Resnet, 50-layer Resnet, and a 101-layer Resnet) successfully classified OCT images of AMD and DME.
    CONCLUSIONS: The experimental results further showed that, after transfer learning, the VGG19, Resnet101, and Resnet50 models with appropriate algorithm hyperparameters had excellent capability and performance in classifying OCT images of AMD and DME.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    聚碳酸酯-碳纳米管(PC-CNT)导电复合材料的CNT浓度覆盖0.25-4.5wt。%通过熔融共混挤出制备。已研究了复合材料的交流(AC)电导率。PC-CNT复合材料的渗透阈值是使用经典的渗透理论理论确定的,然后进行数值分析。定量PC-CNT在临界体积CNT浓度下的电导率。不同的理论模型,如Bueche,McCullough和Mamunya已应用于使用超参数优化方法预测复合材料的交流电导率。通过多个系列的超参数优化过程,发现McCullough和Mamunya电导率的理论模型与我们的实验结果非常吻合;根据这些模型,链支化度和纵横比估计为0.91和167。基于改进的Sohi模型的新模型的开发与我们的数据非常吻合,优化设计模型的确定系数R2=0.922。电导率与电磁吸收(EM)指数相关,显示出与Steffen-Boltzmann(SB)模型的良好拟合,表明在研究的频率范围内微波吸收的最终CNT体积浓度。
    Polycarbonate-carbon nanotube (PC-CNT) conductive composites containing CNT concentration covering 0.25-4.5 wt.% were prepared by melt blending extrusion. The alternating current (AC) conductivity of the composites has been investigated. The percolation threshold of the PC-CNT composites was theoretically determined using the classical theory of percolation followed by numerical analysis, quantifying the conductivity of PC-CNT at the critical volume CNT concentration. Different theoretical models like Bueche, McCullough and Mamunya have been applied to predict the AC conductivity of the composites using a hyperparameter optimization method. Through multiple series of the hyperparameter optimization process, it was found that McCullough and Mamunya theoretical models for electrical conductivity fit remarkably with our experimental results; the degree of chain branching and the aspect ratio are estimated to be 0.91 and 167 according to these models. The development of a new model based on a modified Sohi model is in good agreement with our data, with a coefficient of determination R2=0.922 for an optimized design model. The conductivity is correlated to the electromagnetic absorption (EM) index showing a fine fit with Steffen-Boltzmann (SB) model, indicating the ultimate CNTs volume concentration for microwave absorption at the studied frequency range.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号