Genetic algorithm

遗传算法
  • 文章类型: Journal Article
    我们提出了一种优化撇油器分配的遗传算法,为受约束的分配引入量身定制的修复操作。方法基本上涉及基于模拟的评估,以确保遵守韩国的规定。结果表明,优化后的分配,与目前的相比,平均减少了工作时间,并导致撇渣器总容量大幅减少。此外,我们提出了一种基于深度神经网络的代理模型,与基于仿真的优化相比,大大提高了效率。解决动员储存撇油器的地点效率低下的问题,进一步优化旨在最大程度地减少动员位置,并通过基于场景的模拟进行了验证,类似于实际情况。根据韩国的重大漏油事件,这一策略大大减少了工作时间和所需的地点。这些发现证明了所提出的遗传算法和动员位置最小化策略在增强溢油响应操作中的有效性。
    We propose a genetic algorithm for optimizing oil skimmer assignments, introducing a tailored repair operation for constrained assignments. Methods essentially involve simulation-based evaluation to ensure adherence to South Korea\'s regulations. Results show that the optimized assignments, compared to current ones, reduced work time on average and led to a significant reduction in total skimmer capacity. Additionally, we present a deep neural network-based surrogate model, greatly enhancing efficiency compared to simulation-based optimization. Addressing inefficiencies in mobilizing locations that store oil skimmers, further optimization aimed to minimize mobilized locations and was validated through scenario-based simulations resembling actual situations. Based on major oil spills in South Korea, this strategy significantly reduced work time and required locations. These findings demonstrate the effectiveness of the proposed genetic algorithm and mobilized location minimization strategy in enhancing oil spill response operations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    不可再生能源,包括化石燃料,是一种能源,其消耗率远远超过其自然生产率。因此,如果不充分开发替代能源,不可再生资源将被耗尽,在不久的将来会导致能源危机。在本文中,已经提出了一个数学模型,用于设计田地残留物的生物质供应链,其中包括几个田地,在收集集线器中的残留物后,残留物被转移到集线器,将残余物转移到反应器中。在反应堆中,残留物转化为气体,它被转移到冷凝器和变压器,转换成电能,并通过网络发送到需求点。在本文中,考虑了稳定性和扰动的标准,在相关研究中讨论较少,所提出的模型的目的是最大化销售能源的利润,包括销售价格减去成本。遗传算法(GA)和模拟退火(SA)算法已用于求解模型。然后,为了证明问题的复杂性,在问题的不同维度上给出了不同和随机的例子。此外,通过比较GA和SA,证明了算法在小维度和大维度上的效率,因为解决方案的偏差很小,并且所使用的方法提供了适合所有决策者的可接受结果。此外,通过比较遗传算法和模拟退火算法,证明了算法在小维度和大维度上的有效性,遗传算法的值更好,考虑到2.9%的偏差。并提供了适合所有决策者的解决方案。
    Non-renewable energy sources, including fossil fuels, are a type of energy whose consumption rate far exceeds its natural production rate. Therefore, non-renewable resources will be exhausted if alternative energy is not fully developed, leading to an energy crisis in the near future. In this paper, a mathematical model has been proposed for the design of the biomass supply chain of field residues that includes several fields where residue is transferred to hubs after collecting the residue in the hub, the residue is transferred to reactors. In reactors, the residue is converted into gas, which is transferred to condenser and transformers, converted into electricity and sent to demand points through the network. In this paper, the criteria of stability and disturbance were considered, which have been less discussed in related research, and the purpose of the proposed model was to maximize the profit from the sale of energy, including the selling price minus the costs. Genetic algorithm (GA) and simulated annealing (SA) algorithm have been used to solve the model. Then, to prove the complexity of the problem, different and random examples have been presented in different dimensions of the problem. Also, the efficiency of the algorithm in small and large dimensions was proved by comparing GA and SA due to the low deviation of the solutions and the methods used have provided acceptable results suitable for all decision-makers. Also, the effectiveness of the algorithm in small and large dimensions is proven by comparing the genetic algorithm and simulated annealing, and the genetic algorithm\'s values are better, considering the deviation of 2.9%.and have provided solution methods suitable for all decision makers.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在2019年后冠状病毒病时代,专用室外空气系统(DOAS),为建筑物提供100%的室外空气,被广泛认可,因为它可以通过向占用空间提供新鲜的室外空气来确保可接受的室内空气质量。具有适当设计和操作的DOAS可以提供足够的通风和除湿,同时实现能源效率。尽管如此,在确定DOAS的最优控制序列方面,设计人员和操作员在实践中实施的指导有限。因此,在实践中,在DOAS的设计和控制阶段已经认识到许多问题,包括通风和除湿不足,担心过冷,增加送风干球温度,这可能会导致严重的不适和能源浪费。人们一直在努力开发高性能的DOAS控制,以提高能源效率。然而,这种控制通常很复杂,或者难以解释,供建筑设计师和运营商在实践中考虑。在这方面,本文探讨了一种基于仿真的框架,用于生成DOAS的送风温度控制序列,不仅可以确保提高节能潜力,而且可以保证控制逻辑的实施能力。美国能源部具有动态占用概况的原型小学是用整个建筑模拟程序建模的,EnergyPlus。该模型由DOAS组成,该DOAS带有用于通风的排气能量回收系统和用于空间冷却和加热的风机盘管单元。然后,采用遗传算法来找到真正的最佳送风温度控制序列,以最小化加热的能源成本,通风,和空调系统运行。最后,采用决策树从优化中提取规则,以得出DOAS送风温度的可实施操作顺序。总共12周的模拟,包括四周的加热,冷却,和肩膀的季节,分开,在天气条件下对纽约市进行了案例研究。本案例研究确定了基于优化信息规则提取的控制,与传统的基于室外空气温度的复位控制相比,可以节省约13%的能源成本和25%的能源消耗,冷却,和肩膀的季节。值得注意的是,节能主要是通过减少加热能耗来实现的。重要的是,它几乎对应于真正的最优控制结果,这减少了大约14%的能源成本和27%的能源消耗。从结果来看,可以强调的是,优化信息规则提取可以与最优控制一样有效,同时大大降低了控制的复杂性。
    In the era of post-Coronavirus Disease 2019, the dedicated outdoor air system (DOAS), which provides 100% outdoor air for the building, is widely acknowledged as it can ensure acceptable indoor air quality by delivering fresh outdoor air to occupied space. The DOAS with a proper design and operation can provide sufficient ventilation and dehumidification while achieving energy efficiency. Nonetheless, there is limited guidance in determining the optimal control sequence of the DOAS for the designers and operators to implement in practice. Accordingly, in practice, a number of issues have been acknowledged in the design and control phases of DOAS, including insufficient ventilation and dehumidification, and increasing supply air dry-bulb temperature in fear of over-cooling, which might cause significant discomfort and energy waste. There have been efforts to develop high-performing DOAS controls for better energy efficiency. However, such controls are often complex, or difficult to interpret, for building designers and operators to consider in practice. In this regard, this paper explores a simulation-based framework for generating a supply air temperature control sequence of the DOAS not only to ensure improved energy-saving potential but also to guarantee the implement-ability of the control logic. The U.S Department of Energy prototype primary school with dynamic occupancy profiles was modeled with a whole building simulation program, EnergyPlus. The model consists of a DOAS with an exhaust air energy recovery system for ventilation and fan-coil units for space cooling and heating. Then, a Genetic Algorithm was adopted to find the true optimal supply air temperature control sequence in terms of minimizing the energy cost of the heating, ventilation, and air conditioning system operation. Lastly, Decision Tree was adopted to extract rules out of the optimums to derive an implementable sequence of operation for the DOAS supply air temperature. A total of 12 week-simulation including four weeks of heating, cooling, and shoulder seasons, separately, under the weather condition of New York City was conducted for the case study. This case study identified that the optimization-informed rule extraction-based control, when compared to conventional outdoor air temperature-based reset control, could save about 13% of energy cost and 25% of energy consumption throughout the heating, cooling, and shoulder seasons. It is notable that the energy-saving was mainly achieved by reducing the heating energy consumption. Importantly, it nearly corresponds to the true optimal control result, which reduces approximately 14% of energy cost and 27% of energy consumption. From the results, it can be highlighted that the optimization-informed rule extraction can be as energy effective as the optimal control, while significantly reducing the complexity of the control.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    本研究使用支持向量回归(SVR)模型研究了多性状基因组预测的非线性内核。我们评估了在纯种肉鸡中测量的两种car体性状(CT1和CT2)的单性状(ST)和多性状(MT)模型提供的预测能力。MT模型还包括关于体内测量的指示性状(生长和饲料效率性状-FE)的信息。我们提出了一种称为(准)多任务SVR(QMTSVR)的方法,通过遗传算法(GA)进行超参数优化。ST和MT贝叶斯收缩和变量选择模型(基因组最佳线性无偏预测因子-GBLUP,贝叶斯C-BC,并采用再现核希尔伯特空间回归-RKHS)作为基准。使用两种验证设计(CV1和CV2)训练MT模型,如果次要特征的信息在测试集中可用,则不同。模型的预测能力用预测精度(ACC;即,预测值和观测值之间的相关性,除以表型准确性的平方根),标准化均方根误差(RMSE*),和通货膨胀因素(B)。为了解释CV2式预测中的潜在偏差,我们还计算了精度的参数估计(ACCPAR)。预测能力指标根据特征而有所不同,模型,和验证设计(CV1或CV2),ACC的范围从0.71到0.84,RMSE*为0.78至0.92,b在0.82和1.34之间。QMTSVR-CV2在两个性状中都达到了最高的ACC和最小的RMSE*。我们观察到,对于CT1,模型/验证设计选择对准确性指标(ACC或ACCPar)的选择很敏感。尽管如此,QMTSVR比MTGBLUP和MTBC更高的预测精度在精度指标上得到了复制,除了所提出的方法与MTRKHS模型的性能相似之外。结果表明,所提出的方法与使用高斯或尖峰实验室多变量先验的常规多特征贝叶斯回归模型具有竞争力。
    This study investigates nonlinear kernels for multitrait (MT) genomic prediction using support vector regression (SVR) models. We assessed the predictive ability delivered by single-trait (ST) and MT models for 2 carcass traits (CT1 and CT2) measured in purebred broiler chickens. The MT models also included information on indicator traits measured in vivo [Growth and feed efficiency trait (FE)]. We proposed an approach termed (quasi) multitask SVR (QMTSVR), with hyperparameter optimization performed via genetic algorithm. ST and MT Bayesian shrinkage and variable selection models [genomic best linear unbiased predictor (GBLUP), BayesC (BC), and reproducing kernel Hilbert space (RKHS) regression] were employed as benchmarks. MT models were trained using 2 validation designs (CV1 and CV2), which differ if the information on secondary traits is available in the testing set. Models\' predictive ability was assessed with prediction accuracy (ACC; i.e. the correlation between predicted and observed values, divided by the square root of phenotype accuracy), standardized root-mean-squared error (RMSE*), and inflation factor (b). To account for potential bias in CV2-style predictions, we also computed a parametric estimate of accuracy (ACCpar). Predictive ability metrics varied according to trait, model, and validation design (CV1 or CV2), ranging from 0.71 to 0.84 for ACC, 0.78 to 0.92 for RMSE*, and between 0.82 and 1.34 for b. The highest ACC and smallest RMSE* were achieved with QMTSVR-CV2 in both traits. We observed that for CT1, model/validation design selection was sensitive to the choice of accuracy metric (ACC or ACCpar). Nonetheless, the higher predictive accuracy of QMTSVR over MTGBLUP and MTBC was replicated across accuracy metrics, besides the similar performance between the proposed method and the MTRKHS model. Results showed that the proposed approach is competitive with conventional MT Bayesian regression models using either Gaussian or spike-slab multivariate priors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分析物响应的先验估计对于高效开发液相色谱-电喷雾电离/质谱(LC-ESI/MS)方法至关重要。但由于缺乏对影响实验结果的因素的了解,仍然是一项艰巨的任务。在这项研究中,我们解决了发现信号响应和结构特性之间的相互作用关系的挑战,在具有定量结构-性质关系(QSPR)和实验设计(DoE)的整个方法中,方法参数和溶剂相关描述符。为了系统地研究应进行QSPR预测的实验领域,我们根据Box-BehnkenDoE方案改变了LC和仪器因子。七个化合物,包括阿立哌唑及其杂质,经受了57种不同的实验条件,产生399个LC-ESI/MS数据端点。为了获得测量响应的更标准的分布,峰面积在建模前进行对数变换。使用遗传算法(GA)选择的特征进行QSPR预测,并为梯度增强树(GBT)提供训练数据。提出的模型在测试数据上表现出令人满意的性能,RMSEP为1.57%,a为96.48%。这是LC-ESI/MS中的第一项QSPR研究,该研究提供了整个实验和化学空间中分析物响应行为的整体概述。由于分子内电子效应和分子大小被非常重视,GA-GBT模型提高了对模型化合物的信号响应生成的理解。它还强调需要微调影响去溶剂化和液滴充电效率的参数。
    A priori estimation of analyte response is crucial for the efficient development of liquid chromatography-electrospray ionization/mass spectrometry (LC-ESI/MS) methods, but remains a demanding task given the lack of knowledge about the factors affecting the experimental outcome. In this research, we address the challenge of discovering the interactive relationship between signal response and structural properties, method parameters and solvent-related descriptors throughout an approach featuring quantitative structure-property relationship (QSPR) and design of experiments (DoE). To systematically investigate the experimental domain within which QSPR prediction should be undertaken, we varied LC and instrumental factors according to the Box-Behnken DoE scheme. Seven compounds, including aripiprazole and its impurities, were subjected to 57 different experimental conditions, resulting in 399 LC-ESI/MS data endpoints. To obtain a more standard distribution of the measured response, the peak areas were log-transformed before modeling. QSPR predictions were made using features selected by Genetic Algorithm (GA) and providing Gradient Boosted Trees (GBT) with training data. Proposed model showed satisfactory performance on test data with a RMSEP of 1.57 % and a of 96.48 %. This is the first QSPR study in LC-ESI/MS that provided a holistic overview of the analyte\'s response behavior across the experimental and chemical space. Since intramolecular electronic effects and molecular size were given great importance, the GA-GBT model improved the understanding of signal response generation of model compounds. It also highlighted the need to fine-tune the parameters affecting desolvation and droplet charging efficiency.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Explainable Artificial Intelligence (XAI) makes AI understandable to the human user particularly when the model is complex and opaque. Local Interpretable Model-agnostic Explanations (LIME) has an image explainer package that is used to explain deep learning models. The image explainer of LIME needs some parameters to be manually tuned by the expert in advance, including the number of top features to be seen and the number of superpixels in the segmented input image. This parameter tuning is a time-consuming task. Hence, with the aim of developing an image explainer that automizes image segmentation, this paper proposes Ensemble-based Genetic Algorithm Explainer (EGAE) for melanoma cancer detection that automatically detects and presents the informative sections of the image to the user. EGAE has three phases. First, the sparsity of chromosomes in GAs is determined heuristically. Then, multiple GAs are executed consecutively. However, the difference between these GAs are in different number of superpixels in the input image that result in different chromosome lengths. Finally, the results of GAs are ensembled using consensus and majority votings. This paper also introduces how Euclidean distance can be used to calculate the distance between the actual explanation (delineated by experts) and the calculated explanation (computed by the explainer) for accuracy measurement. Experimental results on a melanoma dataset show that EGAE automatically detects informative lesions, and it also improves the accuracy of explanation in comparison with LIME efficiently. The python codes for EGAE, the ground truths delineated by clinicians, and the melanoma detection dataset are available at https://github.com/KhaosResearch/EGAE.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    交通拥堵问题和与空气污染有关的环境问题是大城市试图缓解的城市管理的基本问题。鉴于机动车对空气污染的贡献很大,这两个目标都是通过管理城市交通来实现的。在各种出行需求管理方法中,拥堵定价是一种非常有效的措施。本研究试图通过使用多模式网络的双水平模型来同时提高运输网络的效率并减少环境影响。为此,上层模型最小化目标函数,即,污染排放成本和总体通勤成本。下层还有一个交通网络模型,提供了用户均衡的条件。遗传和Frank-Wolfe算法已用于求解双层规划模型。两种定价方案,基于警戒线和基于链接,用于调查和协助政策制定者。该算法还应用于伊斯法罕的实际道路网络,伊朗。比较了不同定价策略的模型结果。根据结果,两种定价方案都可以缓解交通拥堵和污染,尽管警戒线外的污染减少少于内部。需求也从私家车模式转向公共交通,平均增长了15%。然而,基于链路的定价比基于警戒线的定价提供更好的性能。这项研究表明,在基于链路的定价中,更高的总收费伴随着拥堵和污染缓解的急剧减少,市政当局可以将其用于替代设施和基础设施,例如公共交通和停车场的发展。
    The problem of traffic congestion and the environmental issues related to air pollution are among the essential problems of urban management that metropolitan cities are trying to mitigate. Given that the contribution of motor vehicles to air pollution is significant, both goals are achieved by managing urban transport. Among the various methods of travel demand management, congestion pricing is a very efficient measure. This study tried to simultaneously increase the efficiency of the transportation network and reduce the environmental effects by using a bi-level model for the multi-modal network. For this purpose, the upper-level model minimizes the objective function, i.e., pollution emission costs and overall commuting costs. The lower level also has a transportation network model that provides the condition of user equilibrium. The genetic and Frank-Wolfe algorithms have been used to solve the bi-level programming model. Two pricing schemes, cordon-based and link-based, are used to investigate and assist policymakers. The proposed algorithm is also applied to a real-world road network in Isfahan, Iran. The results of the proposed models for different pricing strategies were compared. According to the results, both pricing schemes mitigate traffic congestion and pollution, although the reduction in pollution outside the cordon is less than inside. Demand has also shifted from the private car mode to public transportation by an average of 15%. However, link-based pricing provides better performance than cordon-based pricing. This study indicated that a higher total collected toll in link-based pricing is accompanied by a sharper reduction in congestion and pollution mitigation, which can be spent on alternative facilities and infrastructure by the municipality, such as the development of public transportation and parking.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    我们绘制了库尔德斯坦省Kamyaran市的滑坡敏感性图,伊朗,使用基于极限学习机(ELM)组合的鲁棒深度学习(DP)模型,深度信念网络(DBN),反向传播(BP),和遗传算法(GA)。在培训和测试数据集中,共记录并划分了118个滑坡位置。我们选择了25个调节因子,其中,我们通过信息增益比(IGR)技术指定了最重要的。我们使用包括敏感性在内的统计措施评估了DP模型的性能,特异性,准确度,F1-措施,和接受者工作特性曲线下面积(AUC)。三个基准算法,即,支持向量机(SVM),REPTree,和NBTree,用于检查所提出模型的适用性。IGR的结果得出结论,在25个调节因素中,只有16个因素对我们的建模过程很重要,其中,距离道路,道路密度,岩性和土地利用是四个最重要的因素。基于测试数据集的结果表明,DP模型具有最高的精度(0.926)的比较算法,其次是NBTree(0.917),REPTree(0.903),和SVM(0.894)。根据AUC=0.870的DP模型编制的滑坡敏感性图表现最好。我们认为DP模型是用于滑坡敏感性映射的合适工具。
    We mapped landslide susceptibility in Kamyaran city of Kurdistan Province, Iran, using a robust deep-learning (DP) model based on a combination of extreme learning machine (ELM), deep belief network (DBN), back propagation (BP), and genetic algorithm (GA). A total of 118 landslide locations were recorded and divided in the training and testing datasets. We selected 25 conditioning factors, and of these, we specified the most important ones by an information gain ratio (IGR) technique. We assessed the performance of the DP model using statistical measures including sensitivity, specificity, accuracy, F1-measure, and area under-the-receiver operating characteristic curve (AUC). Three benchmark algorithms, i.e., support vector machine (SVM), REPTree, and NBTree, were used to check the applicability of the proposed model. The results by IGR concluded that of the 25 conditioning factors, only 16 factors were important for our modeling procedure, and of these, distance to road, road density, lithology and land use were the four most significant factors. Results based on the testing dataset revealed that the DP model had the highest accuracy (0.926) of the compared algorithms, followed by NBTree (0.917), REPTree (0.903), and SVM (0.894). The landslide susceptibility maps prepared from the DP model with AUC = 0.870 performed the best. We consider the DP model a suitable tool for landslide susceptibility mapping.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    尽管硬传感器可以很容易地用于能源生产过程的各种状态监测,软传感器被限制在一些特定的情况下,由于困难的安装要求和复杂的工作条件。然而,工业过程可能指复杂的控制和操作,从丰富的传感器数据中提取相关信息可能具有挑战性,复杂过程数据模式的描述也成为软测量开发的一个热点。在本文中,提出了一种基于机理分析和数据驱动的混合软测量模型,以某电厂磨煤机通风传感为例进行了研究。首先,通过质量和能量守恒定律建立通风机理模型,和对象相关的特征被识别为数据驱动方法的输入。其次,径向基函数神经网络(RBFNN)用于软测量建模,采用遗传算法(GA)快速、准确地确定RBFNN超参数,因此提出了自适应RBFNN(SA-RBFNN)来提高能源生产过程中的软测量性能。最后,在实际电厂数据集上验证了该方法的有效性,以磨煤机通风软测量为例。
    Despite hard sensors can be easily used in various condition monitoring of energy production process, soft sensors are confined to some specific scenarios due to difficulty installation requirements and complex work conditions. However, industrial process may refer to complex control and operation, the extraction of relevant information from abundant sensors data may be challenging, and description of complicated process data patterns is also becoming a hot topic in soft-sensor development. In this paper, a hybrid soft sensor model based mechanism analysis and data-driven is proposed, and ventilation sensing of coal mill in a power plant is conducted as a case study. Firstly, mechanism model of ventilation is established via mass and energy conservation law, and object-relevant features are identified as the inputs of data-driven method. Secondly, radial basis function neural network (RBFNN) is used for soft sensor modeling, and genetic algorithm (GA) is adopted for quick and accurate determination of the RBFNN hyper-parameters, thus self-adaptive RBFNN (SA-RBFNN) is proposed to improve the soft sensor performance in energy production process. Finally, effectiveness of the proposed method is verified on a real-world power plant dataset, taking coal mill ventilation soft sensing as a case study.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    大流行期间医学专家面临的主要挑战之一是识别和验证新疾病的风险因素并制定有效的治疗方案所需的时间。传统上,这个过程涉及许多可能需要长达数年的临床试验,在此期间,必须采取严格的预防措施来控制疫情并减少死亡。先进的数据分析技术,然而,可以用来指导和加快这一进程。在这项研究中,我们结合进化搜索算法,深度学习,和先进的模型解释方法,以开发一个整体的探索性-预测性-解释性机器学习框架,可以帮助临床决策者及时应对大流行的挑战。拟议的框架在使用真实世界电子健康记录数据库中的急诊就诊研究COVID-19患者的急诊科(ED)再入院时得到了展示。在使用遗传算法进行探索性特征选择阶段之后,我们开发和训练一个深度人工神经网络来早期预测(即,7天)再入院(AUC=0.883)。最后,建立了SHAP模型来估计加性Shapley值(即,重要性评分)的特征,并解释其影响的大小和方向。这些发现大多与冗长而昂贵的临床试验研究报告的结果一致。
    One of the major challenges that confront medical experts during a pandemic is the time required to identify and validate the risk factors of the novel disease and to develop an effective treatment protocol. Traditionally, this process involves numerous clinical trials that may take up to several years, during which strict preventive measures must be in place to control the outbreak and reduce the deaths. Advanced data analytics techniques, however, can be leveraged to guide and speed up this process. In this study, we combine evolutionary search algorithms, deep learning, and advanced model interpretation methods to develop a holistic exploratory-predictive-explanatory machine learning framework that can assist clinical decision-makers in reacting to the challenges of a pandemic in a timely manner. The proposed framework is showcased in studying emergency department (ED) readmissions of COVID-19 patients using ED visits from a real-world electronic health records database. After an exploratory feature selection phase using genetic algorithm, we develop and train a deep artificial neural network to predict early (i.e., 7-day) readmissions (AUC = 0.883). Lastly, a SHAP model is formulated to estimate additive Shapley values (i.e., importance scores) of the features and to interpret the magnitude and direction of their effects. The findings are mostly in line with those reported by lengthy and expensive clinical trial studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号