Long short-term memory

长期短期记忆
  • 文章类型: Journal Article
    台风引起的巨浪经常在沿海地区引发严重的灾害,使台风诱发波的有效预测成为研究人员的关键问题。近年来,水下物联网(IoUT)的发展迅速增加了对海洋环境灾害的预测。过去的研究利用了气象数据和前馈神经网络(例如,BPNN)具有静态网络结构,以建立较短的提前期(例如,1h)台湾沿海的台风波浪预报模型。然而,足够的预测提前期对于做好准备仍然至关重要,预警,和响应,以最大程度地减少台风期间的生命和财产损失。这项研究的目的是建立一个新的长提前期台风诱发波预测模型,使用长短期记忆(LSTM),它包含了一个动态的网络结构。LSTM可以通过其循环结构捕获长期信息,并使用存储门选择性地保留必要的信号。与早期的研究相比,该方法延长了预测提前期,显著提高了学习和泛化能力,从而显著提高预测精度。
    Huge waves caused by typhoons often induce severe disasters along coastal areas, making the effective prediction of typhoon-induced waves a crucial research issue for researchers. In recent years, the development of the Internet of Underwater Things (IoUT) has rapidly increased the prediction of oceanic environmental disasters. Past studies have utilized meteorological data and feedforward neural networks (e.g., BPNN) with static network structures to establish short lead time (e.g., 1 h) typhoon wave prediction models for the coast of Taiwan. However, sufficient lead time for prediction remains essential for preparedness, early warning, and response to minimize the loss of lives and properties during typhoons. The aim of this research is to construct a novel long lead time typhoon-induced wave prediction model using Long Short-Term Memory (LSTM), which incorporates a dynamic network structure. LSTM can capture long-term information through its recurrent structure and selectively retain necessary signals using memory gates. Compared to earlier studies, this method extends the prediction lead time and significantly improves the learning and generalization capability, thereby enhancing prediction accuracy markedly.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    电导率(EC)被广泛认为是预测盐度和矿化的最重要的水质指标之一。在目前的研究中,使用一种新颖的深度学习算法(卷积神经网络结合长短期记忆模型,CNN-LSTM)。Boruta-XGBoost特征选择方法用于确定模型的重要输入(时间序列滞后数据)。为了比较Boruta-XGB-CNN-LSTM模型的性能,三种机器学习方法-多层感知器神经网络(MLP),K-最近邻(KNN),和极端梯度增强(XGBoost)被使用。不同的统计指标,如相关系数(R),均方根误差(RMSE),和平均绝对百分比误差,用于评估模型的性能。从两条河流10年的数据来看,7年(2012-2018年)被用作训练集,3年(2019-2021年)用于测试模型。Boruta-XGB-CNN-LSTM模型在EC提前一天预测中的应用表明,在两个站中,Boruta-XGB-CNN-LSTM可以比其他机器学习模型更好地预测测试数据集的EC参数(对于AlbertRiver,R=0.9429,RMSE=45.6896,MAPE=5.9749,R=0.9215,RMSE=43.8315,BarrattaCreek的MAPE=7.6029)。考虑到Boruta-XGB-CNN-LSTM模型在两条河流中的性能更好,该模型用于预测EC前3-10天.结果表明,Boruta-XGB-CNN-LSTM模型非常能够预测未来10天的EC。结果表明,通过将预报时间从3天增加到10天,Boruta-XGB-CNN-LSTM模型的性能略有下降。这项研究的结果表明,Boruta-XGB-CNN-LSTM模型可以作为一种很好的软计算方法,用于准确预测河流中EC将如何变化。
    Electrical conductivity (EC) is widely recognized as one of the most essential water quality metrics for predicting salinity and mineralization. In the current research, the EC of two Australian rivers (Albert River and Barratta Creek) was forecasted for up to 10 days using a novel deep learning algorithm (Convolutional Neural Network combined with Long Short-Term Memory Model, CNN-LSTM). The Boruta-XGBoost feature selection method was used to determine the significant inputs (time series lagged data) to the model. To compare the performance of Boruta-XGB-CNN-LSTM models, three machine learning approaches-multi-layer perceptron neural network (MLP), K-nearest neighbour (KNN), and extreme gradient boosting (XGBoost) were used. Different statistical metrics, such as correlation coefficient (R), root mean square error (RMSE), and mean absolute percentage error, were used to assess the models\' performance. From 10 years of data in both rivers, 7 years (2012-2018) were used as a training set, and 3 years (2019-2021) were used for testing the models. Application of the Boruta-XGB-CNN-LSTM model in forecasting one day ahead of EC showed that in both stations, Boruta-XGB-CNN-LSTM can forecast the EC parameter better than other machine learning models for the test dataset (R = 0.9429, RMSE = 45.6896, MAPE = 5.9749 for Albert River, and R = 0.9215, RMSE = 43.8315, MAPE = 7.6029 for Barratta Creek). Considering the better performance of the Boruta-XGB-CNN-LSTM model in both rivers, this model was used to forecast 3-10 days ahead of EC. The results showed that the Boruta-XGB-CNN-LSTM model is very capable of forecasting the EC for the next 10 days. The results showed that by increasing the forecasting horizon from 3 to 10 days, the performance of the Boruta-XGB-CNN-LSTM model slightly decreased. The results of this study show that the Boruta-XGB-CNN-LSTM model can be used as a good soft computing method for accurately predicting how the EC will change in rivers.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    精神分裂症(SZ)是一种严重的,没有特殊治疗的慢性精神障碍。由于SZ在社会中的患病率越来越高,并且这种疾病的特征与双相情感障碍等其他精神疾病相似,大多数人没有意识到它在他们的日常生活中。因此,早期发现这种疾病将使患者寻求治疗或至少控制它。以前通过机器学习方法进行的SZ检测研究,需要在分类过程之前提取和选择特征。这项研究试图开发一种小说,基于15层卷积神经网络(CNN)和16层CNN-长短期记忆(LSTM)的端到端方法,以帮助精神科医生自动诊断SZ脑电图(EEG)信号。深度模型使用CNN层来学习信号的时间属性,而LSTM层提供序列学习机制。此外,在训练集上采用基于生成对抗网络的数据增强方法来增加数据的多样性。大型EEG数据集上的结果表明,两种提出的方法都具有很高的诊断潜力,达到98%和99%的显著精度。这项研究表明,所提出的框架能够准确地将SZ与健康受试者区分开来,并且可能对开发SZ障碍的诊断工具有用。
    Schizophrenia (SZ) is a severe, chronic mental disorder without specific treatment. Due to the increasing prevalence of SZ in societies and the similarity of the characteristics of this disease with other mental illnesses such as bipolar disorder, most people are not aware of having it in their daily lives. Therefore, early detection of this disease will allow the sufferer to seek treatment or at least control it. Previous SZ detection studies through machine learning methods, require the extraction and selection of features before the classification process. This study attempts to develop a novel, end-to-end approach based on a 15-layers convolutional neural network (CNN) and a 16-layers CNN- long short-term memory (LSTM) to help psychiatrists automatically diagnose SZ from electroencephalogram (EEG) signals. The deep model uses CNN layers to learn the temporal properties of the signals, while LSTM layers provide the sequence learning mechanism. Also, data augmentation method based on generative adversarial networks is employed over the training set to increase the diversity of the data. Results on a large EEG dataset show the high diagnostic potential of both proposed methods, achieving remarkable accuracy of 98% and 99%. This study shows that the proposed framework is able to accurately discriminate SZ from healthy subject and is potentially useful for developing diagnostic tools for SZ disorder.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分娩期间的胎儿心率监测可以帮助医疗保健专业人员识别心率模式的变化。然而,指南和产科医生专业知识的差异在解释胎儿心率方面提出了挑战,包括未能承认调查结果或误解。人工智能有可能支持产科医生诊断胎儿心率异常。
    采用预处理技术来减轻丢失信号和伪影对模型的影响,利用数据增强方法来解决数据不平衡问题。介绍一种用各种时间尺度数据训练的多尺度长短期记忆神经网络,用于自动对胎儿心率进行分类。在单尺度和多尺度模型上进行了实验。
    结果表明,多尺度LSTM模型在各种性能度量方面优于常规LSTM模型。具体来说,在测试的单个模型中,采样率为10的模型显示出最高的分类精度。该模型的准确率达到85.73%,特异性为85.32%,CTU-UHB数据集上的精度为85.53%。此外,0.918的接受者工作曲线下面积表明我们的模型具有较高的可信度.
    与以前的研究相比,我们的方法在各种评估指标中表现出卓越的性能。通过将替代采样率纳入模型,我们观察到所有绩效指标的改善,包括ACC(85.73%与83.28%),SP(85.32%与82.47%),PR(85.53%与82.84%),召回(86.13%与84.09%),F1得分(85.79%vs.83.42%),和AUC(0.9180vs.0.8667)。这项研究的局限性包括对孕妇临床特征的考虑有限,以及忽略不同孕周的潜在影响。
    UNASSIGNED: Fetal heart rate monitoring during labor can aid healthcare professionals in identifying alterations in the heart rate pattern. However, discrepancies in guidelines and obstetrician expertise present challenges in interpreting fetal heart rate, including failure to acknowledge findings or misinterpretation. Artificial intelligence has the potential to support obstetricians in diagnosing abnormal fetal heart rates.
    UNASSIGNED: Employ preprocessing techniques to mitigate the effects of missing signals and artifacts on the model, utilize data augmentation methods to address data imbalance. Introduce a multi-scale long short-term memory neural network trained with a variety of time-scale data for automatically classifying fetal heart rate. Carried out experimental on both single and multi-scale models.
    UNASSIGNED: The results indicate that multi-scale LSTM models outperform regular LSTM models in various performance metrics. Specifically, in the single models tested, the model with a sampling rate of 10 exhibited the highest classification accuracy. The model achieves an accuracy of 85.73%, a specificity of 85.32%, and a precision of 85.53% on CTU-UHB dataset. Furthermore, the area under the receiver operating curve of 0.918 suggests that our model demonstrates a high level of credibility.
    UNASSIGNED: Compared to previous research, our methodology exhibits superior performance across various evaluation metrics. By incorporating alternative sampling rates into the model, we observed improvements in all performance indicators, including ACC (85.73% vs. 83.28%), SP (85.32% vs. 82.47%), PR (85.53% vs. 82.84%), recall (86.13% vs. 84.09%), F1-score (85.79% vs. 83.42%), and AUC(0.9180 vs. 0.8667). The limitations of this research include the limited consideration of pregnant women\'s clinical characteristics and disregard the potential impact of varying gestational weeks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    桥梁交通荷载识别对于车辆超载控制以及桥梁工程的结构管理和维护具有重要意义。与传统的载荷识别方法在逆求解运动方程时总是遇到病态和同时识别多参数困难的问题不同,提出了一种基于智能传感结合智能算法的实时交通负荷监测策略。一系列钛酸铅锆传感器用于捕获梁桥的动态响应,采用长短期记忆(LSTM)神经网络,通过数据挖掘建立桥梁动态响应与交通荷载之间的映射关系。结果表明,通过将实时应变响应馈送到LSTM网络中,例如,当与实际施加的负载相比时,移动负载的速度和大小可以以高精度同时被识别。当前方法可以促进移动负载的时变特性的高效识别,并且可以提供用于服务中的桥梁的长期交通负载监测和交通控制的有用工具。
    Traffic load identification for bridges is of great significance for overloaded vehicle control as well as the structural management and maintenance in bridge engineering. Unlike the conventional load identification methods that always encounter problems of ill-condition and difficulties in identifying multi parameters simultaneously when solving the motion equations inversely, a novel strategy is proposed based on smart sensing combing intelligent algorithm for real-time traffic load monitoring. An array of lead zirconium titanate sensors is applied to capture the dynamic responses of a beam bridge, while the Long Short-Term Memory (LSTM) neural network is employed to establish the mapping relations between the dynamic responses of the bridge and the traffic load through data mining. The results reveal that, with the real-time strain responses fed into the LSTM network, the speed and magnitude of the moving load may be identified simultaneously with high accuracy when compared to the practically applied load. The current method may facilitate highly efficient identification of the time-varying characteristics of moving loads and may provide a useful tool for long-term traffic load monitoring and traffic control for in-service bridges.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在大的空间尺度上提供准确的作物产量估算和了解极端气候胁迫下的产量损失是维持全球粮食安全的紧迫挑战。虽然数据驱动的深度学习方法在预测产量模式方面表现出了很大的能力,它检测和归因于极端气候对产量的影响的能力仍然未知。在这项研究中,我们开发了一个基于深度神经网络的多任务学习框架,以估计2006年至2018年美国玉米带县级玉米产量的变化,并特别关注2012年的极端产量损失.我们发现,我们的深度学习模型在2006-2018年(R2=0.81)具有良好的准确性,并很好地再现了2012年的极端产量异常(R2=0.79)。进一步的归因分析表明,极端热胁迫是产量损失的主要原因,造成72.5%的产量损失,其次是蒸气压不足(17.6%)和降水(10.8%)的异常。我们的深度学习模型还能够估计气候因素对玉米产量的累积影响,并确定2012年蚕丝期是影响产量对极端气候胁迫的最关键阶段。我们的研究结果提供了一个新的时空深度学习框架,以评估和归因于数据丰富时代的作物产量对气候变化的响应。
    Providing accurate crop yield estimations at large spatial scales and understanding yield losses under extreme climate stress is an urgent challenge for sustaining global food security. While the data-driven deep learning approach has shown great capacity in predicting yield patterns, its capacity to detect and attribute the impacts of climatic extremes on yields remains unknown. In this study, we developed a deep neural network based multi-task learning framework to estimate variations of maize yield at the county level over the US Corn Belt from 2006 to 2018, with a special focus on the extreme yield loss in 2012. We found that our deep learning model hindcasted the yield variations with good accuracy for 2006-2018 (R2 = 0.81) and well reproduced the extreme yield anomalies in 2012 (R2 = 0.79). Further attribution analysis indicated that extreme heat stress was the major cause for yield loss, contributing to 72.5% of the yield loss, followed by anomalies of vapor pressure deficit (17.6%) and precipitation (10.8%). Our deep learning model was also able to estimate the accumulated impact of climatic factors on maize yield and identify that the silking phase was the most critical stage shaping the yield response to extreme climate stress in 2012. Our results provide a new framework of spatio-temporal deep learning to assess and attribute the crop yield response to climate variations in the data rich era.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着现代拒绝服务攻击的规模和复杂性的升级,有必要在机器学习(ML)的背景下进行研究,以用于攻击执行和防御此类攻击。本文研究了ML在使用长短期记忆网络生成行为遥测数据和欺骗请求以使分析的流量看起来合法方面的潜在用途。对于这项研究,构建了一个自定义测试环境,该环境可侦听鼠标和键盘事件并对其进行相应分析。虽然这次攻击的经济可行性目前限制了它的直接威胁,技术的进步可以使攻击者在未来更具成本效益。因此,积极发展对策对于减轻潜在风险和领先于不断发展的攻击方法仍然至关重要。
    With the escalation in the size and complexity of modern Denial of Service attacks, there is a need for research in the context of Machine Learning (ML) used in attack execution and defense against such attacks. This paper investigates the potential use of ML in generating behavioral telemetry data using Long Short-Term Memory network and spoofing requests for the analyzed traffic to look legitimate. For this research, a custom testing environment was built that listens for mouse and keyboard events and analyzes them accordingly. While the economic feasibility of this attack currently limits its immediate threat, advancements in technology could make it more cost-effective for attackers in the future. Therefore, proactive development of countermeasures remains essential to mitigate potential risks and stay ahead of evolving attack methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着时间的推移,牙科空气涡轮手机(DATH)的内部机制变得越来越复杂。为了提高牙科手术的操作可靠性并确保患者安全,本研究建立了具有时间序列因果关系函数的时间卷积网络(TCN)预测模型,传输内存,学习,存储,和快速收敛监测的健康和诊断DATH的转子和夹头故障。模仿牙医的手负荷为100g的手机被用来反复研磨玻璃瓷块以进行切割。在手持件自由运行时,采用加速度计来捕获振动信号。旨在辨别这些振动的特征。然后利用这些数据来创建诊断健康分类(DHC),以进一步开发TCN。一维卷积神经网络(CNN),和长短期记忆(LSTM)预测模型。这三个框架被用于机器学习并进行比较,以建立DATH的DHC预测模型。实验结果表明,就实验数据集预测的DHC而言,TCN框架的平方分类交叉熵损失函数误差普遍低于1DCNN,它没有内存框架或消失梯度问题的缺点。此外,TCN框架优于LSTM模型,这需要更长的病史才能提供足够的诊断能力。尽管如此,通过振动信号,在进给驱动铣削方向和机头重力方向均实现了高精度。总的来说,当嵌入式传感器可用时,故障分类预测模型可以在使用DATH之前准确预测牙科手机的健康和故障模式。因此,该模型可以被证明是预测实际牙科手机在剩余使用寿命中的劣化模式的有益工具。
    The internal mechanisms of dental air turbine handpieces (DATHs) have become increasingly intricate over time. To enhance the operational reliability of dental procedures and guarantee patient safety, this study formulated temporal convolution network (TCN) prediction models with the functions of causality in time sequence, transmitting memory, learning, storing, and fast convergence for monitoring the health and diagnosing the rotor and collet failure of DATHs. A handpiece mimicking a dentist\'s hand load of 100 g was employed to repeatedly mill a glass porcelain block back and forth for cutting. An accelerometer was employed to capture vibration signals during free-running of unrestrained operation of the handpiece, aiming to discern the characteristic features of these vibrations. These data were then utilized to create a diagnostic health classification (DHC) for further developing a TCN, a 1D convolutional neural network (CNN), and long short-term memory (LSTM) prediction models. The three frameworks were used and compared for machine learning to establish DHC prediction models for the DATH. The experimental results indicate that, in terms of DHC predicted for the experimental dataset, the square categorical cross-entropy loss function error of the TCN framework was generally lower than that of the 1D CNN, which did not have a memory framework or the drawback of the vanishing gradient problem. In addition, the TCN framework outperformed the LSTM model, which required a longer history to provide sufficient diagnostic ability. Still, high accuracies were achieved both in the direction of feed-drive milling and in the gravity of the handpiece through vibration signals. In general, the failure classification prediction model could accurately predict the health and failure mode of the dental handpiece before the use of the DATH when an embedded sensor was available. Therefore, this model could prove to be a beneficial tool for predicting the deterioration patterns of real dental handpieces in their remaining useful life.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    肺音(LS)的表征对于诊断呼吸道病理学是必不可少的。尽管传统的神经网络(NN)已被广泛用于肺音的自动诊断,通过允许准确的分类而不需要预处理和特征提取,深度神经网络可能比传统神经网络更有用。利用长短期记忆(LSTM)层揭示LS时间序列的基于序列的属性,一种由卷积长短期记忆(ConvLSTM)和LSTM层级联组成的新颖架构,即ConvLSNet的开发,这允许肺部疾病状态的高度准确的诊断。通过ConvLSTM层对多通道肺音进行建模,所提出的ConvLSNet架构可以同时处理六通道LS记录的空间和时间属性,而无需进行大量的预处理或数据转换。值得注意的是,所提出的模型基于对应于三种肺部状况的LS数据实现了97.4%的分类准确率,即哮喘,COPD,和健康的状态。与仅由CNN或LSTM层组成的架构相比,以及采用2DCNN和LSTM层级联集成的那些,提出的ConvLSNet架构表现出最高的分类精度,在施加由参数数量量化的最低计算成本的同时,培训时间,和学习率。
    Characterization of lung sounds (LS) is indispensable for diagnosing respiratory pathology. Although conventional neural networks (NNs) have been widely employed for the automatic diagnosis of lung sounds, deep neural networks can potentially be more useful than conventional NNs by allowing accurate classification without requiring preprocessing and feature extraction. Utilizing the long short-term memory (LSTM) layers to reveal the sequence-based properties of the LS time series, a novel architecture consisting of a cascade of convolutional long short-term memory (ConvLSTM) and LSTM layers, namely ConvLSNet is developed, which permits highly accurate diagnosis of pulmonary disease states. By modeling the multichannel lung sounds through the ConvLSTM layer, the proposed ConvLSNet architecture can concurrently deal with the spatial and temporal properties of the six-channel LS recordings without heavy preprocessing or data transformation. Notably, the proposed model achieves a classification accuracy of 97.4 % based on LS data corresponding to three pulmonary conditions, namely asthma, COPD, and the healthy state. Compared with architectures consisting exclusively of CNN or LSTM layers, as well as those employing a cascade integration of 2DCNN and LSTM layers, the proposed ConvLSNet architecture exhibited the highest classification accuracy, while imposing the lowest computational cost as quantified by the number of parameters, training time, and learning rate.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    青春期是社会交往对心理健康至关重要的发展时期。虽然COVID-19的发作显著扰乱了青少年的社会环境和心理健康,目前尚不清楚青少年如何适应大流行的后期阶段.我们利用了具有基于梯度的特征重要性的长短期记忆循环网络(LSTM)的机器学习架构,在大流行的三个阶段,建立日常社交互动和抑郁症状之间的关联模型。在COVID-19前一年,148名青少年报告了社会交往和抑郁症状,每天21天。这些年轻人中有116人在学校因COVID-19关闭后完成了为期28天的日记。这些年轻人中有79名和另外116名新参与者在大流行大约一年后完成了为期28天的日记。我们的结果表明,LSTM成功地从至少一周的社交互动中预测了所有三波的抑郁症状(r2>.70)。我们的研究表明,使用分析方法可以识别时间和非线性途径,通过这些途径,社会互动可能会带来抑郁症的风险。我们对输入特征重要性的独特分析使我们能够解释社交互动与抑郁症状之间的关联。总的来说,我们观察到大流行一年后恢复到大流行前的模式,大流行关闭期间性别和年龄差异减少。这种模式表明,青春期的社会影响系统受到COVID-19的影响,这种影响在大流行的更慢性阶段减弱。
    Adolescence is a developmental period in which social interactions are critical for mental health. While the onset of COVID-19 significantly disrupted adolescents\' social environments and mental health, it remains unclear how adolescents have adapted to later stages of the pandemic. We harnessed a machine learning architecture of Long Short-Term Memory recurrent networks (LSTM) with gradient-based feature importance, to model the association among daily social interactions and depressive symptoms during three stages of the pandemic. A year before COVID-19, 148 adolescents reported social interactions and depressive symptoms, every day for 21 days. One hundred sixteen of these youths completed a 28-day diary after schools closed due to COVID-19. Seventy-nine of these youths and additional 116 new participants completed a 28-day diary approximately a year into the pandemic. Our results show that LSTM successfully predicted depressive symptoms from at least a week of social interactions for all three waves (r2 > .70). Our study shows the utility of using an analytic approach that can identify temporal and nonlinear pathways through which social interactions may confer risk for depression. Our unique analysis of the importance of input features enabled us to interpret the association between social interactions and depressive symptoms. Collectively, we observed a return to pre-pandemic patterns a year into the pandemic, with reduced gender and age differences during the pandemic closures. This pattern suggests that the system of social influences in adolescence was affected by COVID-19, and that this effect was attenuated in more chronic stages of the pandemic.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号