Long short-term memory

长期短期记忆
  • 文章类型: Journal Article
    在保持低误报率的同时预测低血糖是糖尿病管理中广泛采用连续葡萄糖监测(CGM)设备的挑战。一项小型研究表明,在欧洲1型糖尿病患者中,基于长短期记忆(LSTM)网络的深度学习模型在低血糖预测方面比传统机器学习算法具有更好的性能。然而,鉴于许多公认的深度学习模型在训练设置之外表现不佳,目前尚不清楚LSTM模型是否可以推广到不同人群或其他糖尿病亚型患者.
    本研究的目的是验证LSTM低血糖预测模型在更多不同人群和不同糖尿病亚型患者中的应用。
    我们组装了两个1型和2型糖尿病患者的大型数据集。主要数据集包括来自192名中国糖尿病患者的CGM数据用于开发LSTM,支持向量机(SVM),和随机森林(RF)模型,用于预测30分钟的低血糖预测。低血糖分为轻度(葡萄糖=54-70mg/dL)和重度(葡萄糖<54mg/dL)。使用美国的427名欧美血统患者的验证数据集来验证模型并检查其概括性。根据灵敏度评估了模型的预测性能,特异性,和接受者工作特征曲线下面积(AUC)。
    对于难以预测的轻度低血糖事件,LSTM模型在主要数据集中始终实现了大于97%的AUC值,验证数据集的AUC减少不到3%,表明该模型是稳健的,可在人群中推广。当LSTM模型应用于验证数据集中的1型和2型糖尿病时,AUC值也达到了93%以上。进一步加强了模型的泛化性。不同满意水平下对轻度和重度低血糖的预测敏感性,LSTM模型比SVM和RF模型具有更高的特异性,从而减少误报。
    我们的结果表明,LSTM模型对于低血糖预测是稳健的,并且可在人群或糖尿病亚型中推广。鉴于其减少误报的额外优势,LSTM模型是未来用于低血糖预测的CGM设备中广泛应用的有力候选模型.
    UNASSIGNED: Predicting hypoglycemia while maintaining a low false alarm rate is a challenge for the wide adoption of continuous glucose monitoring (CGM) devices in diabetes management. One small study suggested that a deep learning model based on the long short-term memory (LSTM) network had better performance in hypoglycemia prediction than traditional machine learning algorithms in European patients with type 1 diabetes. However, given that many well-recognized deep learning models perform poorly outside the training setting, it remains unclear whether the LSTM model could be generalized to different populations or patients with other diabetes subtypes.
    UNASSIGNED: The aim of this study was to validate LSTM hypoglycemia prediction models in more diverse populations and across a wide spectrum of patients with different subtypes of diabetes.
    UNASSIGNED: We assembled two large data sets of patients with type 1 and type 2 diabetes. The primary data set including CGM data from 192 Chinese patients with diabetes was used to develop the LSTM, support vector machine (SVM), and random forest (RF) models for hypoglycemia prediction with a prediction horizon of 30 minutes. Hypoglycemia was categorized into mild (glucose=54-70 mg/dL) and severe (glucose<54 mg/dL) levels. The validation data set of 427 patients of European-American ancestry in the United States was used to validate the models and examine their generalizations. The predictive performance of the models was evaluated according to the sensitivity, specificity, and area under the receiver operating characteristic curve (AUC).
    UNASSIGNED: For the difficult-to-predict mild hypoglycemia events, the LSTM model consistently achieved AUC values greater than 97% in the primary data set, with a less than 3% AUC reduction in the validation data set, indicating that the model was robust and generalizable across populations. AUC values above 93% were also achieved when the LSTM model was applied to both type 1 and type 2 diabetes in the validation data set, further strengthening the generalizability of the model. Under different satisfactory levels of sensitivity for mild and severe hypoglycemia prediction, the LSTM model achieved higher specificity than the SVM and RF models, thereby reducing false alarms.
    UNASSIGNED: Our results demonstrate that the LSTM model is robust for hypoglycemia prediction and is generalizable across populations or diabetes subtypes. Given its additional advantage of false-alarm reduction, the LSTM model is a strong candidate to be widely implemented in future CGM devices for hypoglycemia prediction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:精神疾病的高患病率是一个严重的社会问题。精神卫生服务的可获得性有限是加剧这一问题的一个主要因素。一种解决方案是使用具体化的会话代理(ECA)提供认知行为疗法(CBT)。ECA可以在没有地点或时间限制的情况下提供医疗保健。CBT中使用的一种技术是苏格拉底式提问,引导用户纠正负面想法。这种方法的有效性取决于治疗师的技能,以适应用户的情绪或痛苦水平。然而,目前的ECA不具备这一技能。因此,对ECA实施这种适应能力至关重要。
    目的:本研究旨在开发和评估一种方法,该方法根据在使用ECA的CBT会话中检测到的心理困扰水平自动适应苏格拉底式问题的数量。我们假设这种选择问题数量的适应性方法会降低心理困扰,减少负面情绪状态,与随机数量的问题相比,产生更多的认知变化。
    方法:在本研究中,设想日常生活中的医疗保健支持,我们招募了年龄在18~65岁之间的参与者,进行了一项涉及2种不同条件的实验:一种是根据心理困扰检测结果适应多个问题的ECA,另一种是仅提出随机问题的ECA.参与者被分配到两个条件中的一个,经历了一次与ECA的CBT会议,并在会议前后填写了问卷。
    结果:参与者完成了实验。性别差异很小,年龄,与实验前心理困扰水平介于2个条件之间。适应的问题数量条件显示的心理困扰明显低于会议后的随机问题数量。我们还发现,当根据检测到的困扰水平调整问题数量时,认知变化存在显着差异,与问题数量少于检测到的遇险水平的情况相比。
    结论:结果表明,ECA根据检测到的遇险水平调整苏格拉底问题的数量可以提高CBT的有效性。接受自适应数量问题的参与者比接受随机数量问题的参与者经历了更大的痛苦减少。此外,当问题的数量与检测到的困扰水平相匹配时,参与者表现出更大的认知变化.这表明,根据遇险水平检测调整问题数量可以改善ECA提供的CBT结果。这些结果说明了ECA的优势,为更有针对性和更有效的精神卫生保健铺平道路。
    BACKGROUND: The high prevalence of mental illness is a critical social problem. The limited availability of mental health services is a major factor that exacerbates this problem. One solution is to deliver cognitive behavioral therapy (CBT) using an embodied conversational agent (ECA). ECAs make it possible to provide health care without location or time constraints. One of the techniques used in CBT is Socratic questioning, which guides users to correct negative thoughts. The effectiveness of this approach depends on a therapist\'s skill to adapt to the user\'s mood or distress level. However, current ECAs do not possess this skill. Therefore, it is essential to implement this adaptation ability to the ECAs.
    OBJECTIVE: This study aims to develop and evaluate a method that automatically adapts the number of Socratic questions based on the level of detected psychological distress during a CBT session with an ECA. We hypothesize that this adaptive approach to selecting the number of questions will lower psychological distress, reduce negative emotional states, and produce more substantial cognitive changes compared with a random number of questions.
    METHODS: In this study, which envisions health care support in daily life, we recruited participants aged from 18 to 65 years for an experiment that involved 2 different conditions: an ECA that adapts a number of questions based on psychological distress detection or an ECA that only asked a random number of questions. The participants were assigned to 1 of the 2 conditions, experienced a single CBT session with an ECA, and completed questionnaires before and after the session.
    RESULTS: The participants completed the experiment. There were slight differences in sex, age, and preexperimental psychological distress levels between the 2 conditions. The adapted number of questions condition showed significantly lower psychological distress than the random number of questions condition after the session. We also found a significant difference in the cognitive change when the number of questions was adapted based on the detected distress level, compared with when the number of questions was fewer than what was appropriate for the level of distress detected.
    CONCLUSIONS: The results show that an ECA adapting the number of Socratic questions based on detected distress levels increases the effectiveness of CBT. Participants who received an adaptive number of questions experienced greater reductions in distress than those who received a random number of questions. In addition, the participants showed a greater amount of cognitive change when the number of questions matched the detected distress level. This suggests that adapting the question quantity based on distress level detection can improve the results of CBT delivered by an ECA. These results illustrate the advantages of ECAs, paving the way for mental health care that is more tailored and effective.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    简介:运动过程中血乳酸浓度(BLC)的获取有利于耐力训练,然而,一个方便的方法来衡量它仍然是不可用的。BLC和心电图(ECG)均随运动强度和持续时间的变化而变化。在这项研究中,我们假设运动期间的BLC可以使用ECG数据进行预测.方法:31名健康参与者接受了四次心肺运动试验,包括一个增量测试和三个低恒定工作速率(CWR)测试,中度,和高强度。在每次CWR测试后立即获得静脉血样以测量BLC。使用31个三重CWR测试建立了数学模型,它利用残差网络结合长短期记忆来分析II导联ECG波形的每一次搏动作为2D图像。人工神经网络用于分析变量,如RR间期,年龄,性别,和体重指数。结果:低强度和中等强度的拟合误差标准偏差为0.12mmol/L,和0.19mmol/L为高强度。加权分析表明,心电图数据,包括心电图波形的每一次搏动和RR间期,主要贡献。结论:通过采用基于二维卷积和人工神经网络的方法,运动期间的BLC可以使用ECG数据进行准确的非侵入性估计,在运动训练中具有潜在的应用。
    Introduction: The acquisition of blood lactate concentration (BLC) during exercise is beneficial for endurance training, yet a convenient method to measure it remains unavailable. BLC and electrocardiogram (ECG) both exhibit variations with changes in exercise intensity and duration. In this study, we hypothesized that BLC during exercise can be predicted using ECG data. Methods: Thirty-one healthy participants underwent four cardiopulmonary exercise tests, including one incremental test and three constant work rate (CWR) tests at low, moderate, and high intensity. Venous blood samples were obtained immediately after each CWR test to measure BLC. A mathematical model was constructed using 31 trios of CWR tests, which utilized a residual network combined with long short-term memory to analyze every beat of lead II ECG waveform as 2D images. An artificial neural network was used to analyze variables such as the RR interval, age, sex, and body mass index. Results: The standard deviation of the fitting error was 0.12 mmol/L for low and moderate intensities, and 0.19 mmol/L for high intensity. Weighting analysis demonstrated that ECG data, including every beat of ECG waveform and RR interval, contribute predominantly. Conclusion: By employing 2D convolution and artificial neural network-based methods, BLC during exercise can be accurately estimated non-invasively using ECG data, which has potential applications in exercise training.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:虽然先前的研究已经确定了糖尿病酮症酸中毒(DKA)的多种危险因素,临床医生仍然缺乏临床准备模型来预测DKA的危险和昂贵的发作。我们问我们是否可以应用深度学习,特别是使用长期短期(LSTM)模型,准确预测青少年1型糖尿病(T1D)患者DKA相关住院180天风险。
    目的:描述LSTM模型的发展,以预测T1D青年DKA相关住院的180天风险。
    方法:我们使用了17个连续日历季度的临床数据(2016年01月10日-2020年03月18日),来自美国中西部儿童糖尿病临床网络的1745名8至18岁的T1D青年。输入数据包括人口统计,离散临床观察(实验室结果,生命体征,人体测量,诊断和程序代码),药物,按相遇类型划分的访问计数,历史DKA事件的数量,自上次DKA入院以来的天数,患者报告的结果(诊所摄入量问题的答案),以及通过自然语言处理(NLP)从与糖尿病和非糖尿病相关的临床笔记中得出的数据特征。我们使用来自季度1-7的输入数据(n=1377)训练模型,使用部分样本外队列(OOS-P;n=1505)中第3-9季度的输入进行验证,并在一个完整的样本外队列(OOS-F;n=354)中进一步验证,输入来自10-15季度。
    结果:在两个OOS队列中,DKA入院的发生率为每180天5%。对于OOS-P和OOS-F队列,分别为:中位年龄13.7岁(IQR=11.3,15.8)和13.1岁(10.7,15.5);纳入时HbA1c分别为8.6%(7.6,9.8)[70(60,84)mmol/mol]和8.1%(6.9,9.5)[65(52,80)mmol/mol];14%和13%曾有DKA入院经历(T1D诊断后);召回率分别为0.33和第对于按住院概率排序的列表,OOS-P队列中位置1-80,1-25和1-10的精确度从0.33提高到0.56至1.0,OOS-F队列中位置1-18,1-10和1-5的精确度从0.50提高到0.60至0.80.
    结论:提出的用于预测180天DKA相关住院的LSTM模型在本样本中是有效的。未来的工作应评估模型在多个人群和环境中的有效性,以解决可能存在于不同人群中的健康不平等现象(例如,种族和/或社会经济多样化的群体)。按DKA相关住院的概率对青年进行排序将使诊所能够识别出风险最高的青年。这样做的临床意义是,诊所可以根据可用资源创建和评估新的预防性干预措施。
    BACKGROUND: Although prior research has identified multiple risk factors for diabetic ketoacidosis (DKA), clinicians continue to lack clinic-ready models to predict dangerous and costly episodes of DKA. We asked whether we could apply deep learning, specifically the use of a long short-term memory (LSTM) model, to accurately predict the 180-day risk of DKA-related hospitalization for youth with type 1 diabetes (T1D).
    OBJECTIVE: We aimed to describe the development of an LSTM model to predict the 180-day risk of DKA-related hospitalization for youth with T1D.
    METHODS: We used 17 consecutive calendar quarters of clinical data (January 10, 2016, to March 18, 2020) for 1745 youths aged 8 to 18 years with T1D from a pediatric diabetes clinic network in the Midwestern United States. The input data included demographics, discrete clinical observations (laboratory results, vital signs, anthropometric measures, diagnosis, and procedure codes), medications, visit counts by type of encounter, number of historic DKA episodes, number of days since last DKA admission, patient-reported outcomes (answers to clinic intake questions), and data features derived from diabetes- and nondiabetes-related clinical notes via natural language processing. We trained the model using input data from quarters 1 to 7 (n=1377), validated it using input from quarters 3 to 9 in a partial out-of-sample (OOS-P; n=1505) cohort, and further validated it in a full out-of-sample (OOS-F; n=354) cohort with input from quarters 10 to 15.
    RESULTS: DKA admissions occurred at a rate of 5% per 180-days in both out-of-sample cohorts. In the OOS-P and OOS-F cohorts, the median age was 13.7 (IQR 11.3-15.8) years and 13.1 (IQR 10.7-15.5) years; median glycated hemoglobin levels at enrollment were 8.6% (IQR 7.6%-9.8%) and 8.1% (IQR 6.9%-9.5%); recall was 33% (26/80) and 50% (9/18) for the top-ranked 5% of youth with T1D; and 14.15% (213/1505) and 12.7% (45/354) had prior DKA admissions (after the T1D diagnosis), respectively. For lists rank ordered by the probability of hospitalization, precision increased from 33% to 56% to 100% for positions 1 to 80, 1 to 25, and 1 to 10 in the OOS-P cohort and from 50% to 60% to 80% for positions 1 to 18, 1 to 10, and 1 to 5 in the OOS-F cohort, respectively.
    CONCLUSIONS: The proposed LSTM model for predicting 180-day DKA-related hospitalization was valid in this sample. Future research should evaluate model validity in multiple populations and settings to account for health inequities that may be present in different segments of the population (eg, racially or socioeconomically diverse cohorts). Rank ordering youth by probability of DKA-related hospitalization will allow clinics to identify the most at-risk youth. The clinical implication of this is that clinics may then create and evaluate novel preventive interventions based on available resources.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: English Abstract
    Cardiovascular disease is the leading cause of death worldwide, accounting for 48.0% of all deaths in Europe and 34.3% in the United States. Studies have shown that arterial stiffness takes precedence over vascular structural changes and is therefore considered to be an independent predictor of many cardiovascular diseases. At the same time, the characteristics of Korotkoff signal is related to vascular compliance. The purpose of this study is to explore the feasibility of detecting vascular stiffness based on the characteristics of Korotkoff signal. First, the Korotkoff signals of normal and stiff vessels were collected and preprocessed. Then the scattering features of Korotkoff signal were extracted by wavelet scattering network. Next, the long short-term memory (LSTM) network was established as a classification model to classify the normal and stiff vessels according to the scattering features. Finally, the performance of the classification model was evaluated by some parameters, such as accuracy, sensitivity, and specificity. In this study, 97 cases of Korotkoff signal were collected, including 47 cases from normal vessels and 50 cases from stiff vessels, which were divided into training set and test set according to the ratio of 8 : 2. The accuracy, sensitivity and specificity of the final classification model was 86.4%, 92.3% and 77.8%, respectively. At present, non-invasive screening method for vascular stiffness is very limited. The results of this study show that the characteristics of Korotkoff signal are affected by vascular compliance, and it is feasible to use the characteristics of Korotkoff signal to detect vascular stiffness. This study might be providing a new idea for non-invasive detection of vascular stiffness.
    血管硬化是心血管疾病的独立预测因子,柯氏音的特征与血管顺应性密切相关。本研究的目的是探究基于柯氏音信号的特征进行血管硬化检测的可行性。分别采集正常血管和硬化血管的柯氏音信号,并进行预处理,利用小波散射网络对柯氏音信号进行散射特征提取,搭建长短期记忆网络(LSTM)作为分类模型,对散射特征进行分类,评估 LSTM 分类模型的性能。本研究共有 97 例柯氏音信号数据,其中血管硬化组为 50 例,血管正常组为 47 例,按照 8∶2 的比例划分为训练集和测试集。最终分类模型的准确率为 86.4%,敏感度为 92.3%,特异性为 77.8%。研究结果表明,柯氏音信号的特征受到血管顺应性的影响,利用柯氏音信号的特征进行血管硬化的检测是可行的,本研究为无创血管硬化检测提供了一种新的思路。.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    早期预防和诊断高血压,有越来越多的需求,以确定其状态,以符合患者。这项试点研究旨在研究使用光电体积描记(PPG)信号的非侵入性方法如何与深度学习算法一起工作。利用便携式PPG采集设备(Max30101光子传感器)来(1)捕获PPG信号和(2)无线传输数据集。与传统的特征工程机器学习分类方案相比,这项研究对原始数据进行了预处理,并直接应用了深度学习算法(LSTM-Attention)来提取这些原始数据集之间的更深层相关性。基于门机制和内存单元的长短期内存(LSTM)模型使其能够更有效地处理长序列数据,避免梯度消失,并具有解决长期依赖关系的能力。为了增强远距离采样点之间的相关性,引入了一种注意力机制来捕获比单独的LSTM模型更多的数据变化特征.实施了15名健康志愿者和15名高血压患者的协议以获得这些数据集。处理结果表明,所提出的模型可以提供令人满意的性能(精度:0.991;精度:0.989;召回率:0.993;F1分数:0.991)。与相关研究相比,我们提出的模型也显示出优越的性能。结果表明,所提出的方法可以有效地诊断和识别高血压;因此,使用可穿戴智能设备可以迅速建立一种经济有效地筛查高血压的范例.
    To prevent and diagnose hypertension early, there has been a growing demand to identify its states that align with patients. This pilot study aims to research how a non-invasive method using photoplethysmographic (PPG) signals works together with deep learning algorithms. A portable PPG acquisition device (Max30101 photonic sensor) was utilized to (1) capture PPG signals and (2) wirelessly transmit data sets. In contrast to traditional feature engineering machine learning classification schemes, this study preprocessed raw data and applied a deep learning algorithm (LSTM-Attention) directly to extract deeper correlations between these raw datasets. The Long Short-Term Memory (LSTM) model underlying a gate mechanism and memory unit enables it to handle long sequence data more effectively, avoiding gradient disappearance and possessing the ability to solve long-term dependencies. To enhance the correlation between distant sampling points, an attention mechanism was introduced to capture more data change features than a separate LSTM model. A protocol with 15 healthy volunteers and 15 hypertension patients was implemented to obtain these datasets. The processed result demonstrates that the proposed model could present satisfactory performance (accuracy: 0.991; precision: 0.989; recall: 0.993; F1-score: 0.991). The model we proposed also demonstrated superior performance compared to related studies. The outcome indicates the proposed method could effectively diagnose and identify hypertension; thus, a paradigm to cost-effectively screen hypertension could rapidly be established using wearable smart devices.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    使用能够预测光伏(PV)能源生产的模型对于确保该能源与传统配电网的最佳集成至关重要。长短期记忆网络(LSTM)通常用于此目的,但它们的使用可能不是更好的选择,因为它们的计算复杂性很大,推理和训练时间较慢。因此,在这项工作中,我们寻求评估神经网络MLP(多层感知器)的使用,循环神经网络(RNN),和LSTMs,用于预测5min的光伏能源产量。预测的每次迭代都使用从光伏系统收集的最后120分钟的数据(功率,辐照,和PV电池温度),从2019年到2022年年中在Maceió(巴西)测量。此外,使用贝叶斯超参数优化来获得每个模型的最佳结果,并在平等的基础上进行比较。结果表明,MLP表现令人满意,需要更少的时间来训练和预测,表明在特定情况下处理非常短期的预测时,它们可能是一个更好的选择,例如,在计算资源很少的系统中。
    The use of models capable of forecasting the production of photovoltaic (PV) energy is essential to guarantee the best possible integration of this energy source into traditional distribution grids. Long Short-Term Memory networks (LSTMs) are commonly used for this purpose, but their use may not be the better option due to their great computational complexity and slower inference and training time. Thus, in this work, we seek to evaluate the use of neural networks MLPs (Multilayer Perceptron), Recurrent Neural Networks (RNNs), and LSTMs, for the forecast of 5 min of photovoltaic energy production. Each iteration of the predictions uses the last 120 min of data collected from the PV system (power, irradiation, and PV cell temperature), measured from 2019 to mid-2022 in Maceió (Brazil). In addition, Bayesian hyperparameters optimization was used to obtain the best of each model and compare them on an equal footing. Results showed that the MLP performs satisfactorily, requiring much less time to train and forecast, indicating that they can be a better option when dealing with a very short-term forecast in specific contexts, for example, in systems with little computational resources.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    1型糖尿病(T1D)结局预测在识别新的危险因素中起着至关重要的作用。确保早期患者护理和设计队列研究。TEDDY是一项纵向队列研究,从参与者那里收集了大量的多组学和临床数据,以探索T1D的进展和标志物。然而,组学概况中的缺失数据使结果预测成为一项艰巨的任务。TEDDY收集了少于6%的参与者的时间序列基因表达。此外,对于基因表达被收集的参与者,79%的时间步骤缺失。本研究引入了用于基因表达插补和胰岛自身免疫(IA)预测的先进生物信息学框架。插补模型为部分或完全缺失基因表达的参与者生成合成数据。该预测模型将合成基因表达与其他风险因素相结合,以达到更好的预测性能。对TEDDY数据集的综合实验表明:(1)我们的管道可以有效地整合合成基因表达与家族史,与文献中的单个数据集和最新结果(AUC0.682)相比,HLA基因型和SNP可以更好地预测2年的IA状态(敏感性0.622,AUC0.715)。(2)合成基因表达包含与真实基因表达一样强的预测信号,减少对昂贵和长期纵向数据收集的依赖。(3)时间序列基因表达对于所提出的改进至关重要,并且显示出比横截面基因表达明显更好的预测能力。(4)我们的管道对有限的数据可用性是稳健的。可用性:代码可在https://github.com/compbiolabucf/TEDDY获得。
    Type 1 diabetes (T1D) outcome prediction plays a vital role in identifying novel risk factors, ensuring early patient care and designing cohort studies. TEDDY is a longitudinal cohort study that collects a vast amount of multi-omics and clinical data from its participants to explore the progression and markers of T1D. However, missing data in the omics profiles make the outcome prediction a difficult task. TEDDY collected time series gene expression for less than 6% of enrolled participants. Additionally, for the participants whose gene expressions are collected, 79% time steps are missing. This study introduces an advanced bioinformatics framework for gene expression imputation and islet autoimmunity (IA) prediction. The imputation model generates synthetic data for participants with partially or entirely missing gene expression. The prediction model integrates the synthetic gene expression with other risk factors to achieve better predictive performance. Comprehensive experiments on TEDDY datasets show that: (1) Our pipeline can effectively integrate synthetic gene expression with family history, HLA genotype and SNPs to better predict IA status at 2 years (sensitivity 0.622, AUC 0.715) compared with the individual datasets and state-of-the-art results in the literature (AUC 0.682). (2) The synthetic gene expression contains predictive signals as strong as the true gene expression, reducing reliance on expensive and long-term longitudinal data collection. (3) Time series gene expression is crucial to the proposed improvement and shows significantly better predictive ability than cross-sectional gene expression. (4) Our pipeline is robust to limited data availability. Availability: Code is available at https://github.com/compbiolabucf/TEDDY.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    越来越多的抑郁症患者和初级保健服务的超负荷使得有必要使用易于获取的生物标志物(如移动脑电图(EEG))来识别抑郁状态。一些研究通过收集和分析EEG静息状态来搜索适当的特征和分类方法来解决这个问题。传统上,抑郁症的EEG静息状态分类方法主要基于线性或线性和非线性特征的组合。我们假设,持续抑郁状态的参与者与可以在EEG静息状态数据中捕获的复杂脑动力学模式的控制不同,只在几个电极上使用非线性测量,这使得开发廉价且可穿戴的设备成为可能,甚至可以通过智能手机进行监控。为了验证这样的观点,对50名参与者进行了静息状态脑电图研究,一半有抑郁状态(DEP),一半有对照(CTL)。采用数据驱动的方法选择最适合脑电图分析的时间窗口和电极。正如贾科梅蒂的建议,以及最有效的非线性特征和分类器,区分CTL和DEP参与者。选择了显示时空和频谱复杂性的非线性特征。结果证实,在15s的时间窗口中从一些选定的电极计算非线性特征足以对DEP和CTL参与者进行准确分类。最后,在内部训练和测试分类器后,经过训练的机器被应用于公开可用数据库中的EEG静息状态数据(CTL和DEP),用来自不同设备的数据验证分类器的泛化能力,人口,和环境获得接近100%的准确度。
    The growing number of depressive people and the overload in primary care services make it necessary to identify depressive states with easily accessible biomarkers such as mobile electroencephalography (EEG). Some studies have addressed this issue by collecting and analyzing EEG resting state in a search of appropriate features and classification methods. Traditionally, EEG resting state classification methods for depression were mainly based on linear or a combination of linear and non-linear features. We hypothesize that participants with ongoing depressive states differ from controls in complex patterns of brain dynamics that can be captured in EEG resting state data, using only nonlinear measures on a few electrodes, making it possible to develop cheap and wearable devices that could be even monitored through smartphones. To validate such a perspective, a resting-state EEG study was conducted with 50 participants, half with depressive state (DEP) and half controls (CTL). A data-driven approach was applied to select the most appropriate time window and electrodes for the EEG analyses, as suggested by Giacometti⁠, as well as the most efficient nonlinear features and classifiers, to distinguish between CTL and DEP participants. Nonlinear features showing temporo-spatial and spectral complexity were selected. The results confirmed that computing nonlinear features from a few selected electrodes in a 15 s time window are sufficient to classify DEP and CTL participants accurately. Finally, after training and testing internally the classifier, the trained machine was applied to EEG resting state data (CTL and DEP) from a publicly available database, validating the capacity of generalization of the classifier with data from different equipment, population, and environment obtaining an accuracy near 100%.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    实时预测河流本身的状况及其对人民的受益程度是实现人水和谐的主导方式。采用指标评分法作为评价方法,我们使用具有时间序列特征的河流评估数据和结果作为特征和标签,并将迁移学习的概念应用于长短期记忆,建立了六个子系统,包括水安全,水质,经济贡献,水生态,水管理和水文化,对我国淮河流域江苏段河流幸福度进行实时滚动评价仿真研究。实证结果表明,各系统的训练集和测试集的最大均方根误差(RMSE)为0.0226,最低判定系数R2为0.9011,证明模型拟合良好,根据该数据,引入了2022年6月分水岭的相关数据,评价结果为89.77分。总体趋势是好的,但是可以发现经济贡献水平有一定的回落趋势,客观地分析了原因。
    Real-time prediction of the state of the river itself and the degree of its benefit to the people is the leading way to achieve human-water harmony. Using the indicator scoring method as the evaluation method, we used the river evaluation data and results with time series characteristics as features and labels and applied the concept of transfer learning to Long Short-Term Memory to establish six subsystems, including water safety, water quality, economic contribution, water ecology, water management and water culture, to conduct a real-time rolling evaluation simulation study on the degree of river happiness in the Jiangsu section of the Huaihe River Basin in China. The empirical results show that the maximum Root Mean Square Error (RMSE) of the training set and test set of each system is 0.0226, and the lowest coefficient of determination R2 is 0.9011, which proves that the model fits well, according to which the relevant data of the watershed in June 2022 are brought in, and the evaluation result is obtained as 89.77 points. The overall trend is good, but a certain tendency to fall back at the level of economic contribution can be found, and the reasons are analyzed objectively.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号