disease prediction

疾病预测
  • 文章类型: Journal Article
    骨质疏松症,以低骨密度(BMD)为特征,是一个日益严重的公共卫生问题。到目前为止,已经提出了几种传统的回归模型和机器学习(ML)算法来预测骨质疏松症的风险。然而,这些模型在临床应用中显示出相对较低的准确性.最近提出的深度学习(DL)方法,如深度神经网络(DNN),它可以从复杂的隐藏互动中发现知识,提供了提高预测性能的新机会。在这项研究中,我们旨在评估DNN在骨质疏松风险预测中是否能取得更好的效果.
    通过利用来自路易斯安那州骨质疏松症研究(LOS)的8,134名年龄超过40岁的受试者的髋部BMD和广泛的人口统计学和常规临床数据,我们开发并构建了一个预测骨质疏松症风险的新DNN框架,并将其在骨质疏松症风险预测中的性能与四种常规ML模型进行了比较。即随机森林(RF),人工神经网络(ANN),k-最近邻(KNN),和支持向量机(SVM),以及称为骨质疏松症自我评估工具(OST)的传统回归模型。通过接收器工作曲线下面积(AUC)和准确性评估模型性能。
    通过使用16个判别变量,我们观察到DNN方法在对骨质疏松症(髋部BMDT评分≤-1.0)和非骨质疏松症风险(髋部BMDT评分>-1.0)受试者进行分类方面取得了最佳预测性能(AUC=0.848),与其他方法相比。特征重要性分析表明,DNN模型确定的前10个最重要的变量是权重,年龄,性别,握力,高度,喝啤酒,舒张压,饮酒,烟雾年,和经济水平。此外,我们进行了子抽样分析,以评估不同数量的样本量和变量对这些测试模型的预测性能的影响.值得注意的是,我们观察到,DNN模型的表现同样良好(AUC=0.846),即使仅利用了预测骨质疏松风险的前10个最重要变量.同时,当样本量减少到原始数据集的50%时,DNN模型仍然可以实现高预测性能(AUC=0.826)。
    总而言之,我们开发了一种新的DNN模型,该模型被认为是老年人群骨质疏松症早期诊断和干预的有效算法。
    UNASSIGNED: Osteoporosis, characterized by low bone mineral density (BMD), is an increasingly serious public health issue. So far, several traditional regression models and machine learning (ML) algorithms have been proposed for predicting osteoporosis risk. However, these models have shown relatively low accuracy in clinical implementation. Recently proposed deep learning (DL) approaches, such as deep neural network (DNN), which can discover knowledge from complex hidden interactions, offer a new opportunity to improve predictive performance. In this study, we aimed to assess whether DNN can achieve a better performance in osteoporosis risk prediction.
    UNASSIGNED: By utilizing hip BMD and extensive demographic and routine clinical data of 8,134 subjects with age more than 40 from the Louisiana Osteoporosis Study (LOS), we developed and constructed a novel DNN framework for predicting osteoporosis risk and compared its performance in osteoporosis risk prediction with four conventional ML models, namely random forest (RF), artificial neural network (ANN), k-nearest neighbor (KNN), and support vector machine (SVM), as well as a traditional regression model termed osteoporosis self-assessment tool (OST). Model performance was assessed by area under \'receiver operating curve\' (AUC) and accuracy.
    UNASSIGNED: By using 16 discriminative variables, we observed that the DNN approach achieved the best predictive performance (AUC = 0.848) in classifying osteoporosis (hip BMD T-score ≤ -1.0) and non-osteoporosis risk (hip BMD T-score > -1.0) subjects, compared to the other approaches. Feature importance analysis showed that the top 10 most important variables identified by the DNN model were weight, age, gender, grip strength, height, beer drinking, diastolic pressure, alcohol drinking, smoke years, and economic level. Furthermore, we performed subsampling analysis to assess the effects of varying number of sample size and variables on the predictive performance of these tested models. Notably, we observed that the DNN model performed equally well (AUC = 0.846) even by utilizing only the top 10 most important variables for osteoporosis risk prediction. Meanwhile, the DNN model can still achieve a high predictive performance (AUC = 0.826) when sample size was reduced to 50% of the original dataset.
    UNASSIGNED: In conclusion, we developed a novel DNN model which was considered to be an effective algorithm for early diagnosis and intervention of osteoporosis in the aging population.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:开发一种新技术来确定与单个风险点相对应的最佳回归单元数量,同时从基于逻辑回归的疾病预测模型创建风险评分系统。这个超参数的最佳值平衡了简单性和准确性,为患者风险分层提供小规模和高准确性的风险评分。
    方法:所提出的技术在所有潜在的超参数值上应用自适应线搜索。此外,集成了DeLong测试,以确保选定的值产生的准确性与最佳可实现的风险评分准确性没有显着差异。我们通过两个病例研究评估方法,预测糖尿病视网膜病变(DR)在6个月内和髋部骨折再入院(HFR)在30天内,涉及90400名糖尿病患者和18065名髋部骨折患者。
    结果:我们的分数与现有方法获得的分数没有显着差异,DR和HFR预测的AUROC达到0.803和0.645,分别。关于规模,我们的DR评分为0-53,HFR评分为0-15,而现有方法产生的分数经常跨越数百或数千。
    结论:根据评估,我们的风险评分为疾病提供了简单而准确的预测.此外,我们的新DR评分为DR的最新风险评分提供了一个有竞争力的替代方案,而我们的HFR病例研究显示了这种情况的第一个风险评分。
    结论:我们的技术为制作紧凑量表的精确风险评分提供了一个可概括的框架,解决医疗保健中对用户友好和有效的风险分层工具的需求。
    OBJECTIVE: Develop a novel technique to identify an optimal number of regression units corresponding to a single risk point, while creating risk scoring systems from logistic regression-based disease predictive models. The optimal value of this hyperparameter balances simplicity and accuracy, yielding risk scores of small scale and high accuracy for patient risk stratification.
    METHODS: The proposed technique applies an adapted line search across all potential hyperparameter values. Additionally, DeLong test is integrated to ensure the selected value produces an accuracy insignificantly different from the best achievable risk score accuracy. We assessed the approach through two case studies predicting diabetic retinopathy (DR) within six months and hip fracture readmissions (HFR) within 30 days, involving cohorts of 90 400 diabetic patients and 18 065 hip fracture patients.
    RESULTS: Our scores achieve accuracies insignificantly different from those obtained by existing approaches, reaching AUROCs of 0.803 and 0.645 for DR and HFR predictions, respectively. Regarding the scale, our scores ranged 0-53 for DR and 0-15 for HFR, while scores produced by existing methods frequently spanned hundreds or thousands.
    CONCLUSIONS: According to the assessment, our risk scores offer simple and accurate predictions for diseases. Furthermore, our new DR score provides a competitive alternative to state-of-the-art risk scores for DR, while our HFR case study presents the first risk score for this condition.
    CONCLUSIONS: Our technique offers a generalizable framework for crafting precise risk scores of compact scales, addressing the demand for user-friendly and effective risk stratification tool in healthcare.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    图神经网络(GNN)在疾病预测中获得了极大的关注,其中患者的潜在嵌入被建模为节点,患者之间的相似性通过边缘表示。图结构,它决定了信息是如何聚合和传播的,在图形学习中起着至关重要的作用。最近的方法通常基于患者的潜在嵌入创建图,这可能无法准确反映他们现实世界的亲密关系。我们的分析表明,原始数据,如人口统计属性和实验室结果,为评估患者的相似性提供了丰富的信息,并且可以作为仅由潜在嵌入构建的图形的补偿措施。在这项研究中,我们首先分别从潜在表示和原始数据构造自适应图,然后通过加权求和合并这些图。鉴于图形可能包含无关和嘈杂的连接,我们应用程度敏感的边缘修剪和kNN稀疏化技术来选择性地稀疏化和修剪这些边缘。我们对两个诊断预测数据集进行了深入的实验,结果表明,我们提出的方法超越了当前最先进的技术。
    Graph neural networks (GNNs) have gained significant attention in disease prediction where the latent embeddings of patients are modeled as nodes and the similarities among patients are represented through edges. The graph structure, which determines how information is aggregated and propagated, plays a crucial role in graph learning. Recent approaches typically create graphs based on patients\' latent embeddings, which may not accurately reflect their real-world closeness. Our analysis reveals that raw data, such as demographic attributes and laboratory results, offers a wealth of information for assessing patient similarities and can serve as a compensatory measure for graphs constructed exclusively from latent embeddings. In this study, we first construct adaptive graphs from both latent representations and raw data respectively, and then merge these graphs via weighted summation. Given that the graphs may contain extraneous and noisy connections, we apply degree-sensitive edge pruning and kNN sparsification techniques to selectively sparsify and prune these edges. We conducted intensive experiments on two diagnostic prediction datasets, and the results demonstrate that our proposed method surpasses current state-of-the-art techniques.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:慢性疾病管理是全球范围内的主要健康问题。随着向预防医学的范式转变,使用机器学习的疾病预测建模对于精确和准确的医学判断越来越重要。
    目的:本研究旨在使用通用数据模型(CDM)和机器学习开发4种慢性病的高性能预测模型,并确认所提出模型扩展的可能性。
    方法:在本研究中,4种主要的慢性病,即糖尿病,高血压,高脂血症,和心血管疾病-被选中,并建立了预测其在10年内发生的模型。对于模型开发,Atlas分析工具用于定义要预测的慢性病,并根据定义的条件从CDM中提取数据。用先前研究验证的4种算法建立了预测每种疾病的模型,并在应用网格搜索后比较了性能。
    结果:对于每种疾病的预测,我们应用了4种算法(逻辑回归,梯度增强,随机森林,和极端梯度提升),所有模型的准确率都超过80%。与优化模型的性能相比,极端梯度提升对这4种疾病(糖尿病,高血压,高脂血症,和心血管疾病),曲线标准下面积为80%或更高,且为0.84至0.93。
    结论:本研究通过使用CDM和机器学习预测慢性病的发生,证明了对慢性病进行抢先管理的可能性。有了这些模型,通过使用我们的基于现实世界数据的CDM和个人可以轻松获得的国家健康保险公司检查数据开发的慢性病预测机器学习模型,可以通过识别健康风险因素来证明在10年内发展为重大慢性病的风险。
    BACKGROUND: Chronic disease management is a major health issue worldwide. With the paradigm shift to preventive medicine, disease prediction modeling using machine learning is gaining importance for precise and accurate medical judgement.
    OBJECTIVE: This study aimed to develop high-performance prediction models for 4 chronic diseases using the common data model (CDM) and machine learning and to confirm the possibility for the extension of the proposed models.
    METHODS: In this study, 4 major chronic diseases-namely, diabetes, hypertension, hyperlipidemia, and cardiovascular disease-were selected, and a model for predicting their occurrence within 10 years was developed. For model development, the Atlas analysis tool was used to define the chronic disease to be predicted, and data were extracted from the CDM according to the defined conditions. A model for predicting each disease was built with 4 algorithms verified in previous studies, and the performance was compared after applying a grid search.
    RESULTS: For the prediction of each disease, we applied 4 algorithms (logistic regression, gradient boosting, random forest, and extreme gradient boosting), and all models show greater than 80% accuracy. As compared to the optimized model\'s performance, extreme gradient boosting presented the highest predictive performance for the 4 diseases (diabetes, hypertension, hyperlipidemia, and cardiovascular disease) with 80% or greater and from 0.84 to 0.93 in area under the curve standards.
    CONCLUSIONS: This study demonstrates the possibility for the preemptive management of chronic diseases by predicting the occurrence of chronic diseases using the CDM and machine learning. With these models, the risk of developing major chronic diseases within 10 years can be demonstrated by identifying health risk factors using our chronic disease prediction machine learning model developed with the real-world data-based CDM and National Health Insurance Corporation examination data that individuals can easily obtain.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    转录组学研究的数据采集曾经是转录组学分析流程中的瓶颈。然而,转录组分析技术的最新发展提高了研究人员获取数据的能力,导致重点转向数据分析。将机器学习与传统分析方法相结合,可以更有效地处理更大量的复杂数据。许多生物信息学家,特别是那些在人类转录组学和复杂生物系统研究中不熟悉ML的人,由于他们对当前ML在这一领域的利用状况的认识有限,因此面临着巨大的障碍。为了解决这个差距,这篇评论试图向这些人介绍一般类型的机器学习,其次是一系列更具体的技术,通过将它们纳入人类转录组研究的分析管道的例子进行了证明。重要的计算方面,如数据预处理,任务制定,结果(ML模型的性能),并涵盖了验证方法。希望有更好的实际意义,过去五年内发表的研究非常关注,几乎只检查人类转录组,与标准非ML工具相比的结果。
    Data acquisition for transcriptomic studies used to be the bottleneck in the transcriptomic analytical pipeline. However, recent developments in transcriptome profiling technologies have increased researchers\' ability to obtain data, resulting in a shift in focus to data analysis. Incorporating machine learning to traditional analytical methods allows the possibility of handling larger volumes of complex data more efficiently. Many bioinformaticians, especially those unfamiliar with ML in the study of human transcriptomics and complex biological systems, face a significant barrier stemming from their limited awareness of the current landscape of ML utilisation in this field. To address this gap, this review endeavours to introduce those individuals to the general types of ML, followed by a comprehensive range of more specific techniques, demonstrated through examples of their incorporation into analytical pipelines for human transcriptome investigations. Important computational aspects such as data pre-processing, task formulation, results (performance of ML models), and validation methods are encompassed. In hope of better practical relevance, there is a strong focus on studies published within the last five years, almost exclusively examining human transcriptomes, with outcomes compared with standard non-ML tools.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目标:肝病每年导致两百万人死亡,占全球所有死亡人数的4%。通过机器学习算法对大型临床数据进行疾病的预测或早期检测已经变得很有希望和潜在的强大。但由于数据的复杂性,这种方法往往有一定的局限性。在这方面,集成学习已经显示出有希望的结果。迫切需要评估不同的算法,然后在肝脏疾病预测中提出一种鲁棒的集成算法。
    方法:在包括30,691个具有11个特征的样本的大型肝脏患者数据集上评估了具有9种算法的三种集成方法。各种预处理程序被用来为所提出的模型提供更好的质量数据,除了适当调整超参数和选择的特征。
    结果:对每种算法的模型性能进行了广泛的评估,包括几个正面和负面的性能指标以及运行时间。梯度提升具有98.80%的精度和98.50%的精度,具有整体最佳性能,每个人的回忆和F1得分。
    结论:与最近的几个类似作品相比,所提出的具有梯度提升的模型在大多数指标中都得到了改善,提示其预测肝脏疾病的功效。它可以进一步应用于预测指标的共性预测其他疾病。
    OBJECTIVE: Liver disease causes two million deaths annually, accounting for 4% of all deaths globally. Prediction or early detection of the disease via machine learning algorithms on large clinical data have become promising and potentially powerful, but such methods often have some limitations due to the complexity of the data. In this regard, ensemble learning has shown promising results. There is an urgent need to evaluate different algorithms and then suggest a robust ensemble algorithm in liver disease prediction.
    METHODS: Three ensemble approaches with nine algorithms are evaluated on a large dataset of liver patients comprising 30,691 samples with 11 features. Various preprocessing procedures are utilized to feed the proposed model with better quality data, in addition to the appropriate tuning of hyperparameters and selection of features.
    RESULTS: The models\' performances with each algorithm are extensively evaluated with several positive and negative performance metrics along with runtime. Gradient boosting is found to have the overall best performance with 98.80% accuracy and 98.50% precision, recall and F1-score for each.
    CONCLUSIONS: The proposed model with gradient boosting bettered in most metrics compared with several recent similar works, suggesting its efficacy in predicting liver disease. It can be further applied to predict other diseases with the commonality of predicate indicators.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:准确预测个体对疾病的易感性对于预防医学和早期干预至关重要。已经开发了使用临床基因组数据进行疾病预测的各种统计和机器学习模型。然而,不同祖先组的疾病临床基因组预测的准确性可能存在显著差异,因为它们在临床基因组数据集中的代表性不平等.
    方法:我们引入了一种深度迁移学习方法,以改善针对数据不利祖先群体的临床基因组预测模型的性能。我们在肺癌的多个祖先基因组数据集上进行了机器学习实验,前列腺癌,和老年痴呆症,以及具有跨祖先组的内置数据不平等和分布变化的合成数据集。
    结果:在我们的多祖先机器学习实验中,深度迁移学习显着提高了数据弱势人群的疾病预测准确性。相比之下,对于这些数据处于不利地位的人群,基于线性框架的迁移学习没有取得可比的改善.
    结论:这项研究表明,深度迁移学习可以通过提高数据弱势人群的预测准确性来增强多祖先机器学习的公平性,而不会影响其他人群的预测准确性。从而为疾病的公平临床基因组预测提供了帕累托改进。
    BACKGROUND: Accurate prediction of an individual\'s predisposition to diseases is vital for preventive medicine and early intervention. Various statistical and machine learning models have been developed for disease prediction using clinico-genomic data. However, the accuracy of clinico-genomic prediction of diseases may vary significantly across ancestry groups due to their unequal representation in clinical genomic datasets.
    METHODS: We introduced a deep transfer learning approach to improve the performance of clinico-genomic prediction models for data-disadvantaged ancestry groups. We conducted machine learning experiments on multi-ancestral genomic datasets of lung cancer, prostate cancer, and Alzheimer\'s disease, as well as on synthetic datasets with built-in data inequality and distribution shifts across ancestry groups.
    RESULTS: Deep transfer learning significantly improved disease prediction accuracy for data-disadvantaged populations in our multi-ancestral machine learning experiments. In contrast, transfer learning based on linear frameworks did not achieve comparable improvements for these data-disadvantaged populations.
    CONCLUSIONS: This study shows that deep transfer learning can enhance fairness in multi-ancestral machine learning by improving prediction accuracy for data-disadvantaged populations without compromising prediction accuracy for other populations, thus providing a Pareto improvement towards equitable clinico-genomic prediction of diseases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:最近提出抗己糖激酶1(HK1)的自身抗体与糖尿病性黄斑水肿(DME)有关。我们假设抗HK1自身抗体可用作DME标志物并预测DME发病。
    方法:来自1)DME患者的血清,2)糖尿病(DM),3)过敏或自身免疫,和4)通过免疫印迹测试对照受试者的抗HK1和抗己糖激酶2(HK2)自身抗体。对DM患者进行了长达9年的前瞻性随访,并评估了抗HK1抗体与新发DME的相关性。还测试了玻璃体液的自身抗体。
    结果:在DME患者中,32%的抗HK1自身抗体呈阳性(42%的潜在1型DM患者和31%的潜在2型DM患者),12%的抗HK2自身抗体阳性,两组患者仅部分重叠。抗HK1阳性也占DM患者的7%,6%的患者有过敏和自身免疫,和3%的对照受试者。后三组抗HK2阴性。最初抗HK1阳性的7例DM患者中只有1例发展为DME。
    结论:抗HK1自身抗体可用作DME标志物,但不能预测DME的发病。
    OBJECTIVE: Autoantibodies against hexokinase 1 (HK1) were recently proposed to be associated with diabetic macular edema (DME). We hypothesized that anti-HK1 autoantibodies can be used as DME markers and to predict DME onset.
    METHODS: Serum from patients with 1) DME, 2) diabetes mellitus (DM), 3) allergies or autoimmunities, and 4) control subjects was tested for anti-HK1 and anti-hexokinase 2 (HK2) autoantibodies by immunoblotting. Patients with DM were prospectively followed for up to nine years, and the association of anti-HK1 antibodies with new-onset DME was evaluated. The vitreous humor was also tested for autoantibodies.
    RESULTS: Among patients with DME, 32 % were positive for anti-HK1 autoantibodies (42 % of those with underlying type 1 DM and 31 % of those with underlying type 2 DM), and 12 % were positive for anti-HK2 autoantibodies, with only partial overlap of these two groups of patients. Anti-HK1 positive were also 7 % of patients with DM, 6 % of patients with allergies and autoimmunities, and 3 % of control subjects. The latter three groups were anti-HK2 negative. Only one of seven patients with DM who were initially anti-HK1 positive developed DME.
    CONCLUSIONS: Anti-HK1 autoantibodies can be used as DME markers but fail to predict DME onset.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:医疗记录是了解患者健康状况的宝贵来源。医生经常使用这些记录来评估健康状况,而不仅仅依赖于耗时和复杂的检查。然而,这些记录可能并不总是与患者当前的健康问题直接相关。例如,有关普通感冒的信息可能与更具体的健康状况无关。虽然经验丰富的医生可以有效地浏览医疗记录中不必要的细节,这些多余的信息对机器学习模型以电子方式预测疾病提出了挑战。为了解决这个问题,我们开发了“al-BERT”,一种利用BERT框架的新疾病预测模型。该模型旨在从医疗记录中识别关键信息,并将其用于预测疾病。“al-BERT”的工作原理是诊断记录中的句子结构类似于常规语言模式。然而,就像说话中的口吃会引入“噪音”或不相关的信息一样,类似的问题可能出现在书面记录中,复杂的模型训练。为了克服这一点,“al-BERT”包含一个半监督层,可从患者就诊记录中过滤掉不相关的数据。这个过程旨在完善数据,从而为疾病相关性提供更可靠的指标,并提高模型在医疗诊断中的预测准确性和实用性。
    方法:为了辨别患者记录中的噪声疾病,尤其是那些类似流感的疾病,我们的方法采用了一种定制的半监督学习算法,该算法配备了集中注意力机制。这种机制经过专门校准,以增强模型对慢性病的敏感性,同时从患者记录中提取显着特征,从而增强了该模型在临床环境中的预测准确性和实用性。我们使用台湾国民健康保险提供的真实世界健康保险数据来评估al-BERT的表现。
    结果:在我们的研究中,我们对另外两个模型进行了评估:一个基于BERT,使用完整的疾病记录,以及包括额外过滤技术的另一种变体。我们的研究结果表明,包含过滤机制的模型通常比使用整个,未过滤的数据集。我们的方法改进了几个关键指标的结果:AUC-ROC(模型区分类别能力的指标),精度(积极预测的准确性),召回(模型找到所有相关案例的能力),整体精度。最值得注意的是,我们的模型显示,与目前在疾病预测领域表现最好的方法相比,召回率提高了15%.
    结论:进行的消融研究肯定了我们的注意力机制的优势,并强调了选择模块在al-BERT中的关键作用。
    BACKGROUND: Medical records are a valuable source for understanding patient health conditions. Doctors often use these records to assess health without solely depending on time-consuming and complex examinations. However, these records may not always be directly relevant to a patient\'s current health issue. For instance, information about common colds may not be relevant to a more specific health condition. While experienced doctors can effectively navigate through unnecessary details in medical records, this excess information presents a challenge for machine learning models in predicting diseases electronically. To address this, we have developed \'al-BERT\', a new disease prediction model that leverages the BERT framework. This model is designed to identify crucial information from medical records and use it to predict diseases. \'al-BERT\' operates on the principle that the structure of sentences in diagnostic records is similar to regular linguistic patterns. However, just as stuttering in speech can introduce \'noise\' or irrelevant information, similar issues can arise in written records, complicating model training. To overcome this, \'al-BERT\' incorporates a semi-supervised layer that filters out irrelevant data from patient visitation records. This process aims to refine the data, resulting in more reliable indicators for disease correlations and enhancing the model\'s predictive accuracy and utility in medical diagnostics.
    METHODS: To discern noise diseases within patient records, especially those resembling influenza-like illnesses, our approach employs a customized semi-supervised learning algorithm equipped with a focused attention mechanism. This mechanism is specifically calibrated to enhance the model\'s sensitivity to chronic conditions while concurrently distilling salient features from patient records, thereby augmenting the predictive accuracy and utility of the model in clinical settings. We evaluate the performance of al-BERT using real-world health insurance data provided by Taiwan\'s National Health Insurance.
    RESULTS: In our study, we evaluated our model against two others: one based on BERT that uses complete disease records, and another variant that includes extra filtering techniques. Our findings show that models incorporating filtering mechanisms typically perform better than those using the entire, unfiltered dataset. Our approach resulted in improved outcomes across several key measures: AUC-ROC (an indicator of a model\'s ability to distinguish between classes), precision (the accuracy of positive predictions), recall (the model\'s ability to find all relevant cases), and overall accuracy. Most notably, our model showed a 15% improvement in recall compared to the current best-performing method in the field of disease prediction.
    CONCLUSIONS: The conducted ablation study affirms the advantages of our attention mechanism and underscores the crucial role of the selection module within al-BERT.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    髓磷脂少突胶质细胞糖蛋白-抗体相关疾病(MOGAD)是一种脱髓鞘性中枢神经系统疾病。我们旨在揭示MOGAD中改变的免疫途径以预测疾病进展。使用nanostringnCounter技术,我们分析了MOGAD患者PBMC中的免疫基因表达,并将其与健康对照(HCs)进行了比较.我们发现了35个区分MOGAD患者和HCs的基因。然后,我们在包括MS和NMOSD患者在内的更大队列中验证了这些结果。HLA-DRA在MOGAD患者中的表达明显降低。HLA-DRA的减少,与单相病程和更大的脑容量相关,增强我们预测MOGAD进展的能力。
    Myelin oligodendrocyte glycoprotein-antibody-associated disease (MOGAD) is a demyelinating central nervous system disorder. We aimed to uncover immune pathways altered in MOGAD to predict disease progression. Using nanostring nCounter technology, we analyzed immune gene expression in PBMCs from MOGAD patients and compare it with healthy controls (HCs). We found 35 genes that distinguished MOGAD patients and HCs. We then validated those results in a larger cohort including MS and NMOSD patients. Expressions of HLA-DRA was significantly lower in MOGAD patients. This reduction in HLA-DRA, correlated with a monophasic disease course and greater brain volume, enhancing our ability to predict MOGAD progression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号