disease similarity

疾病相似性
  • 文章类型: Journal Article
    微生物组研究揭示了肠道微生物群对复杂疾病的潜在影响。然而,许多研究通常集中于每个队列的一种疾病。我们开发了肠道微生物组概况的荟萃分析工作流程,并分析了涵盖11种疾病的猎枪宏基因组数据。使用可解释的机器学习和差异丰度分析,我们的研究结果加强了克罗恩病(CD)和结直肠癌(CRC)二元分类器的推广,以支持队列并强调驱动这些分类的关键微生物.我们在疾病对如CD和溃疡性结肠炎(UC)中发现了高度的微生物相似性,CD与CRC,帕金森病与2型糖尿病(T2D),和精神分裂症vsT2D。我们还发现阿尔茨海默病与CD和UC之间存在强烈的负相关。这些发现,被我们的管道探测到,为这些疾病提供有价值的见解。
    目的:在基于疾病的药物重新定位方法之前,评估疾病相似性是必不可少的第一步。我们的研究为强调将微生物组见解整合到疾病相似性评估中的潜力提供了适度的第一步。最近的微生物组研究主要集中在分析个体疾病以了解它们的独特特征。根据设计,它排除了个人的合并症。我们分析了现有研究的鸟枪宏基因组数据,并确定了以前未知的疾病之间的相似性。我们的研究代表了一项开创性的工作,利用可解释的机器学习和差异丰度分析来评估疾病之间的微生物相似性。
    Microbiome studies have revealed gut microbiota\'s potential impact on complex diseases. However, many studies often focus on one disease per cohort. We developed a meta-analysis workflow for gut microbiome profiles and analyzed shotgun metagenomic data covering 11 diseases. Using interpretable machine learning and differential abundance analysis, our findings reinforce the generalization of binary classifiers for Crohn\'s disease (CD) and colorectal cancer (CRC) to hold-out cohorts and highlight the key microbes driving these classifications. We identified high microbial similarity in disease pairs like CD vs ulcerative colitis (UC), CD vs CRC, Parkinson\'s disease vs type 2 diabetes (T2D), and schizophrenia vs T2D. We also found strong inverse correlations in Alzheimer\'s disease vs CD and UC. These findings, detected by our pipeline, provide valuable insights into these diseases.
    OBJECTIVE: Assessing disease similarity is an essential initial step preceding a disease-based approach for drug repositioning. Our study provides a modest first step in underscoring the potential of integrating microbiome insights into the disease similarity assessment. Recent microbiome research has predominantly focused on analyzing individual diseases to understand their unique characteristics, which by design excludes comorbidities in individuals. We analyzed shotgun metagenomic data from existing studies and identified previously unknown similarities between diseases. Our research represents a pioneering effort that utilizes both interpretable machine learning and differential abundance analysis to assess microbial similarity between diseases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    人类疾病的分类主要基于受影响的器官系统和表型特征。这限制了对病理表现的看法,而忽视了对制定治疗策略至关重要的机械关系。这项工作旨在推进对疾病及其相关性的理解,超越传统的表型观点。因此,502种疾病之间的相似性是使用六个不同的数据维度绘制的,包括分子,临床,和从公共来源检索的药理信息。多距离测量和多视图聚类用于评估疾病相关性的模式。将所有六个维度整合到疾病关系的共识图中,揭示了国际疾病分类(ICD)的不同疾病观点。强调多视图疾病地图提供的新颖见解。疾病特征如基因,通路,并确定了富含不同疾病组的化学物质。最后,对西方人群中常见的三种候选疾病中最相似的疾病的评估显示与已知的流行病学关联一致,并揭示了2型糖尿病(T2D)和阿尔茨海默病之间的罕见特征。疾病关系的修订有望促进共病模式的重建,重新利用药物,并推动未来的药物发现。
    The categorization of human diseases is mainly based on the affected organ system and phenotypic characteristics. This is limiting the view to the pathological manifestations, while it neglects mechanistic relationships that are crucial to develop therapeutic strategies. This work aims to advance the understanding of diseases and their relatedness beyond traditional phenotypic views. Hence, the similarity among 502 diseases is mapped using six different data dimensions encompassing molecular, clinical, and pharmacological information retrieved from public sources. Multiple distance measures and multi-view clustering are used to assess the patterns of disease relatedness. The integration of all six dimensions into a consensus map of disease relationships reveals a divergent disease view from the International Classification of Diseases (ICD), emphasizing novel insights offered by a multi-view disease map. Disease features such as genes, pathways, and chemicals that are enriched in distinct disease groups are identified. Finally, an evaluation of the top similar diseases of three candidate diseases common in the Western population shows concordance with known epidemiological associations and reveals rare features shared between Type 2 diabetes (T2D) and Alzheimer\'s disease. A revision of disease relationships holds promise for facilitating the reconstruction of comorbidity patterns, repurposing drugs, and advancing drug discovery in the future.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Preprint
    微生物组研究揭示了肠道微生物群对复杂疾病的潜在影响。然而,许多研究通常集中于每个队列的一种疾病。我们开发了肠道微生物组概况的荟萃分析工作流程,并分析了涵盖11种疾病的猎枪宏基因组数据。使用可解释的机器学习和差异丰度分析,我们的研究结果加强了克罗恩病(CD)和结直肠癌(CRC)二元分类器的推广,以支持队列并强调驱动这些分类的关键微生物.我们在疾病对如CD和溃疡性结肠炎(UC)中发现了高度的微生物相似性,CD与CRC,帕金森病与2型糖尿病(T2D),和精神分裂症vsT2D。我们还发现阿尔茨海默病与CD和UC之间存在强烈的负相关。我们的管道检测到的这些发现为这些疾病提供了有价值的见解。
    目的:在基于疾病的药物重新定位方法之前,评估疾病相似性是必要的第一步。我们的研究为强调将微生物组见解整合到疾病相似性评估中的潜力提供了适度的第一步。最近的微生物组研究主要集中在分析个体疾病以了解其独特的特征,根据设计,这排除了个人的合并症。我们分析了现有研究的鸟枪宏基因组数据,并确定了以前未知的疾病之间的相似性。我们的研究代表了一项开创性的工作,利用可解释的机器学习和差异丰度分析来评估疾病之间的微生物相似性。
    Microbiome studies have revealed gut microbiota\'s potential impact on complex diseases. However, many studies often focus on one disease per cohort. We developed a meta-analysis workflow for gut microbiome profiles and analyzed shotgun metagenomic data covering 11 diseases. Using interpretable machine learning and differential abundance analysis, our findings reinforce the generalization of binary classifiers for Crohn\'s disease (CD) and colorectal cancer (CRC) to hold-out cohorts and highlight the key microbes driving these classifications. We identified high microbial similarity in disease pairs like CD vs ulcerative colitis (UC), CD vs CRC, Parkinson\'s disease vs type 2 diabetes (T2D), and schizophrenia vs T2D. We also found strong inverse correlations in Alzheimer\'s disease vs CD and UC. These findings detected by our pipeline provide valuable insights into these diseases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    有强有力的证据支持miRNA的突变和失调与多种疾病相关。包括癌症.然而,用于鉴定疾病相关miRNA的实验方法昂贵且耗时.识别疾病相关miRNA的有效计算方法需求很高,并且将有助于检测用于疾病诊断的lncRNA生物标志物。治疗,和预防。在这项研究中,我们开发了一个集成学习框架来揭示miRNA与疾病之间的潜在关联(ELMDA)。ELMDA框架在计算miRNA和疾病相似性时不依赖于已知的关联,并使用多分类器投票来预测疾病相关的miRNA。因此,在5倍交叉验证中,HMDDv2.0数据库的ELMDA框架的平均AUC为0.9229.预测了HMDDV2.0数据库中的所有潜在关联,前50名结果中有90%通过更新的HMDDV3.2数据库进行了验证。ELMDA框架用于研究胃肿瘤,前列腺肿瘤和结肠肿瘤,100%,94%,90%,分别,通过HMDDV3.2数据库验证了前50个潜在的miRNA。此外,ELMDA框架可以预测分离的疾病相关miRNA。总之,ELMDA似乎是揭示疾病相关miRNA的可靠方法。
    There is strong evidence to support that mutations and dysregulation of miRNAs are associated with a variety of diseases, including cancer. However, the experimental methods used to identify disease-related miRNAs are expensive and time-consuming. Effective computational approaches to identify disease-related miRNAs are in high demand and would aid in the detection of lncRNA biomarkers for disease diagnosis, treatment, and prevention. In this study, we develop an ensemble learning framework to reveal the potential associations between miRNAs and diseases (ELMDA). The ELMDA framework does not rely on the known associations when calculating miRNA and disease similarities and uses multi-classifiers voting to predict disease-related miRNAs. As a result, the average AUC of the ELMDA framework was 0.9229 for the HMDD v2.0 database in a fivefold cross-validation. All potential associations in the HMDD V2.0 database were predicted, and 90% of the top 50 results were verified with the updated HMDD V3.2 database. The ELMDA framework was implemented to investigate gastric neoplasms, prostate neoplasms and colon neoplasms, and 100%, 94%, and 90%, respectively, of the top 50 potential miRNAs were validated by the HMDD V3.2 database. Moreover, the ELMDA framework can predict isolated disease-related miRNAs. In conclusion, ELMDA appears to be a reliable method to uncover disease-associated miRNAs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    COVID-19疫苗大大降低了SARS-CoV-2感染的风险。然而,一些人接种疫苗后出现不良反应,这些有时可能很严重。性别,年龄,疫苗,尤其是某些疾病史与COVID-19疫苗接种后的严重不良反应有关。然而,有成千上万种疾病,只有一些与这些严重的不良反应有关。与其他疾病发生严重不良反应的风险仍然未知。因此,有必要进行预测性研究,以提供改善的医疗护理并将风险降至最低.在这里,对现有COVID-19疫苗不良反应数据的统计结果进行分析,提出了COVID-19疫苗严重不良反应风险预测方法,名为CVSARRP。使用留一交叉验证方法测试了CVSARRP方法的性能。预测风险与实际风险的相关系数大于0.86。CVSARRP方法预测了10855种疾病接种COVID-19后从不良反应到严重不良反应的风险。患有某些疾病的人,比如中枢神经系统疾病,心脏病,泌尿系统疾病,贫血,癌症,和呼吸道疾病,其中,接种COVID-19疫苗后,严重不良反应可能增加,并经历不良事件。
    COVID-19 vaccines greatly reduce the risk of infection with SARS-CoV-2. However, some people have adverse reactions after vaccination, and these can sometimes be severe. Gender, age, vaccines, and especially certain diseases histories are related to severe adverse reactions following COVID-19 vaccination. However, there are thousands of diseases and only some are known to be related to these severe adverse reactions. The risk of severe adverse reactions with other diseases remains unknown. Therefore, there is a need for predictive studies to provide improved medical care and minimize risk. Herein, we analyzed the statistical results of existing COVID-19 vaccine adverse reaction data and proposed a COVID-19 vaccine severe adverse reaction risk prediction method, named CVSARRP. The performance of the CVSARRP method was tested using the leave-one-out cross-validation approach. The correlation coefficient between the predicted and real risk is greater than 0.86. The CVSARRP method predicts the risk from adverse reactions to severe adverse reactions after COVID-19 vaccination for 10855 diseases. People with certain diseases, such as central nervous system diseases, heart diseases, urinary system disease, anemia, cancer, and respiratory tract disease, among others, may potentially have increased of severe adverse reactions following vaccination against COVID-19 and experiencing adverse events.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:已证实微RNA(miRNA)与人类复杂疾病的出现密不可分。疾病相关miRNA的鉴定已逐渐成为揭示所检查疾病的遗传机制的常规方法。
    方法:在本研究中,提出了一种基于加权双水平网络的BLNIMDA方法,用于预测miRNA与疾病之间的隐藏关联。为此,miRNA与疾病之间的已知关联以及miRNA与疾病之间的整合相似性被映射到双水平网络中。基于发达的双层网络,miRNA-疾病关联(MDA)被定义为强关联,潜在的关联和没有关联。然后,每个miRNA-疾病对(MDP)根据双向信息分配策略分配两个信息属性,即,miRNA与疾病的关联,反之亦然。最后,然后,对从信息属性和关联类型获得的每个MDP的两个亲和性权重进行平均,作为MDP的最终关联得分。BLNIMDA的亮点在于MDA类型的定义,从双向信息分发策略和定义的关联类型引入亲和力权重评估,保证了MDAs最终预测得分的全面性和准确性。
    结果:五重交叉验证和留一交叉验证用于评估BLNIMDA的性能。曲线下面积的结果表明,BLNIMDA比其他七种选定的计算方法具有许多优势。此外,基于四种常见疾病和miRNA的案例研究证明BLNIMDA具有良好的预测性能。
    结论:因此,BLNIMDA是一种预测隐藏MDAs的有效方法。
    BACKGROUND: MicroRNAs (miRNAs) have been confirmed to be inextricably linked to the emergence of human complex diseases. The identification of the disease-related miRNAs has gradually become a routine way to unveil the genetic mechanisms of examined disorders.
    METHODS: In this study, a method BLNIMDA based on a weighted bi-level network was proposed for predicting hidden associations between miRNAs and diseases. For this purpose, the known associations between miRNAs and diseases as well as integrated similarities between miRNAs and diseases are mapped into a bi-level network. Based on the developed bi-level network, the miRNA-disease associations (MDAs) are defined as strong associations, potential associations and no associations. Then, each miRNA-disease pair (MDP) is assigned two information properties according to the bidirectional information distribution strategy, i.e., associations of miRNA towards disease and vice-versa. Finally, two affinity weights for each MDP obtained from the information properties and the association type are then averaged as the final association score of the MDP. Highlights of the BLNIMDA lie in the definition of MDA types, and the introduction of affinity weights evaluation from the bidirectional information distribution strategy and defined association types, which ensure the comprehensiveness and accuracy of the final prediction score of MDAs.
    RESULTS: Five-fold cross-validation and leave-one-out cross-validation are used to evaluate the performance of the BLNIMDA. The results of the Area Under Curve show that the BLNIMDA has many advantages over the other seven selected computational methods. Furthermore, the case studies based on four common diseases and miRNAs prove that the BLNIMDA has good predictive performance.
    CONCLUSIONS: Therefore, the BLNIMDA is an effective method for predicting hidden MDAs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    自闭症谱系障碍(ASD)是一种临床表现不均匀的神经发育障碍,变量严重性,和多种合并症。复杂的潜在遗传结构与临床异质性相匹配,证据表明,几种共同发生的脑部疾病与ASD具有遗传成分。在这项研究中,我们建立了一种遗传相似性疾病网络方法来探索ASD和常见共病脑疾病(和亚型)之间的共享遗传学,即智力残疾,注意缺陷/多动症,癫痫,以及精神分裂症和双相疾病谱中其他很少同时发生的神经精神疾病。使用由DisGeNET数据库管理的疾病相关基因集,根据疾病对之间的Jaccard系数估计疾病遗传相似性,Leiden检测算法用于识别网络疾病群落并定义共享的生物途径。我们发现了一个与ASD基因更相似的异质性脑疾病群落,包括癫痫,双相情感障碍,注意缺陷/多动障碍综合类型,和精神分裂症谱系中的一些疾病。为了确定疾病群落共同基因中的功能丧失罕见的从头变异,我们分析了一个大型ASD全基因组测序数据集,表明ASD与其他多种脑部疾病共享基因,基因相似度较低,社区。一些基因(例如,SHANK3,ASH1L,SCN2A,CHD2和MECP2)先前与ASD和这些疾病有关。这种方法能够进一步澄清ASD和脑部疾病之间的遗传共享,具有更精细的疾病分类粒度和来自DisGeNet的多层次证据。了解疾病之间的遗传共享对疾病的发病具有重要意义,病理生理学,个性化治疗。
    Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder with heterogeneous clinical presentation, variable severity, and multiple comorbidities. A complex underlying genetic architecture matches the clinical heterogeneity, and evidence indicates that several co-occurring brain disorders share a genetic component with ASD. In this study, we established a genetic similarity disease network approach to explore the shared genetics between ASD and frequent comorbid brain diseases (and subtypes), namely Intellectual Disability, Attention-Deficit/Hyperactivity Disorder, and Epilepsy, as well as other rarely co-occurring neuropsychiatric conditions in the Schizophrenia and Bipolar Disease spectrum. Using sets of disease-associated genes curated by the DisGeNET database, disease genetic similarity was estimated from the Jaccard coefficient between disease pairs, and the Leiden detection algorithm was used to identify network disease communities and define shared biological pathways. We identified a heterogeneous brain disease community that is genetically more similar to ASD, and that includes Epilepsy, Bipolar Disorder, Attention-Deficit/Hyperactivity Disorder combined type, and some disorders in the Schizophrenia Spectrum. To identify loss-of-function rare de novo variants within shared genes underlying the disease communities, we analyzed a large ASD whole-genome sequencing dataset, showing that ASD shares genes with multiple brain disorders from other, less genetically similar, communities. Some genes (e.g., SHANK3, ASH1L, SCN2A, CHD2, and MECP2) were previously implicated in ASD and these disorders. This approach enabled further clarification of genetic sharing between ASD and brain disorders, with a finer granularity in disease classification and multi-level evidence from DisGeNET. Understanding genetic sharing across disorders has important implications for disease nosology, pathophysiology, and personalized treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    药物性肝损伤(DILI),尽管发生率低,会导致严重的副作用甚至导致死亡。因此,这是终止新事物发展的主要原因之一,限制使用已经流通的,drugs.此外,它的多因素性质,结合通常模仿其他肝脏疾病的临床表现,使DILI相关(或“阳性”)文献的识别复杂化,这仍然是从临床实践和实验研究中获取结果的主要媒介。这项工作为“大型数据分析关键评估(CAMDA)2021的“DILI挑战的文献AI”做出了贡献,提供了区分DILI阳性和阴性出版物的自动化管道。我们使用自然语言处理(NLP)来过滤文本中无信息的部分,识别和提取化学物质和疾病的提及。我们将这些信息与小分子和疾病嵌入相结合,能够捕捉化学物质和疾病的相似性,以提高分类性能。前者直接来自化学检查器(CC)。对于后者,我们从美国国家医学图书馆(NLM)医学主题词(MeSH)词库和比较毒性基因组学数据库(CTD)收集了编码疾病相似性不同方面的数据.遵循与CC中使用的类似的过程,学习并评估了疾病的矢量表示。开发了两个神经网络(NN)分类器:一个接受文本作为输入的基线模型和一个增强的,扩展,还利用化学和疾病嵌入的模型。我们训练过,已验证,并通过具有10个外部和5个内部折叠的嵌套交叉验证(NCV)方案测试分类器。在此期间,基线模型和扩展模型的性能几乎相同,F1评分分别为95.04±0.61%和94.80±0.41%,分别。在外部验证时,扣留,旨在评估分类器可泛化性的数据集,扩展模型的F1分数为91.14±1.62%,表现优于基线同行,后者得分较低,为88.30±2.44%。我们对分类器进行了进一步的比较,并讨论了未来的改进和方向,包括利用化学和疾病嵌入对DILI阳性文献进行可视化和探索性分析。
    Drug-Induced Liver Injury (DILI), despite its low occurrence rate, can cause severe side effects or even lead to death. Thus, it is one of the leading causes for terminating the development of new, and restricting the use of already-circulating, drugs. Moreover, its multifactorial nature, combined with a clinical presentation that often mimics other liver diseases, complicate the identification of DILI-related (or \"positive\") literature, which remains the main medium for sourcing results from the clinical practice and experimental studies. This work-contributing to the \"Literature AI for DILI Challenge\" of the Critical Assessment of Massive Data Analysis (CAMDA) 2021- presents an automated pipeline for distinguishing between DILI-positive and negative publications. We used Natural Language Processing (NLP) to filter out the uninformative parts of a text, and identify and extract mentions of chemicals and diseases. We combined that information with small-molecule and disease embeddings, which are capable of capturing chemical and disease similarities, to improve classification performance. The former were directly sourced from the Chemical Checker (CC). For the latter, we collected data that encode different aspects of disease similarity from the National Library of Medicine\'s (NLM) Medical Subject Headings (MeSH) thesaurus and the Comparative Toxicogenomics Database (CTD). Following a similar procedure as the one used in the CC, vector representations for diseases were learnt and evaluated. Two Neural Network (NN) classifiers were developed: a baseline model that accepts texts as input and an augmented, extended, model that also utilises chemical and disease embeddings. We trained, validated, and tested the classifiers through a Nested Cross-Validation (NCV) scheme with 10 outer and 5 inner folds. During this, the baseline and extended models performed virtually identically, with F1-scores of 95.04 ± 0.61% and 94.80 ± 0.41%, respectively. Upon validation on an external, withheld, dataset that is meant to assess classifier generalisability, the extended model achieved an F1-score of 91.14 ± 1.62%, outperforming its baseline counterpart which received a lower score of 88.30 ± 2.44%. We make further comparisons between the classifiers and discuss future improvements and directions, including utilising chemical and disease embeddings for visualisation and exploratory analysis of the DILI-positive literature.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:测量复杂疾病之间的相似性对于揭示疾病的发病机理和生物医学领域的发展具有重要意义。疾病相关基因之间的功能关联和语义关联可以用于计算疾病相似性。目前,越来越多的研究证明了非编码RNA在基因组组织和基因表达调控中的重要作用。因此,考虑ncRNA可用于测量疾病相似性。然而,现有方法忽略了ncRNA在生物过程中的调控功能。在这项研究中,我们提出了一种新的深度学习方法来推断疾病相似度。
    结果:在本文中,我们提出了一种新的方法,ImpAESim,集成多个网络嵌入的框架,以学习紧凑的特征表示和疾病相似度计算。我们首先利用三种不同的疾病相关信息网络来构建异构网络,在网络扩散过程之后,RWR,提出了一种由经典自动编码器(AE)和改进的AE模型组成的紧凑特征学习模型,以提取约束和低维特征表示。我们最终获得了疾病的准确低维特征表示,然后我们使用余弦距离作为疾病相似性的度量。
    结论:ImpAESim专注于基于ncRNA调控提取特征的低维向量表示,和基因-基因相互作用网络。我们的方法可以显着减少由语义关联得出的稀疏疾病关联导致的计算偏差。
    BACKGROUND: Measuring similarity between complex diseases has significant implications for revealing the pathogenesis of diseases and development in the domain of biomedicine. It has been consentaneous that functional associations between disease-related genes and semantic associations can be applied to calculate disease similarity. Currently, more and more studies have demonstrated the profound involvement of non-coding RNA in the regulation of genome organization and gene expression. Thus, taking ncRNA into account can be useful in measuring disease similarities. However, existing methods ignore the regulation functions of ncRNA in biological process. In this study, we proposed a novel deep-learning method to deduce disease similarity.
    RESULTS: In this article, we proposed a novel method, ImpAESim, a framework integrating multiple networks embedding to learn compact feature representations and disease similarity calculation. We first utilize three different disease-related information networks to build up a heterogeneous network, after a network diffusion process, RWR, a compact feature learning model composed of classic Auto Encoder (AE) and improved AE model is proposed to extract constraints and low-dimensional feature representations. We finally obtain an accurate and low-dimensional feature representation of diseases, then we employed the cosine distance as the measurement of disease similarity.
    CONCLUSIONS: ImpAESim focuses on extracting a low-dimensional vector representation of features based on ncRNA regulation, and gene-gene interaction network. Our method can significantly reduce the calculation bias resulted from the sparse disease associations which are derived from semantic associations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    长链非编码RNA(lncRNA),一种超过200个核苷酸的非编码RNA,与各种复杂疾病有关。为了准确地识别潜在的lncRNA-疾病关联对于了解疾病的发病机制非常重要。开发新的药物,并针对不同的人类疾病设计个性化的诊断和治疗方法。与生物实验的复杂性和高成本相比,计算方法可以快速有效地预测潜在的lncRNA-疾病关联。因此,开发lncRNA疾病预测的计算方法是一个有前途的途径。然而,由于现有技术方法的预测精度较低,目前,准确有效地识别lncRNA疾病具有巨大的挑战性。本文提出了一种称为LPARP的集成方法,这是基于标签传播算法和随机投影来解决这个问题。具体来说,标签传播算法最初用于获得lncRNA-疾病关联的估计得分,然后使用随机预测来准确预测疾病相关的lncRNAs。实证实验表明,LAPRP在三个golddata集上取得了良好的预测效果,优于现有的最先进的预测方法。它也可用于预测分离的疾病和新的lncRNAs。膀胱癌的案例研究,食管鳞状细胞癌,结直肠癌进一步证明了该方法的可靠性。提出的LPARP算法可以用较少的数据稳定有效地预测潜在的lncRNA-疾病相互作用。LPARP可以作为生物医学研究的有效和可靠的工具。
    Long noncoding RNA (lncRNA), a type of more than 200 nucleotides non-coding RNA, is related to various complex diseases. To precisely identify the potential lncRNA-disease association is important to understand the disease pathogenesis, to develop new drugs, and to design individualized diagnosis and treatment methods for different human diseases. Compared with the complexity and high cost of biological experiments, computational methods can quickly and effectively predict potential lncRNA-disease associations. Thus, it is a promising avenue to develop computational methods for lncRNA-disease prediction. However, owing to the low prediction accuracy ofstate of the art methods, it is vastly challenging to accurately and effectively identify lncRNA-disease at present. This article proposed an integrated method called LPARP, which is based on label-propagation algorithm and random projection to address the issue. Specifically, the label-propagation algorithm is initially used to obtain the estimated scores of lncRNA-disease associations, and then random projections are used to accurately predict disease-related lncRNAs.The empirical experiments showed that LAPRP achieved good prediction on three golddatasets, which is superior to existing state-of-the-art prediction methods. It can also be used to predict isolated diseases and new lncRNAs. Case studies of bladder cancer, esophageal squamous-cell carcinoma, and colorectal cancer further prove the reliability of the method. The proposed LPARP algorithm can predict the potential lncRNA-disease interactions stably and effectively with fewer data. LPARP can be used as an effective and reliable tool for biomedical research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号