Data mining

数据挖掘
  • 文章类型: Journal Article
    背景:慢性糖尿病伤口对糖尿病患者的健康构成重大威胁,代表严重和持久的并发症。全球范围内,估计有2.5%到15%的年度健康预算与糖尿病有关,糖尿病伤口占很大比例。探索新的治疗剂和方法来解决糖尿病中延迟和受损的伤口愈合变得势在必行。中药治疗慢性创面愈合历史悠久,疗效显著。在这项研究中,所有经国家药品监督管理局(NMPA)正式批准的局部应用的伤口愈合中成药(pCM)均来自NMPATCM数据库。采用数据挖掘的方法获得了一个高频中药成分对,珍珠冰片(1:1)。
    方法:本研究通过动物实验和代谢组学研究了珍珠-冰片对糖尿病创面愈合的影响和分子机制。动物实验结果显示,珍珠-冰片对显著加速糖尿病创面愈合,表现出比单独的珍珠或冰片治疗更有效的效果。同时,代谢组学分析确定了模型组和正常组之间伤口代谢谱的显着差异,表明糖尿病伤口具有与正常伤口不同的代谢特征。此外,凡士林处理的伤口表现出与模型组伤口相似的代谢特征,这表明凡士林对糖尿病伤口代谢的影响可以忽略不计。此外,用珍珠治疗的伤口,冰片,和珍珠-冰片对显示出与凡士林治疗伤口明显不同的代谢谱,表示这些治疗对伤口代谢的影响。随后对代谢途径的富集分析强调了精氨酸代谢途径的参与,与糖尿病伤口密切相关,在珍珠-冰片对治疗的愈合过程中。进一步分析显示精氨酸和瓜氨酸水平升高,与正常伤口相比,模型伤口和凡士林处理伤口中的一氧化氮(NO)减少,指出糖尿病伤口中精氨酸的利用受损。有趣的是,用珍珠和珍珠-冰片对治疗降低精氨酸和瓜氨酸水平,同时增加NO含量,提示这些治疗可能促进精氨酸的分解代谢产生NO,从而促进更快的伤口闭合。此外,单独冰片显著升高伤口中的NO含量,可能是由于其直接将硝酸盐/亚硝酸盐还原为NO的能力。氧化应激是糖尿病伤口中代谢受损的决定性特征。
    结果:结果表明,与凡士林治疗相比,Pearl和Pearl-Borneol对降低了糖尿病伤口中氧化应激生物标志物甲硫氨酸亚砜的水平,表明Pearl单独或联合冰片可增强糖尿病伤口的氧化应激微环境。
    结论:总之,研究结果验证了珍珠冰片对加速糖尿病伤口愈合的有效性,具有减少氧化应激的作用,增强精氨酸代谢,增加NO的产生,为这种治疗方法提供了机械基础。
    BACKGROUND: Chronic diabetic wounds pose a significant threat to the health of diabetic patients, representing severe and enduring complications. Globally, an estimated 2.5% to 15% of the annual health budget is associated with diabetes, with diabetic wounds accounting for a substantial share. Exploring new therapeutic agents and approaches to address delayed and impaired wound healing in diabetes becomes imperative. Traditional Chinese medicine (TCM) has a long history and remarkable efficacy in treating chronic wound healing. In this study, all topically applied proprietary Chinese medicines (pCMs) for wound healing officially approved by the National Medical Products Administration (NMPA) were collected from the NMPA TCM database. Data mining was employed to obtain a high-frequency TCM ingredients pair, Pearl-Borneol (1:1).
    METHODS: This study investigated the effect and molecular mechanism of the Pearl-Borneol pair on the healing of diabetic wounds by animal experiments and metabolomics. The results from animal experiments showed that the Pearl-Borneol pair significantly accelerated diabetic wound healing, exhibiting a more potent effect than the Pearl or Borneol treatment alone. Meanwhile, the metabolomics analysis identified significant differences in metabolic profiles in wounds between the model and normal groups, indicating that diabetic wounds had distinct metabolic characteristics from normal wounds. Moreover, Vaseline-treated wounds exhibited similar metabolic profiles to the wounds from the model group, suggesting that Vaseline might have a negligible impact on diabetic wound metabolism. In addition, wounds treated with Pearl, Borneol, and Pearl-Borneol pair displayed significantly different metabolic profiles from Vaseline-treated wounds, signifying the influence of these treatments on wound metabolism. Subsequent enrichment analysis of the metabolic pathway highlighted the involvement of the arginine metabolic pathway, closely associated with diabetic wounds, in the healing process under Pearl- Borneol pair treatment. Further analysis revealed elevated levels of arginine and citrulline, coupled with reduced nitric oxide (NO) in both the model and Vaseline-treated wounds compared to normal wounds, pointing to impaired arginine utilization in diabetic wounds. Interestingly, treatment with Pearl and Pearl-Borneol pair lowered arginine and citrulline levels while increasing NO content, suggesting that these treatments may promote the catabolism of arginine to generate NO, thereby facilitating faster wound closure. Additionally, borneol alone significantly elevated NO content in wounds, potentially due to its ability to directly reduce nitrates/nitrites to NO. Oxidative stress is a defining characteristic of impaired metabolism in diabetic wounds.
    RESULTS: The result showed that both Pearl and Pearl-Borneol pair decreased the oxidative stress biomarker methionine sulfoxide level in diabetic wounds compared to those treated with Vaseline, indicating that Pearl alone or combined with Borneol may enhance the oxidative stress microenvironment in diabetic wounds.
    CONCLUSIONS: In summary, the findings validate the effectiveness of the Pearl-Borneol pair in accelerating the healing of diabetic wounds, with effects on reducing oxidative stress, enhancing arginine metabolism, and increasing NO generation, providing a mechanistic basis for this therapeutic approach.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Atogepant,口服给药,小分子,降钙素基因相关肽(CGRP)受体拮抗剂,正在研究偏头痛的治疗方法。
    我们从美国食品和药物管理局不良事件报告系统(FAERS)数据库收集数据。四种算法(ROR,PRR,BCPNN,和EBGM)被用作检测真实世界数据中与不良事件(AE)相关的信号的量度。
    在3,552,072份报告中,2876明确指出使用atogepant。女性占不良事件(AE)的大多数,显着年龄集中在45-65岁。报告的不良事件百分比在美国最高。重要的系统器官类别(SOC)包括神经系统疾病,胃肠道疾病,神经系统疾病,手术和医疗程序,耳朵和迷宫障碍。值得注意的是,与atogepant相关的首选术语(PT)包括偏头痛,便秘,恶心,眩晕,嗜睡,食欲下降,头晕和疲劳。意外的不良事件,如异常的梦,自我伤害的想法,脑雾,紧张性头痛,噩梦,脑肿瘤,感觉异常,欣快的心情,还发现了高音和脑震荡后综合征。
    本调查发现了新的和意外的与抗药物相关的药物不良反应(ADR)信号。为了确认这些解决了以前被忽视的安全问题,更多的研究是必要的。
    UNASSIGNED: Atogepant, an orally administered, small-molecule, calcitonin gene-related peptide (CGRP) receptor antagonist, is being investigated for the treatment of migraine.
    UNASSIGNED: We collected data from the US Food and Drug Administration Adverse Event Reporting System (FAERS) database. Four algorithms (ROR, PRR, BCPNN, and EBGM) were used as measures to detect signals of atogepant-associated adverse events (AEs) in real-world data.
    UNASSIGNED: Of the 3,552,072 reports, 2876 expressly stated the use of atogepant. Women accounted for the majority of adverse events (AEs), with a notable age concentration of 45-65 years. The percentage of reported adverse events was the highest in the United States. Significant system organ categories (SOC) included nervous system disorders, gastrointestinal disorders, nervous system disorders, surgical and medical procedures, ear and labyrinth disorders. Notably, preferred terms (PTs) related to atogepant include migraine, constipation, nausea, vertigo, somnolence, decreased appetite, dizziness and fatigue. Unexpected adverse events such as abnormal dreams, self-injurious ideation, brain fog, tension headache, nightmare, brain neoplasm, feeling abnormal, euphoric mood, hyperacusis and post concussion syndrome were also identified.
    UNASSIGNED: The present investigation has detected new and unexpected signals of atogepant-related adverse drug reactions (ADRs). In order to confirm these solve safety issues that were previously overlooked, more research is necessary.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    2020年3月,COVID-19的爆发引发了近期历史上最显著的股市下跌之一。本文探讨了在大流行的不同阶段,与COVID-19相关的公众情绪与股市波动之间的关系。利用自然语言处理和情感分析,我们检查Twitter数据中与大流行相关的关键词,以评估这些情绪是否可以预测股市趋势的变化。我们的分析扩展到其他数据集:一个由市场专家注释的数据集,将专业财务情绪与市场动态相结合,另一个包括长期社交媒体情绪数据,以观察从大流行阶段到流行阶段的公众情绪变化。我们的研究结果表明,社交媒体上表达的情绪与市场波动之间存在很强的相关性,特别是与股票直接相关的情绪。这些见解验证了我们的情绪(S)-LSTM模型的有效性,这有助于了解从2020年到2023年公众情绪和股市趋势之间的演变动态,因为情况从大流行转变为地方性疾病,并接近新的常态。
    In March 2020, the outbreak of COVID-19 precipitated one of the most significant stock market downturns in recent history. This paper explores the relationship between public sentiment related to COVID-19 and stock market fluctuations during the different phases of the pandemic. Utilizing natural language processing and sentiment analysis, we examine Twitter data for pandemic-related keywords to assess whether these sentiments can predict changes in stock market trends. Our analysis extends to additional datasets: one annotated by market experts to integrate professional financial sentiment with market dynamics, and another comprising long-term social media sentiment data to observe changes in public sentiment from the pandemic phase to the endemic phase. Our findings indicate a strong correlation between the sentiments expressed on social media and market volatility, particularly sentiments directly associated with stocks. These insights validate the effectiveness of our Sentiment(S)-LSTM model, which helps to understand the evolving dynamics between public sentiment and stock market trends from 2020 through 2023, as the situation shifts from pandemic to endemic and approaches new normalcy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:准确检测病原微生物需要有效的引物和探针设计。文献衍生的引物是宝贵的资源,因为它们已经在先前的研究中进行了测试并证明是有效的。然而,从已发布的文本中手动挖掘引物是耗时的,并且在物种范围内受到限制。
    结果:为了应对这些挑战,我们开发了MiPRIME,实时微生物引物挖掘平台,用于病原微生物的引物/探针序列提取,具有三个亮点:i)全面集成。涵盖超过4000万条物品和548,942种生物,该平台可以从全球角度进行高频微生物基因发现,促进用户定义的引物设计和推进微生物研究。ii)采用基于BioBERT的文本挖掘模型,准确率为98.02%,大大缩短了信息处理时间。iii)使用引物排名得分,PRscore,用于物种特异性引物的智能推荐。总的来说,MIPRIME是泛微生物领域底漆开采的实用工具,节省试验和错误实验的时间和成本。
    背景:网站{{https://www.ai-bt.com}.
    背景:补充数据可在Bioinformatics在线获得。
    BACKGROUND: Accurately detecting pathogenic microorganisms requires effective primers and probe designs. Literature-derived primers are a valuable resource as they have been tested and proven effective in previous research. However, manually mining primers from published texts is time-consuming and limited in species scop.
    RESULTS: To address these challenges, we have developed MiPRIME, a real-time Microbial Primer Mining platform for primer/probe sequences extraction of pathogenic microorganisms with three highlights: (i) comprehensive integration. Covering >40 million articles and 548 942 organisms, the platform enables high-frequency microbial gene discovery from a global perspective, facilitating user-defined primer design and advancing microbial research. (ii) Using a BioBERT-based text mining model with 98.02% accuracy, greatly reducing information processing time. (iii) Using a primer ranking score, PRscore, for intelligent recommendation of species-specific primers. Overall, MiPRIME is a practical tool for primer mining in the pan-microbial field, saving time and cost of trial-and-error experiments.
    METHODS: The web is available at {{https://www.ai-bt.com}}.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:慢性阻塞性肺疾病(COPD)的特点是发病率高,残疾,和全世界的死亡率。RNA结合蛋白(RBP)可能调控COPD患者氧化应激和炎症相关基因。单细胞转录组测序(scRNA-seq)为识别细胞间异质性和免疫细胞多样性提供了准确的工具。然而,RBPs在调节各种细胞中的作用,尤其是AT2细胞,仍然难以捉摸。
    方法:采用scRNA-seq数据集(GSE173896)和从气道组织获得的大量RNA-seq数据集(GSE124180)进行数据挖掘。接下来,在COPD和对照患者中进行RNA-seq分析。差异表达基因(DEGs)使用倍数变化(FC≥1.5或≤1.5)和P值≤0.05的标准进行鉴定。最后,基因本体论(GO),京都基因和基因组百科全书(KEGG),并进行了选择性剪接鉴定分析。
    结果:RBP基因在不同细胞群中表现出特定的表达模式,并参与AT2细胞的细胞增殖和线粒体功能障碍。作为RBP,AZGP1表达在scRNA-seq和RNA-seq数据集中均上调。它可能是一种候选免疫生物标志物,通过调节SAMD5,DNER的表达来调节AT2细胞增殖和粘附,从而调节COPD进展。DPYSL3、GBP5、GBP3和KCNJ2。此外,AZGP1调控COPD中的选择性剪接事件,特别是DDAH1和SFRP1,在COPD中具有重要意义。
    结论:RBP基因AZGP1通过调节参与可变剪接的基因抑制COPD的上皮细胞增殖。
    BACKGROUND: Chronic Obstructive Pulmonary Disease (COPD) is characterized by high morbidity, disability, and mortality rates worldwide. RNA-binding proteins (RBPs) might regulate genes involved in oxidative stress and inflammation in COPD patients. Single-cell transcriptome sequencing (scRNA-seq) offers an accurate tool for identifying intercellular heterogeneity and the diversity of immune cells. However, the role of RBPs in the regulation of various cells, especially AT2 cells, remains elusive.
    METHODS: A scRNA-seq dataset (GSE173896) and a bulk RNA-seq dataset acquired from airway tissues (GSE124180) were employed for data mining. Next, RNA-seq analysis was performed in both COPD and control patients. Differentially expressed genes (DEGs) were identified using criteria of fold change (FC ≥ 1.5 or ≤ 1.5) and P value ≤ 0.05. Lastly, Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and alternative splicing identification analyses were carried out.
    RESULTS: RBP genes exhibited specific expression patterns across different cell groups and participated in cell proliferation and mitochondrial dysfunction in AT2 cells. As an RBP, AZGP1 expression was upregulated in both the scRNA-seq and RNA-seq datasets. It might potentially be a candidate immune biomarker that regulates COPD progression by modulating AT2 cell proliferation and adhesion by regulating the expression of SAMD5, DNER, DPYSL3, GBP5, GBP3, and KCNJ2. Moreover, AZGP1 regulated alternative splicing events in COPD, particularly DDAH1 and SFRP1, holding significant implications in COPD.
    CONCLUSIONS: RBP gene AZGP1 inhibits epithelial cell proliferation by regulating genes participating in alternative splicing in COPD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项研究基于FDA不良事件报告系统(FAERS)数据库进行了药物警戒分析,以比较吸入或鼻用倍氯米松的感染风险,氟替卡松,布地奈德,环索奈德,莫米松,曲安奈德.
    我们使用比例失衡分析来评估ICS/INC与感染事件之间的相关性。数据是从2015年4月至2023年9月的FAERS数据库中提取的。进一步分析其临床特点,感染部位,以及ICS和INCs感染不良事件(AEs)的病原菌。我们使用气泡图来显示它们的前5个感染不良事件。
    我们分析了21,837例与ICS和INCs相关的感染不良事件报告,平均年龄为62.12岁。其中,61.14%的感染报告与女性有关。据报道,氟替卡松感染的三分之一发生在下呼吸道,布地奈德,Ciclesonidec,和莫米松;曲安奈德报告的感染中有40%以上是眼部感染;倍氯米松引起的口腔感染率为7.39%。倍氯米松引起的真菌和病毒感染的报告率分别为21.15%和19.2%,分别。布地奈德和西索奈德引起的分枝杆菌感染分别占3.29%和2.03%,分别。气泡图显示ICS组有更多的真菌感染,口腔感染,肺炎,支气管炎,等。INCs组有更多的眼部症状,鼻炎,鼻窦炎,鼻咽炎,等。
    使用ICS和INCs的女性更容易发生感染事件。与布地奈德相比,氟替卡松似乎有较高的肺炎和口腔念珠菌病的风险。莫米松可能导致更多的上呼吸道感染。倍氯米松的口腔感染风险较高。倍氯米松会导致更多的真菌和病毒感染,而环索奈德和布地奈德更容易感染分枝杆菌。
    UNASSIGNED: This study conducted a pharmacovigilance analysis based on the FDA Adverse Event Reporting System (FAERS) database to compare the infection risk of inhaled or nasal Beclomethasone, Fluticasone, Budesonide, Ciclesonide, Mometasone, and Triamcinolone Acetonide.
    UNASSIGNED: We used proportional imbalance analysis to evaluate the correlation between ICS /INCs and infection events. The data was extracted from the FAERS database from April 2015 to September 2023. Further analysis was conducted on the clinical characteristics, site of infection, and pathogenic bacteria of ICS and INCs infection adverse events (AEs). We used bubble charts to display their top 5 infection adverse events.
    UNASSIGNED: We analyzed 21,837 reports of infection AEs related to ICS and INCs, with an average age of 62.12 years. Among them, 61.14% of infection reports were related to females. One-third of infections reported to occur in the lower respiratory tract with Fluticasone, Budesonide, Ciclesonidec, and Mometasone; over 40% of infections reported by Triamcinolone Acetonide were eye infections; the rate of oral infections caused by Beclomethasone were 7.39%. The reported rates of fungal and viral infections caused by beclomethasone were 21.15% and 19.2%, respectively. The mycobacterial infections caused by Budesonide and Ciclesonidec account for 3.29% and 2.03%, respectively. Bubble plots showed that the ICS group had more fungal infections, oral infections, pneumonia, tracheitis, etc. The INCs group had more eye symptoms, rhinitis, sinusitis, nasopharyngitis, etc.
    UNASSIGNED: Women who use ICS and INCs are more prone to infection events. Compared to Budesonide, Fluticasone seemed to have a higher risk of pneumonia and oral candidiasis. Mometasone might lead to more upper respiratory tract infections. The risk of oral infection was higher with Beclomethasone. Beclomethasone causes more fungal and viral infections, while Ciclesonide and Budesonide are more susceptible to mycobacterial infections.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    阑尾炎是由阑尾腔阻塞或血液供应终止引起的炎症,导致阑尾坏死,随后继发细菌感染。TYROBP基因与阑尾炎护理的关系尚不清楚。从GPL571产生的基因表达综合数据库下载阑尾炎数据集GSE9579概况。筛选差异表达基因,其次是加权基因共表达网络分析,功能富集分析,基因集富集分析,蛋白质相互作用网络的构建与分析,比较毒性基因组学数据库分析,和免疫浸润分析。绘制基因表达水平的热图。总共鉴定了1570个差异表达的基因。根据基因本体论分析,它们主要富集在有机酸代谢过程中,凝聚染色体动粒,氧化还原酶活性。在京都基因和基因组分析百科全书,它们主要集中在代谢途径,P53信号通路,PPAR信号通路。加权基因共表达网络分析中的软阈值功率设为12。通过对蛋白质-蛋白质相互作用网络的构建和分析,5个核心基因(FCGR2A,IL1B,ITGAM,获得TLR2、TYROBP)。核心基因表达水平的热图显示TYROBP在阑尾炎样品中的高表达。比较毒性基因组学数据库分析发现,核心基因(FCGR2A,IL1B,ITGAM,TLR2、TYROBP)与腹痛密切相关,胃肠功能障碍,发烧,和炎症的发生。TYROBP基因在阑尾炎中高表达,TYROBP基因表达越高,预后越差。TYROBP可作为阑尾炎及其护理的分子靶标。
    Appendicitis is an inflammation caused by obstruction of the appendiceal lumen or termination of blood supply leading to appendiceal necrosis followed by secondary bacterial infection. The relationship between TYROBP gene and the nursing of appendicitis remains unclear. The appendicitis dataset GSE9579 profile was downloaded from the gene expression omnibus database generated from GPL571. Differentially expressed genes were screened, followed by weighted gene co-expression network analysis, functional enrichment analysis, gene set enrichment analysis, construction and analysis of protein-protein interaction network, Comparative Toxicogenomics Database analysis, and immune infiltration analysis. Heatmaps of gene expression levels were plotted. A total of 1570 differentially expressed genes were identified. According to gene ontology analysis, they were mainly enriched in organic acid metabolic process, condensed chromosome kinetochore, oxidoreductase activity. In Kyoto Encyclopedia of Gene and Genome analysis, they mainly concentrated in metabolic pathways, P53 signaling pathway, PPAR signaling pathway. The soft threshold power in weighted gene co-expression network analysis was set to 12. Through the construction and analysis of protein-protein interaction network, 5 core genes (FCGR2A, IL1B, ITGAM, TLR2, TYROBP) were obtained. Heatmap of core gene expression levels revealed high expression of TYROBP in appendicitis samples. Comparative Toxicogenomics Database analysis found that core genes (FCGR2A, IL1B, ITGAM, TLR2, TYROBP) were closely related to abdominal pain, gastrointestinal dysfunction, fever, and inflammation occurrence. TYROBP gene is highly expressed in appendicitis, and higher expression of TYROBP gene indicates worse prognosis. TYROBP may serve as a molecular target for appendicitis and its nursing.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    基于骨架节点的视频动作识别是计算机视觉领域的一个突出问题。在实际应用场景中,个体间大量的骨架节点和行为遮挡问题严重影响识别的速度和准确性。因此,提出了一种轻量级的多流特征交叉融合(L-MSFCF)模型来识别格斗等异常行为,恶毒的踢,爬过墙壁,etal.,基于轻量级骨架节点计算,可以明显提高识别速度,基于遮挡骨架节点预测分析提高识别精度,以有效解决行为遮挡问题。实验表明,我们提出的All-MSFCF模型对8种异常行为的视频动作识别平均准确率为92.7%。尽管我们提出的轻量级L-MSFCF模型的平均准确率为87.3%,其平均识别速度比全骨架识别模型高62.7%,更适合解决实时跟踪问题。此外,我们提出的轨迹预测跟踪(TPT)模型可以根据动态选择的核心骨架节点计算实时预测运动位置,特别是对于具有较低平均丢失误差的15帧和30帧内的短期预测。
    Video action recognition based on skeleton nodes is a highlighted issue in the computer vision field. In real application scenarios, the large number of skeleton nodes and behavior occlusion problems between individuals seriously affect recognition speed and accuracy. Therefore, we proposed a lightweight multi-stream feature cross-fusion (L-MSFCF) model to recognize abnormal behaviors such as fighting, vicious kicking, climbing over the wall, et al., which could obviously improve recognition speed based on lightweight skeleton node calculation, and improve recognition accuracy based on occluded skeleton node prediction analysis in order to effectively solve the behavior occlusion problem. The experiments show that our proposed All-MSFCF model has a video action recognition average accuracy rate of 92.7% for eight kinds of abnormal behavior recognition. Although our proposed lightweight L-MSFCF model has an 87.3% average accuracy rate, its average recognition speed is 62.7% higher than the full-skeleton recognition model, which is more suitable for solving real-time tracing problems. Moreover, our proposed Trajectory Prediction Tracking (TPT) model could real-time predict the moving positions based on the dynamically selected core skeleton node calculation, especially for the short-term prediction within 15 frames and 30 frames that have lower average loss errors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:文档级别的生物医学关系提取(Bio-DocRE)涉及从跨越多个句子的生物医学文本中提取关系实例,通常包含各种实体概念,如基因,疾病,化学品,变体,等目前,此任务通常基于图形或变压器来实现。然而,大多数工作直接将实体特征建模为关系预测,忽略实体对信息作为关系预测中间状态的有效性。在这篇文章中,我们将此任务分离为一个三阶段过程,以捕获足够的信息来改善关系预测。
    结果:我们为Bio-DocRE提出了一个创新的框架HTGRS,它构造了一个层次树图(HTG)来集成文档中的关键信息源,实现基于实体的关系推理。此外,受到语义分割思想的启发,我们将任务概念化为表填充问题,并开发了关系分割(RS)模块来增强基于实体对的关系推理。在三个数据集上的大量实验表明,所提出的框架优于最先进的方法,并实现了卓越的性能。
    方法:我们的源代码可在https://github.com/pusheryjy/HTGRS获得。
    背景:补充数据可在Bioinformatics在线获得。
    BACKGROUND: Biomedical relation extraction at the document level (Bio-DocRE) involves extracting relation instances from biomedical texts that span multiple sentences, often containing various entity concepts such as genes, diseases, chemicals, variants, etc. Currently, this task is usually implemented based on graphs or transformers. However, most work directly models entity features to relation prediction, ignoring the effectiveness of entity pair information as an intermediate state for relation prediction. In this article, we decouple this task into a three-stage process to capture sufficient information for improving relation prediction.
    RESULTS: We propose an innovative framework HTGRS for Bio-DocRE, which constructs a hierarchical tree graph (HTG) to integrate key information sources in the document, achieving relation reasoning based on entity. In addition, inspired by the idea of semantic segmentation, we conceptualize the task as a table-filling problem and develop a relation segmentation (RS) module to enhance relation reasoning based on the entity pair. Extensive experiments on three datasets show that the proposed framework outperforms the state-of-the-art methods and achieves superior performance.
    METHODS: Our source code is available at https://github.com/passengeryjy/HTGRS.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    建立了正四面体模型,以通过高分辨率质谱来刺穿四元成分中溶解有机物(DOM)的分馏。该模型可以立体可视化DOM的分子式,以根据正四面体中的位置显示对每个组件的偏好。随后开发了一种分类方法,将分子式分为与分馏比有关的15类,证明其相对变化与质量峰面积的不确定性一致。以胞外聚合物分层与OrbitrapMS耦合为例,以垃圾渗滤液处理和污水处理厂的7种污泥为例,验证了正四面体模型的实用性,呈现分层污泥絮体中的DOM化学多样性。敏感性分析证明,在四个模型参数的扰动下,分类结果相对稳定。根据正四面体模型的分类结果,多项逻辑回归分析可以进一步帮助识别分子性质对DOM分馏的影响。该模型提供了一种方法,用于评估从固体或半固体成分中顺序提取DOM的特异性,并简化了四元成分分馏系数的复杂数学表达式。
    A regular tetrahedron model was established to pierce the fractionation of dissolved organic matter (DOM) among quaternary components by using high-resolution mass spectrometry. The model can stereoscopically visualize molecular formulas of DOM to show the preference to each component according to the position in a regular tetrahedron. A classification method was subsequently developed to divide molecular formulas into 15 categories related to fractionation ratios, the relative change of which was demonstrated to be convergent with the uncertainty of mass peak area. The practicality of the regular tetrahedron model was verified by seven kinds of sludge from waste leachate treatment and sewage wastewater treatment plants by using stratification of extracellular polymeric substances coupled with Orbitrap MS as an example, presenting the DOM chemodiversity in stratified sludge flocs. Sensitivity analysis proved that classification results were relatively stable with the perturbation of four model parameters. Multinomial logistic regression analysis could further help identify the effect of molecular properties on the fractionation of DOM based on the classification results of the regular tetrahedron model. This model offers a methodology for the assessment of specificity of sequential extraction on DOM from solid or semisolid components and simplifies the complex mathematical expression of fractionation coefficients for quaternary components.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号