Marker genes

标记基因
  • 文章类型: Journal Article
    脊髓神经系统的损伤通常会导致永久性的感觉丧失,电机,和自主功能。准确识别脊髓神经的细胞状态极为重要,可以促进新的治疗和康复策略的开发。用于鉴定脊髓神经发育的现有实验技术是劳动密集型且昂贵的。在这项研究中,我们开发了一个机器学习预测器,ScnML,用于预测脊髓神经细胞亚群以及识别标记基因。在训练数据集上评估了ScnML的预测性能,准确率为94.33%。基于XGBoost,ScnML在测试数据集上达到94.08%94.24%,94.26%,精度为94.24%,召回,和F1测量分数,分别。重要的是,ScnML通过模型解释和生物景观分析确定了新的重要基因。ScnML可以成为预测脊髓神经元细胞状态的强大工具,快速有效地揭示潜在的特定生物标志物,并为精准医学和康复康复提供重要见解。
    Injuries to the spinal cord nervous system often result in permanent loss of sensory, motor, and autonomic functions. Accurately identifying the cellular state of spinal cord nerves is extremely important and could facilitate the development of new therapeutic and rehabilitative strategies. Existing experimental techniques for identifying the development of spinal cord nerves are both labor-intensive and costly. In this study, we developed a machine learning predictor, ScnML, for predicting subpopulations of spinal cord nerve cells as well as identifying marker genes. The prediction performance of ScnML was evaluated on the training dataset with an accuracy of 94.33%. Based on XGBoost, ScnML on the test dataset achieved 94.08% 94.24%, 94.26%, and 94.24% accuracies with precision, recall, and F1-measure scores, respectively. Importantly, ScnML identified new significant genes through model interpretation and biological landscape analysis. ScnML can be a powerful tool for predicting the status of spinal cord neuronal cells, revealing potential specific biomarkers quickly and efficiently, and providing crucial insights for precision medicine and rehabilitation recovery.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    感染的常见结果是异常的免疫反应,这可能对主机有害。为了控制感染,免疫系统可能会受到调节,因此产生过量的促炎或抗炎途径,可导致广泛的炎症,组织损伤,器官衰竭。失调的免疫应答可以表现为分化的免疫细胞群体和循环生物标志物浓度的变化。为了提出一种早期诊断系统,能够区分并识别免疫失调综合征的严重程度,我们建立了一个人工智能工具,使用来自单细胞RNA测序的输入数据。在我们的结果中,单细胞转录组学成功区分了轻度和重度脓毒症和COVID-19感染。此外,通过解释我们分类系统的决策模式,我们发现不同的免疫细胞上调或下调CD3,CD14,CD16,FOSB,S100A12和TCRrδ能准确区分不同程度的感染。我们的研究已经确定了有效区分感染的重要基因,作为诊断标志物提供了有希望的前景,并为治疗干预提供了潜在的目标。
    A common result of infection is an abnormal immune response, which may be detrimental to the host. To control the infection, the immune system might undergo regulation, therefore producing an excess of either pro-inflammatory or anti-inflammatory pathways that can lead to widespread inflammation, tissue damage, and organ failure. A dysregulated immune response can manifest as changes in differentiated immune cell populations and concentrations of circulating biomarkers. To propose an early diagnostic system that enables differentiation and identifies the severity of immune-dysregulated syndromes, we built an artificial intelligence tool that uses input data from single-cell RNA sequencing. In our results, single-cell transcriptomics successfully distinguished between mild and severe sepsis and COVID-19 infections. Moreover, by interpreting the decision patterns of our classification system, we identified that different immune cells upregulating or downregulating the expression of the genes CD3, CD14, CD16, FOSB, S100A12, and TCRɣδ can accurately differentiate between different degrees of infection. Our research has identified genes of significance that effectively distinguish between infections, offering promising prospects as diagnostic markers and providing potential targets for therapeutic intervention.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    大脑调节鱼类的多种生理过程。尽管如此,关于非模型鱼类不同大脑区域的基本结构和功能的知识仍然有限,因为它们的多样性和常见生物标志物的稀缺性。在本研究中,大脑的四个主要部分,端脑,间脑,中脑和菱形脑,被隔离在大嘴鲈鱼中,小昆虫。在这些部分中,通过形态学和细胞结构分析进一步鉴定了9个脑区和74个细胞核.转录组分析显示总共7153个区域高表达基因和176个区域特异性表达基因。与生长有关的基因,繁殖,情感,学习,和记忆在嗅球和端脑(OBT)中明显过表达。喂养和应激相关基因位于下丘脑(Hy)。视觉系统相关基因主要富集在视神经顶盖(OT),而视觉和听觉相关基因在小脑(Ce)区域广泛表达。与感觉输入和运动输出相关的基因位于延髓(Mo)中。宇宙调节,应激反应,睡眠/觉醒周期,与繁殖相关的基因在其余大脑(RB)中高表达。进一步确定了每个大脑区域的三个候选标记基因,如OBT的神经肽FF(NPFF),Hy的促黑色素浓缩激素(pmch),用于OT的囊泡抑制性氨基酸转运蛋白(viaat),Ce的兴奋性氨基酸转运蛋白1(eaat1),为Mo,和用于RB的isotocinneurophysin(itnp)。此外,通过检查标记基因的表达,分析了7种神经递质型神经元和5种非神经元细胞在不同脑区的分布。值得注意的是,谷氨酸能和GABA能神经元的标记基因在所有大脑区域显示出最高的表达水平。同样,与其他标记相比,放射状星形胶质细胞的标记基因表现出高表达,而小胶质细胞的表达最少。总的来说,我们的结果全面概述了大嘴鲈鱼不同大脑区域的结构和功能特征,这为理解中枢神经系统在调节硬骨鱼生理过程中的作用提供了宝贵的资源。
    The brain regulates multiple physiological processes in fish. Despite this, knowledge about the basic structure and function of distinct brain regions in non-model fish species remains limited due to their diversity and the scarcity of common biomarkers. In the present study, four major brain parts, the telencephalon, diencephalon, mesencephalon and rhombencephalon, were isolated in largemouth bass, Micropterus salmoides. Within these parts, nine brain regions and 74 nuclei were further identified through morphological and cytoarchitectonic analysis. Transcriptome analysis revealed a total of 7153 region-highly expressed genes and 176 region-specifically expressed genes. Genes related to growth, reproduction, emotion, learning, and memory were significantly overexpressed in the olfactory bulb and telencephalon (OBT). Feeding and stress-related genes were in the hypothalamus (Hy). Visual system-related genes were predominantly enriched in the optic tectum (OT), while vision and hearing-related genes were widely expressed in the cerebellum (Ce) region. Sensory input and motor output-related genes were in the medulla oblongata (Mo). Osmoregulation, stress response, sleep/wake cycles, and reproduction-related genes were highly expressed in the remaining brain (RB). Three candidate marker genes were further identified for each brain regions, such as neuropeptide FF (npff) for OBT, pro-melanin-concentrating hormone (pmch) for Hy, vesicular inhibitory amino acid transporter (viaat) for OT, excitatory amino acid transporter 1 (eaat1) for Ce, peripherin (prph) for Mo, and isotocin neurophysin (itnp) for RB. Additionally, the distribution of seven neurotransmitter-type neurons and five types of non-neuronal cells across different brain regions were analyzed by examining the expression of their marker genes. Notably, marker genes for glutamatergic and GABAergic neurons showed the highest expression levels across all brain regions. Similarly, the marker gene for radial astrocytes exhibited high expression compared to other markers, while those for microglia were the least expressed. Overall, our results provide a comprehensive overview of the structural and functional characteristics of distinct brain regions in the largemouth bass, which offers a valuable resource for understanding the role of central nervous system in regulating physiological processes in teleost.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:随着与胎盘功能障碍相关的发育编程效应的重要性日益增加,更多的研究致力于改善胎盘特征在健康和疾病中的表征和理解.胎盘是一种短暂但动态的器官,可适应胎儿发育的变化需求以及整个怀孕期间母体供应的可用资源。滋养层(细胞滋养层,合胞体滋养层,和绒毛外滋养层)是胎盘特异性细胞类型,负责主要的胎盘交换和适应。具有单细胞分辨率的转录组研究在理解胎盘在健康和疾病中的作用方面取得了进展。这些研究,然而,通常显示不同胎盘细胞类型的表征差异。
    目的:我们旨在回顾从使用单细胞RNA测序(scRNAseq)获得的有关胎盘结构和功能的知识,然后比较细胞类型特异性基因,突出它们的异同。此外,我们打算在研究中确定各种滋养层细胞类型的共有标记基因.最后,我们将讨论scRNAseq在研究妊娠相关疾病中的贡献和潜在应用。
    方法:我们进行了全面的系统文献综述,以确定不同的细胞类型及其在人类母胎界面的功能,重点关注2023年3月之前发表的关于胎盘的所有原始scRNAseq研究以及使用PubMed搜索发表的评论(共确定28项研究).我们的方法涉及管理先前使用scRNAseq定义的细胞类型和亚型,并比较用作标记或鉴定为潜在新标记的基因。接下来,我们重新分析了来自六个可用的带有细胞注释的scRNAseq原始数据集的表达矩阵(四个来自前三个月,两个在足月),使用Wilcoxon秩和检验比较研究中的基因表达,并注释孕早期和足月胎盘中的滋养层细胞标志物。此外,我们整合了来自18个健康孕早期和9个足月胎盘的scRNAseq原始数据,并进行了聚类和差异基因表达分析。我们进一步将通过对注释和原始数据集的分析获得的标记与文献进行比较,以获得主要胎盘细胞类型的常见签名基因列表。
    结果:采样地点的变化,胎龄,胎儿性别,以及随后的测序和分析方法在研究之间进行了观察。尽管它们的比例各不相同,在所有scRNAseq研究中,这三种滋养层类型都得到了一致的鉴定,不同于其他非滋养层细胞类型。值得注意的是,对于所研究的任何细胞类型,所有研究均未共享标记基因.此外,一项研究中大多数新定义的标志物在其他研究中未观察到.我们对滋养层细胞类型的分析证实了这些差异,在每项研究中都鉴定出数百个潜在的标记基因,但在研究中几乎没有重叠。从35.461和23.378细胞的高质量在前三个月和足月胎盘,分别,我们获得了主要的胎盘细胞类型,包括以前在妊娠早期未发现的血管周围细胞。重要的是,基于我们广泛的研究,我们的荟萃分析提供了主要胎盘细胞类型的标记基因.
    结论:这篇综述和荟萃分析强调了从scRNAseq数据中注释胎盘细胞类型建立共识的必要性。这里鉴定的标记基因可以用于定义人类胎盘细胞类型,从而促进和提高滋养层细胞注释的可重复性。
    BACKGROUND: With increasing significance of developmental programming effects associated with placental dysfunction, more investigations are devoted to improving the characterization and understanding of placental signatures in health and disease. The placenta is a transitory but dynamic organ adapting to the shifting demands of fetal development and available resources of the maternal supply throughout pregnancy. Trophoblasts (cytotrophoblasts, syncytiotrophoblasts, and extravillous trophoblasts) are placental-specific cell types responsible for the main placental exchanges and adaptations. Transcriptomic studies with single-cell resolution have led to advances in understanding the placenta\'s role in health and disease. These studies, however, often show discrepancies in characterization of the different placental cell types.
    OBJECTIVE: We aim to review the knowledge regarding placental structure and function gained from the use of single-cell RNA sequencing (scRNAseq), followed by comparing cell-type-specific genes, highlighting their similarities and differences. Moreover, we intend to identify consensus marker genes for the various trophoblast cell types across studies. Finally, we will discuss the contributions and potential applications of scRNAseq in studying pregnancy-related diseases.
    METHODS: We conducted a comprehensive systematic literature review to identify different cell types and their functions at the human maternal-fetal interface, focusing on all original scRNAseq studies on placentas published before March 2023 and published reviews (total of 28 studies identified) using PubMed search. Our approach involved curating cell types and subtypes that had previously been defined using scRNAseq and comparing the genes used as markers or identified as potential new markers. Next, we reanalyzed expression matrices from the six available scRNAseq raw datasets with cell annotations (four from first trimester and two at term), using Wilcoxon rank-sum tests to compare gene expression among studies and annotate trophoblast cell markers in both first trimester and term placentas. Furthermore, we integrated scRNAseq raw data available from 18 healthy first trimester and nine term placentas, and performed clustering and differential gene expression analysis. We further compared markers obtained with the analysis of annotated and raw datasets with the literature to obtain a common signature gene list for major placental cell types.
    RESULTS: Variations in the sampling site, gestational age, fetal sex, and subsequent sequencing and analysis methods were observed between the studies. Although their proportions varied, the three trophoblast types were consistently identified across all scRNAseq studies, unlike other non-trophoblast cell types. Notably, no marker genes were shared by all studies for any of the investigated cell types. Moreover, most of the newly defined markers in one study were not observed in other studies. These discrepancies were confirmed by our analysis on trophoblast cell types, where hundreds of potential marker genes were identified in each study but with little overlap across studies. From 35 461 and 23 378 cells of high quality in the first trimester and term placentas, respectively, we obtained major placental cell types, including perivascular cells that previously had not been identified in the first trimester. Importantly, our meta-analysis provides marker genes for major placental cell types based on our extensive curation.
    CONCLUSIONS: This review and meta-analysis emphasizes the need for establishing a consensus for annotating placental cell types from scRNAseq data. The marker genes identified here can be deployed for defining human placental cell types, thereby facilitating and improving the reproducibility of trophoblast cell annotation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    脊髓损伤(SCI)是一种严重致残和毁灭性的神经系统疾病,显著影响患者生活质量。它给患者和家属带来难以承受的心理和经济压力,给社会带来沉重负担。
    在这项研究中,我们整合了数据集GSE5296和GSE47681作为训练组,分析假手术组和SCI组小鼠之间的基因突变,并基于差异表达基因进行了基因本体论(GO)富集分析和京都基因和基因组百科全书(KEGG)富集分析。随后,我们进行了加权基因相关网络分析(WGCNA)和Lasso回归分析.
    我们确定了四个特征性疾病基因:Icam1,Ch25h,Plaur和Tm4sf1。我们检查了SCI和免疫细胞之间的关系,并通过PCR和免疫组织化学实验验证了已鉴定的疾病相关基因在SCI大鼠中的表达。
    总而言之,我们已经确定并验证了与SCI相关的四个基因:Icam1,Ch25h,Plaur和Tm4sf1,可以为SCI治疗提供见解。
    UNASSIGNED: Spinal cord injury (SCI) is a profoundly disabling and devastating neurological condition, significantly impacting patients\' quality of life. It imposes unbearable psychological and economic pressure on both patients and their families, as well as placing a heavy burden on society.
    UNASSIGNED: In this study, we integrated datasets GSE5296 and GSE47681 as training groups, analyzed gene variances between sham group and SCI group mice, and conducted Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis based on the differentially expressed genes. Subsequently, we performed Weighted Gene Correlation Network Analysis (WGCNA) and Lasso regression analyses.
    UNASSIGNED: We identified four characteristic disease genes: Icam1, Ch25h, Plaur and Tm4sf1. We examined the relationship between SCI and immune cells, and validated the expression of the identified disease-related genes in SCI rats using PCR and immunohistochemistry experiments.
    UNASSIGNED: In conclusion, we have identified and verified four genes related to SCI: Icam1, Ch25h, Plaur and Tm4sf1, which could offer insights for SCI treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    年龄相关性听力损失(ARHL)是听力损失的最常见原因,也是影响全球老年人最普遍的疾病之一。尽管我们的实验室和其他人证明了它的多基因性质,对特定基因知之甚少,细胞类型,和参与ARHL的途径,阻碍治疗干预的发展。在这份手稿中,我们描述,第一次,在远交小鼠模型中使用snRNA-seq的衰老小鼠耳蜗的完整细胞类型特异性转录组与听觉阈值变化的关系。使用无监督聚类来识别耳蜗细胞类型,并通过三层方法进行注释-首先通过链接到已知标记基因的表达,然后使用NSForest算法选择最小簇特异性标记基因并减少维度特征空间,以将我们的簇与gEAR网站上现有的公开数据集进行统计比较,最后,通过使用多重错误稳健荧光原位杂交(MERFISH)和簇特异性标记基因作为探针来验证和完善注释。我们报告了60种独特的细胞类型,将定义的耳蜗细胞类型的数量增加了两倍以上。重要的是,我们显示出显著的特定细胞类型的增加和减少与听力敏锐度的丧失有关,涉及毛细胞亚型的特定亚群,神经节细胞亚型,在该ARHL模型中,血管纹内的细胞亚型。这些结果提供了对负责年龄相关听力损失的细胞和分子机制以及治疗靶向途径的看法。
    Age-related hearing loss (ARHL) is the most common cause of hearing loss and one of the most prevalent conditions affecting the elderly worldwide. Despite evidence from our lab and others about its polygenic nature, little is known about the specific genes, cell types, and pathways involved in ARHL, impeding the development of therapeutic interventions. In this manuscript, we describe, for the first time, the complete cell-type specific transcriptome of the aging mouse cochlea using snRNA-seq in an outbred mouse model in relation to auditory threshold variation. Cochlear cell types were identified using unsupervised clustering and annotated via a three-tiered approach-first by linking to expression of known marker genes, then using the NSForest algorithm to select minimum cluster-specific marker genes and reduce dimensional feature space for statistical comparison of our clusters with existing publicly-available data sets on the gEAR website, and finally, by validating and refining the annotations using Multiplexed Error Robust Fluorescence In Situ Hybridization (MERFISH) and the cluster-specific marker genes as probes. We report on 60 unique cell-types expanding the number of defined cochlear cell types by more than two times. Importantly, we show significant specific cell type increases and decreases associated with loss of hearing acuity implicating specific subsets of hair cell subtypes, ganglion cell subtypes, and cell subtypes within the stria vascularis in this model of ARHL. These results provide a view into the cellular and molecular mechanisms responsible for age-related hearing loss and pathways for therapeutic targeting.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    肝细胞癌(HCC)与丙型肝炎病毒(HCV)感染有关,是潜在的危险因素。尽管如此,由病毒引发的精确遗传调控机制,导致病毒诱导的肝癌发生,仍然不清楚。我们假设HCV蛋白可能通过调节途径调节异常甲基化HCC基因的活性。病毒-宿主调节途径,蛋白质之间的相互作用,基因表达,运输,和稳定性调节,是使用ANDSystem重建的。基因表达调控具有统计学意义。基因网络分析确定了70个HCC标记基因中的4个,其通过病毒蛋白的表达调节可能与HCC相关:DNA结合蛋白抑制剂ID-1(ID1),皮瓣核酸内切酶1(FEN1),细胞周期蛋白依赖性激酶抑制剂2A(CDKN2A),端粒酶逆转录酶(TERT)。它表明HCV/人蛋白杂合物中的以下病毒蛋白效应:HCVNS3(p70)蛋白激活人STAT3和NOTC1;NS2-3(p23),NS5B(p68),NS1(E2),并且核心(p21)激活SETD2;NS5A抑制SMYD3;并且NS3抑制CCN2。有趣的是,当NS3和E1(gp32)正调节CDKN2A时激活c-Jun,当抑制TERT时抑制c-Jun。发现的调节机制可能是创建药物和预防性治疗以降低HCV感染期间HCC发展的可能性的重点领域。
    Hepatocellular carcinoma (HCC) has been associated with hepatitis C viral (HCV) infection as a potential risk factor. Nonetheless, the precise genetic regulatory mechanisms triggered by the virus, leading to virus-induced hepatocarcinogenesis, remain unclear. We hypothesized that HCV proteins might modulate the activity of aberrantly methylated HCC genes through regulatory pathways. Virus-host regulatory pathways, interactions between proteins, gene expression, transport, and stability regulation, were reconstructed using the ANDSystem. Gene expression regulation was statistically significant. Gene network analysis identified four out of 70 HCC marker genes whose expression regulation by viral proteins may be associated with HCC: DNA-binding protein inhibitor ID - 1 (ID1), flap endonuclease 1 (FEN1), cyclin-dependent kinase inhibitor 2A (CDKN2A), and telomerase reverse transcriptase (TERT). It suggested the following viral protein effects in HCV/human protein heterocomplexes: HCV NS3(p70) protein activates human STAT3 and NOTC1; NS2-3(p23), NS5B(p68), NS1(E2), and core(p21) activate SETD2; NS5A inhibits SMYD3; and NS3 inhibits CCN2. Interestingly, NS3 and E1(gp32) activate c-Jun when it positively regulates CDKN2A and inhibit it when it represses TERT. The discovered regulatory mechanisms might be key areas of focus for creating medications and preventative therapies to decrease the likelihood of HCC development during HCV infection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:基因表达的单细胞RNA测序(scRNA-seq)测量显示出研究水稻根系细胞异质性的巨大前景。由于固有的高维度和稀疏性,如何精确地注释细胞身份是植物scRNA-seq分析中尚未解决的主要问题。
    结果:为了应对这一挑战,我们提出了NRTPredictor,一个整体学习系统,通过完整的模型可解释性来预测水稻根细胞阶段并挖掘生物标志物。使用测试数据集评估了NRTPredictor的性能,准确率为98.01%,召回率为95.45%。凭借NRTPredictor提供的可解释性的能力,我们的模型识别了110个部分参与苯丙素生物合成的标记基因。水稻根系的表达模式可以通过上述候选基因定位,显示了NRTPredictor的优越性。对scRNA和大量RNA-seq数据的综合分析显示,洪水中表皮细胞亚群的异常表达,Pi,盐的压力。
    结论:综合来看,我们的结果表明,NRTPredictor是自动预测水稻根细胞阶段的有用工具,并为破译水稻根细胞异质性和洪水的分子机制提供了宝贵的资源,Pi,盐的压力。基于所提出的模型,一个免费的网络服务器已经建立,这是在https://www。cgris.net/nrtp.
    BACKGROUND: Single-cell RNA sequencing (scRNA-seq) measurements of gene expression show great promise for studying the cellular heterogeneity of rice roots. How precisely annotating cell identity is a major unresolved problem in plant scRNA-seq analysis due to the inherent high dimensionality and sparsity.
    RESULTS: To address this challenge, we present NRTPredictor, an ensemble-learning system, to predict rice root cell stage and mine biomarkers through complete model interpretability. The performance of NRTPredictor was evaluated using a test dataset, with 98.01% accuracy and 95.45% recall. With the power of interpretability provided by NRTPredictor, our model recognizes 110 marker genes partially involved in phenylpropanoid biosynthesis. Expression patterns of rice root could be mapped by the above-mentioned candidate genes, showing the superiority of NRTPredictor. Integrated analysis of scRNA and bulk RNA-seq data revealed aberrant expression of Epidermis cell subpopulations in flooding, Pi, and salt stresses.
    CONCLUSIONS: Taken together, our results demonstrate that NRTPredictor is a useful tool for automated prediction of rice root cell stage and provides a valuable resource for deciphering the rice root cellular heterogeneity and the molecular mechanisms of flooding, Pi, and salt stresses. Based on the proposed model, a free webserver has been established, which is available at https://www.cgris.net/nrtp .
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    单细胞RNA测序(scRNA-seq)允许获得单个细胞的基因组和转录组谱。这些数据使得在细胞水平上表征组织成为可能。在这种情况下,利用scRNA-seq数据的主要分析之一是识别组织内的细胞类型以估计细胞群的定量组成。由于大量可用的scRNA-seq数据,细胞分型的自动分类方法,基于最新的深度学习技术,是需要的。这里,我们提出了基因本体驱动的广泛和深度学习(GOWDL)模型,用于对几种组织中的细胞类型进行分类。GOWDL实现了一种混合体系结构,该体系结构考虑了基因本体论中发现的功能注释和特定细胞类型的典型标记基因。我们进行了交叉验证和独立的外部测试,将我们的算法与其他12种最先进的预测因子进行比较。分类评分表明,GOWDL在五种不同的组织中达到了最好的结果,除了召回,我们得到了大约92%的最佳工具,而97%的最佳工具。最后,我们介绍了一项基于GOWDL的分级方法对乳腺癌免疫细胞群进行分类的案例研究.
    Single-cell RNA-sequencing (scRNA-seq) allows for obtaining genomic and transcriptomic profiles of individual cells. That data make it possible to characterize tissues at the cell level. In this context, one of the main analyses exploiting scRNA-seq data is identifying the cell types within tissue to estimate the quantitative composition of cell populations. Due to the massive amount of available scRNA-seq data, automatic classification approaches for cell typing, based on the most recent deep learning technology, are needed. Here, we present the gene ontology-driven wide and deep learning (GOWDL) model for classifying cell types in several tissues. GOWDL implements a hybrid architecture that considers the functional annotations found in Gene Ontology and the marker genes typical of specific cell types. We performed cross-validation and independent external testing, comparing our algorithm with 12 other state-of-the-art predictors. Classification scores demonstrated that GOWDL reached the best results over five different tissues, except for recall, where we got about 92% versus 97% of the best tool. Finally, we presented a case study on classifying immune cell populations in breast cancer using a hierarchical approach based on GOWDL.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:太平洋牡蛎,Crassostreagigas,是世界上重要的经济贝类。通过遗传育种,已经做出了巨大的努力来提高其生长速率。然而,候选标记基因,通路,和参与牡蛎生长调节的潜在lncRNAs在很大程度上仍然未知。为了识别基因,lncRNAs,和参与生长调节的途径,C.gigasspat在低温(15℃)下培养,以产生生长抑制模型,用常温(25℃)培养的spat进行比较转录组分析。
    结果:总计,在正常生长的牡蛎之间鉴定出8627个差异表达基因(DEGs)和1072个差异表达lncRNAs(DELs)(在25℃下培养,以下简称NG)和缓慢生长的牡蛎(在15℃下培养,以下简称SG)。功能富集分析表明,这些DEGs大多富集在AMPK信号通路中,MAPK信号通路,胰岛素信号通路,自噬,凋亡,钙信号通路,和内吞过程。LncRNAs分析确定了265个顺式作用对和618个反式作用对,它们可能参与牡蛎生长调节。LNC_001270,LNC_003322,LNC_011563,LNC_006260和LNC_012905的表达水平可诱导培养温度和食物丰度。这些lncRNAs位于反义,上游,或SREBP1/p62,CDC42,CaM,FAS,和PIK3CA基因,分别。此外,反式作用lncRNAs的表达,包括XR_9000022.2,LNC_008019,LNC_015817,LNC_000838,LNC_00839,LNC_011859,LNC_007294,LNC_006429,XR_002198885.1和XR_902224.2也与AMPK信号通路中富集基因的表达显著相关,胰岛素信号通路,自噬,凋亡,钙信号通路,和内吞过程。
    结论:在这项研究中,我们确定了关键的生长相关基因和lncRNAs,它们可以用作候选标记来说明太平洋牡蛎生长调节的分子机制.
    BACKGROUND: The Pacific oyster, Crassostrea gigas, is an economically important shellfish around the world. Great efforts have been made to improve its growth rate through genetic breeding. However, the candidate marker genes, pathways, and potential lncRNAs involved in oyster growth regulation remain largely unknown. To identify genes, lncRNAs, and pathways involved in growth regulation, C. gigas spat was cultured at a low temperature (15 ℃) to yield a growth-inhibited model, which was used to conduct comparative transcriptome analysis with spat cultured at normal temperature (25 ℃).
    RESULTS: In total, 8627 differentially expressed genes (DEGs) and 1072 differentially expressed lncRNAs (DELs) were identified between the normal-growth oysters (cultured at 25 ℃, hereinafter referred to as NG) and slow-growth oysters (cultured at 15 ℃, hereinafter referred to as SG). Functional enrichment analysis showed that these DEGs were mostly enriched in the AMPK signaling pathway, MAPK signaling pathway, insulin signaling pathway, autophagy, apoptosis, calcium signaling pathway, and endocytosis process. LncRNAs analysis identified 265 cis-acting pairs and 618 trans-acting pairs that might participate in oyster growth regulation. The expression levels of LNC_001270, LNC_003322, LNC_011563, LNC_006260, and LNC_012905 were inducible to the culture temperature and food abundance. These lncRNAs were located at the antisense, upstream, or downstream of the SREBP1/p62, CDC42, CaM, FAS, and PIK3CA genes, respectively. Furthermore, the expression of the trans-acting lncRNAs, including XR_9000022.2, LNC_008019, LNC_015817, LNC_000838, LNC_00839, LNC_011859, LNC_007294, LNC_006429, XR_002198885.1, and XR_902224.2 was also significantly associated with the expression of genes enriched in AMPK signaling pathway, insulin signaling pathway, autophagy, apoptosis, calcium signaling pathway, and endocytosis process.
    CONCLUSIONS: In this study, we identified the critical growth-related genes and lncRNAs that could be utilized as candidate markers to illustrate the molecular mechanisms underlying the growth regulation of Pacific oysters.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号