differential expression

差异表达
  • 文章类型: Journal Article
    糖基化是一种非酶翻译后修饰,来自蛋白质中还原糖和游离氨基之间的反应,早期糖基化产物(果糖基赖氨酸,FL)和晚期糖基化终产物(AGEs)形成。糖基化和AGEs的积累与肝细胞癌(HCC)的发生密切相关。这里,我们报道了使用具有稳定同位素标记的组织蛋白质组学在HCC中的差异糖基化的表征;早期糖基化修饰的肽用硼酸酯亲和色谱(BAC)富集,和AGEs修饰的肽用碱性反相分离进行分馏。通过这种综合方法,以不超过1%的错误发现率(FDR)鉴定对应于1484种蛋白质上4007位点的3717和1137种早期和晚期糖基化肽。一百五十五个位点用早期和晚期末端糖基化产物修饰。5种早期和7种晚期糖基化肽被定量为相对于配对的邻近组织在HCC组织中差异表达。先前已经报道了对应于差异糖化肽的大多数(10个中的8个)蛋白质在HCC中具有失调。这些结果可以加深我们对糖基化的了解,并为治疗提供见解。
    Glycation is a non-enzymatic posttranslational modification coming from the reaction between reducing sugars and free amino groups in proteins, where early glycation products (fructosyl-lysine, FL) and advanced glycation end products (AGEs) are formed. The occurrence of glycation and accumulation of AGEs have been closely associated with hepatocellular carcinoma (HCC). Here, we reported the characterization of differential glycation in HCC using tissue proteomics with stable isotopic labeling; early glycation-modified peptides were enriched with boronate affinity chromatography (BAC), and AGEs-modified peptides were fractionated with basic reversed-phase separation. By this integrated approach, 3717 and 1137 early and advanced glycated peptides corresponding to 4007 sites on 1484 proteins were identified with a false discovery rate (FDR) of no more than 1%. One hundred fifty-five sites were modified with both early and advanced end glycation products. Five early and 7 advanced glycated peptides were quantified to be differentially expressed in HCC tissues relative to paired adjacent tissues. Most (8 out of 10) of the proteins corresponding to the differential glycated peptides have previously been reported with dysregulation in HCC. The results together may deepen our knowledge of glycation as well as provide insights for therapeutics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    dorsalis(Hendel)(双翅目:Tephritidae)是全球最具破坏性的农业害虫之一,因为它具有很高的繁殖和入侵能力。对其性腺发育特征的阐明和性别相关基因的鉴定将为基于生殖的害虫防治提供有用的遗传基础。这里,对背芽孢杆菌的性腺转录组进行了测序,并分析了新的性腺特异性表达基因。在睾丸(TE)中发现了1338、336、35和479个差异表达基因(DEGs),卵巢(OV),女性附件腺(FAG),和男性附件腺(MAG),分别。此外,确定了463个高表达的性腺特异性基因,TE具有最高数量的特定高表达基因,402,其次是OV的51,9在MAG中,FAG中只有1个。引人注目的是,大约一半的高表达性腺特异性基因未表征。然后,发现202个未表征的高表达TE特异性基因中的35、17、3、2和1个编码含有跨膜结构域的蛋白质,信号肽,高流动性组盒,锌指域,和BTB/POZ域,分别。有趣的是,大约40%的未表征的高度表达的性腺特异性基因编码蛋白质没有预测具有功能基序或结构域。最后,分析了六个新的高表达性腺特异性基因的时空表达和序列表征。总之,我们的研究结果为未来性别相关基因的功能分析和害虫防治的潜在靶位点提供了有价值的数据集.
    Bactrocera dorsalis (Hendel) (Diptera: Tephritidae) is one of the most devastating agricultural pests worldwide due to its high reproductive and invasive abilities. The elucidation of its gonadal developmental characteristics and the identification of sex-related genes will provide a useful genetic basis for reproductive-based pest control. Here, the gonadal transcriptome of B. dorsalis was sequenced, and novel gonad-specific expressed genes were analyzed. A total of 1338, 336, 35, and 479 differentially expressed genes (DEGs) were found in the testis (TE), ovary (OV), female accessory gland (FAG), and male accessory gland (MAG), respectively. Furthermore, 463 highly expressed gonad-specific genes were identified, with the TE having the highest number of specific highly expressed genes, at 402, followed by 51 in the OV, 9 in the MAG, and only 1 in the FAG. Strikingly, approximately half of highly expressed gonad-specific genes were uncharacterized. Then, it was found that 35, 17, 3, 2, and 1 of 202 uncharacterized highly expressed TE-specific genes encoded proteins that contained transmembrane domains, signal peptides, high-mobility group boxes, the zinc finger domain, and the BTB/POZ domain, respectively. Interestingly, approximately 40% of uncharacterized highly expressed gonad-specific genes encoding proteins were not predicted to possess functional motifs or domains. Finally, the spatiotemporal expression and sequence characterization of six novel highly expressed gonad-specific genes were analyzed. Altogether, our findings provide a valuable dataset for future functional analyses of sex-related genes and potential target sites for pest control.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    作为古老的生物,树蕨类植物作为低等和高等植物物种之间的进化桥梁起着至关重要的作用,提供各种功利主义利益。然而,他们面临着过度开发等挑战,气候变化,不利的环境条件,和害虫,导致保护问题。在这项研究中,我们概述了两种典型的树蕨类植物叶片的代谢和转录组资源,A.spinulosa和A.metteniana,并首次探索抗性基因。代谢组的景观表明,化合物skimmin可能具有医学意义。共检测到111种差异累积代谢物(DAMs),途径富集分析突出了14条显著富集的途径,包括2-氧代羧酸代谢可能与环境适应有关。共发现14639个差异表达基因(DEGs),其中606个为抗性(R)基因。我们确定BAM1为显著差异表达的R基因,是R基因相互作用网络中的核心基因之一。最大似然系统发育树和PPI网络都揭示了BAM1,FLS2和TMK之间的密切关系。此外,BAM1与新绿原酸和山奈酚-7-O-葡萄糖苷呈显著正相关。这些代谢物,以其抗氧化和抗炎特性而闻名,可能在树蕨类植物的防御反应中起着至关重要的作用。这项研究提供了有价值的见解之间的代谢和转录组的差异,spinulosa和A.metteniana,增强我们对树蕨类植物抗性基因的理解。
    As ancient organisms, tree ferns play a crucial role as an evolutionary bridge between lower and higher plant species, providing various utilitarian benefits. However, they face challenges such as overexploitation, climate change, adverse environmental conditions, and insect pests, resulting in conservation concerns. In this study, we provide an overview of metabolic and transcriptomic resources of leaves in two typical tree ferns, A. spinulosa and A. metteniana, and explore the resistance genes for the first time. The landscape of metabolome showed that the compound skimmin may hold medicinal significance. A total of 111 differentially accumulated metabolites (DAMs) were detected, with pathway enrichment analysis highlighting 14 significantly enriched pathways, including 2-oxocarboxylic acid metabolism possibly associated with environmental adaptations. A total of 14,639 differentially expressed genes (DEGs) were found, among which 606 were resistance (R) genes. We identified BAM1 as a significantly differentially expressed R gene, which is one of the core genes within the R gene interaction network. Both the maximum-likelihood phylogenetic tree and the PPI network revealed a close relationship between BAM1, FLS2, and TMK. Moreover, BAM1 showed a significant positive correlation with neochlorogenic acid and kaempferol-7-O-glucoside. These metabolites, known for their antioxidant and anti-inflammatory properties, likely play a crucial role in the defense response of tree ferns. This research provides valuable insights into the metabolic and transcriptomic differences between A. spinulosa and A. metteniana, enhancing our understanding of resistance genes in tree ferns.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着基因检测技术的发展,我们发现了许多不同的基因,lncRNA就是其中之一。LncRNAs是指长度超过200bp的非蛋白质编码RNA分子。是LUAD等人类恶性疾病研究的重点之一。LncRNAs作为癌基因或抑制剂调节肿瘤的发生和进展。LncRNAs的差异表达通过影响细胞增殖促进或抑制肺腺癌的进展,转移,入侵,和细胞凋亡,从而影响患者的预后和生存率。因此,LncRNAs可以作为癌症诊断和治疗的潜在靶点。通过检测肿瘤标志物对该病进行早期诊断。由于肺腺癌早期不易诊断,肿瘤标志物容易忽视,LncRNAs在肺腺癌的诊断和治疗中起着重要作用。本文的主要目的是总结LncRNAs对肺腺癌的已知作用,LncRNAs差异表达对肺腺癌进展的影响,以及相关的信号转导通路。并为今后肺腺癌相关LncRNAs的研究提供新的思路。
    With the development of gene testing technology, we have found many different genes, and lncRNA is one of them. LncRNAs refer to a non-protein coding RNA molecule with a length of more than 200bp, which is one of the focuses of research on human malignant diseases such as LUAD. LncRNAs act as an oncogene or inhibitor to regulate the occurrence and progression of tumors. The differential expression of LncRNAs promotes or inhibits the progression of lung adenocarcinoma by affecting cell proliferation, metastasis, invasion, and apoptosis, thus affecting the prognosis and survival rate of patients. Therefore, LncRNAs can be used as a potential target for diagnosis and treatment of cancer. The early diagnosis of the disease was made through the detection of tumor markers. Because lung adenocarcinoma is not easy to diagnose in the early stage and tumor markers are easy to ignore, LncRNAs play an important role in the diagnosis and treatment of lung adenocarcinoma. The main purpose of this article is to summarize the known effects of LncRNAs on lung adenocarcinoma, the effect of differential expression of LncRNAs on the progression of lung adenocarcinoma, and related signal transduction pathways. And to provide a new idea for the future research of lung adenocarcinoma-related LncRNAs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    RNA-Seq数据分析是基因组学研究的重要组成部分,将庞大而复杂的数据集转化为有意义的生物学见解。这是一个以快速发展和持续创新为标志的领域,任何寻求释放RNA-Seq数据潜力的人都需要彻底了解。在这一章中,我们描述了RNA-seq数据分析的复杂局面,阐明一个全面的管道,导航通过整个复杂的过程。从质量控制开始,本章强调确保RNA-seq数据的完整性至关重要,因为它为后续分析奠定了基础。然后处理预处理,其中原始序列数据经过必要的修改和增强,设置校准阶段的阶段。这个阶段涉及将处理过的序列映射到参考基因组,解码这些序列的起源和功能的关键步骤。进入RNA-seq分析的核心,然后,本章探讨了差异表达分析-鉴定在不同条件或样本组中表现出不同表达水平的基因的过程。认识到这些差异表达基因的生物学背景至关重要;因此,本章过渡到功能分析。这里,基因本体论和通路分析等方法和工具有助于在更广泛的生物学框架内了解已识别基因的作用和相互作用。然而,本章并不停留在传统的分析方法上。拥抱不断发展的数据科学范式,它深入研究了RNA-seq数据的机器学习应用,在降维以及无监督和监督学习方面引入先进技术。这些方法允许在数据中辨别通过传统方法可能难以察觉的模式和关系。
    RNA-Seq data analysis stands as a vital part of genomics research, turning vast and complex datasets into meaningful biological insights. It is a field marked by rapid evolution and ongoing innovation, necessitating a thorough understanding for anyone seeking to unlock the potential of RNA-Seq data. In this chapter, we describe the intricate landscape of RNA-seq data analysis, elucidating a comprehensive pipeline that navigates through the entirety of this complex process. Beginning with quality control, the chapter underscores the paramount importance of ensuring the integrity of RNA-seq data, as it lays the groundwork for subsequent analyses. Preprocessing is then addressed, where the raw sequence data undergoes necessary modifications and enhancements, setting the stage for the alignment phase. This phase involves mapping the processed sequences to a reference genome, a step pivotal for decoding the origins and functions of these sequences.Venturing into the heart of RNA-seq analysis, the chapter then explores differential expression analysis-the process of identifying genes that exhibit varying expression levels across different conditions or sample groups. Recognizing the biological context of these differentially expressed genes is pivotal; hence, the chapter transitions into functional analysis. Here, methods and tools like Gene Ontology and pathway analyses help contextualize the roles and interactions of the identified genes within broader biological frameworks. However, the chapter does not stop at conventional analysis methods. Embracing the evolving paradigms of data science, it delves into machine learning applications for RNA-seq data, introducing advanced techniques in dimension reduction and both unsupervised and supervised learning. These approaches allow for patterns and relationships to be discerned in the data that might be imperceptible through traditional methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    通过捕获scRNA-seq数据的复杂特征来分析细胞类型之间的差异表达(DE)至关重要。最近,已经开发了基于不同建模框架的针对scRNA-seq数据分析的不同方法,假设,考虑各种数据特征的策略和检验统计量。scDEA是最近发展起来的一种基于集成学习的DE分析方法,使用兰开斯特的组合产生p值,由12种单独的DE分析方法生成,并产生比单独方法更准确和稳定的结果。我们研究的目的是提出一种新的基于集成学习的DE分析方法,scHD4E,仅在4种不同的方法中使用最佳表演者。通过使用六个真实scRNA-seq数据集的评估过程,选择了最佳执行者4种方法。我们对五个实验数据集进行了全面的实验,以基于样本量效应评估我们提出的方法,批处理效果,I型错误控制,基因本体论富集分析,运行时,鉴定出匹配的DE基因,方法之间的语义相似性度量。我们还执行类似的分析(除了最后三个术语),并计算性能度量,如准确性,F1得分,马修相关系数等。用于模拟数据集。结果表明,在上述所有方面,scHD4E的性能都优于所有个体和scDEA方法。我们希望scHD4E将为现代数据科学家提供服务,以检测scRNA-seq数据分析中的DEG。为了实现我们提出的方法,已经开发了一个GithubR包scHD4E及其闪亮的应用程序,并在以下链接中提供:https://github.com/bbiswas1989/scHD4E和https://github.com/bbiswas1989/scHD4E-Shiny。
    Differential expression (DE) analysis between cell types for scRNA-seq data by capturing its complicated features is crucial. Recently, different methods have been developed for targeting the scRNA-seq data analysis based on different modeling frameworks, assumptions, strategies and test statistic in considering various data features. The scDEA is an ensemble learning-based DE analysis method developed recently, yielding p-values using Lancaster\'s combination, generated by 12 individual DE analysis methods, and producing more accurate and stable results than individual methods. The objective of our study is to propose a new ensemble learning-based DE analysis method, scHD4E, using top performers in only 4 separate methods. The top performer 4 methods have been selected through an evaluation process using six real scRNA-seq data sets. We conducted comprehensive experiments for five experimental data sets to evaluate our proposed method based on the sample size effects, batch effects, type I error control, gene ontology enrichment analysis, runtime, identified matched DE genes, and semantic similarity measurement between methods. We also perform similar analyses (except the last 3 terms) and compute performance measures like accuracy, F1 score, Mathew\'s correlation coefficient etc. for a simulated data set. The results show that scHD4E is performs better than all the individual and scDEA methods in all the above perspectives. We expect that scHD4E will serve the modern data scientists for detecting the DEGs in scRNA-seq data analysis. To implement our proposed method, a Github R package scHD4E and its shiny application has been developed, and available in the following links: https://github.com/bbiswas1989/scHD4E and https://github.com/bbiswas1989/scHD4E-Shiny.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    转录组学分析已广泛用于比较实验,以揭示各种物种的生物学机制。然而,仍然缺乏一个简单的工具来优化和集成来自多个R包的功能。在这项研究中,我们开发了TOmicsVis(转录组学可视化)(CRAN:https://cran。r-project.org/package=TOmicsVis,v2.0.0),一个R包,为转录组学分析和可视化提供全面的解决方案。它利用46个R包来设计40个合适的函数,用于多组转录组项目的流线型分析,它涵盖六个主要类别:样本统计,性状分析,差异表达,高级分析,GO和KEGG浓缩,和表操作。TOmicsVis可以在本地或在线执行(https://shiny。hiplot。cn/tomicsvis-shiny/),这为无需编码培训的研究人员提供了极大的便利。这些用户友好的可视化功能和内置的分析功能使研究人员能够及时监测实验数据动态并快速探索转录组学数据。
    Transcriptomic analysis has been widely used in comparative experiments to uncover biological mechanisms in various species. However, a simple tool is still lacking to optimize and integrate the features from multiple R packages. In this study, we developed TOmicsVis (Transcriptomics Visualization) (CRAN: https://cran.r-project.org/package=TOmicsVis, v2.0.0), an R package that provides a comprehensive solution for transcriptomics analysis and visualization. It utilizes 46 R packages to design 40 suitable functions for the streamlined analysis of multigroup transcriptomic projects, which covers six main categories: Sample Statistics, Traits Analysis, Differential Expression, Advanced Analysis, GO and KEGG Enrichment, and Table Operation. TOmicsVis can be performed either locally or online (https://shiny.hiplot.cn/tomicsvis-shiny/), which provides significant convenience for researchers without coding training. These user-friendly visualization functions and built-in analysis capabilities enable researchers to monitor experimental data dynamics promptly and explore transcriptomics data quickly.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:基于成像的空间分辨转录组学(im-SRT)技术的最新进展现在能够实现靶向基因及其在固定组织中位置的高通量谱分析。基因表达数据的标准化通常需要考虑可能混淆潜在生物信号的技术因素。
    结果:这里,我们研究了不同基因计数归一化方法与不同靶向基因面板在分析和解释im-SRT数据中的潜在影响.使用不同的模拟基因面板,过度代表在特定组织区域或细胞类型中表达的基因,我们证明了基于每个细胞检测到的基因计数的归一化方法如何以区域或细胞类型特定的方式差异影响归一化的基因表达量。我们表明,这些标准化诱导效应可能会降低下游分析的可靠性,包括差异基因表达,基因折叠变化,和空间可变基因分析,引入假阳性和假阴性的结果相比,从基因面板获得的结果是更有代表性的组织的组成细胞类型的基因表达。使用不使用检测到的基因计数进行基因表达幅度调整的归一化方法未观察到这些效果。如细胞体积或细胞面积归一化。
    结论:我们建议在可行的情况下使用基于非基因计数的标准化方法,并在必要时使用基于基因计数的标准化方法之前评估基因面板代表性。总的来说,我们提醒标准化方法和基因面板的选择可能会影响im-SRT数据的生物学解释.
    Recent advances in imaging-based spatially resolved transcriptomics (im-SRT) technologies now enable high-throughput profiling of targeted genes and their locations in fixed tissues. Normalization of gene expression data is often needed to account for technical factors that may confound underlying biological signals.
    Here, we investigate the potential impact of different gene count normalization methods with different targeted gene panels in the analysis and interpretation of im-SRT data. Using different simulated gene panels that overrepresent genes expressed in specific tissue regions or cell types, we demonstrate how normalization methods based on detected gene counts per cell differentially impact normalized gene expression magnitudes in a region- or cell type-specific manner. We show that these normalization-induced effects may reduce the reliability of downstream analyses including differential gene expression, gene fold change, and spatially variable gene analysis, introducing false positive and false negative results when compared to results obtained from gene panels that are more representative of the gene expression of the tissue\'s component cell types. These effects are not observed with normalization approaches that do not use detected gene counts for gene expression magnitude adjustment, such as with cell volume or cell area normalization.
    We recommend using non-gene count-based normalization approaches when feasible and evaluating gene panel representativeness before using gene count-based normalization methods if necessary. Overall, we caution that the choice of normalization method and gene panel may impact the biological interpretation of the im-SRT data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:Maedi-visna病毒(MVV)是一种感染绵羊单核细胞/巨噬细胞谱系细胞的慢病毒,山羊,和野生反刍动物,导致肺炎,乳腺炎,关节炎,和脑炎。对MVV感染的免疫反应是复杂的,缺乏对其感染和发病机制的全面了解。本研究使用RNA测序技术研究了暴露于MVV的绵羊的肺组织的体内转录组模式。
    结果:结果表明2,739个基因显著差异表达,1,643个下调基因和1,096个上调基因。发现了许多对于MVV感染可能是独特的变量。基因本体论分析显示,在与免疫系统和对病毒感染的生物学反应直接相关的方面,有很大比例的基因被富集。《京都基因和基因组百科全书》分析显示,最丰富的途径与病毒-宿主细胞相互作用和炎症反应有关。许多免疫相关基因,包括编码几种细胞因子和干扰素调节因子的那些,在差异表达基因(DEGs)的蛋白质-蛋白质相互作用网络中进行了鉴定。使用实时聚合酶链反应和蛋白质印迹分析评估DEGs的表达。CXCL13、CXCL6、CXCL11、CCR1、CXCL8、CXCL9、CXCL10、TNFSF8、TNFRSF8、IL7R、IFN-γ,CCL2和MMP9上调。进行免疫组织化学分析以鉴定浸润MVV感染组织的免疫细胞的类型。B细胞,CD4+和CD8+T细胞,巨噬细胞是与肺部MVV感染相关的最普遍的免疫细胞。
    结论:总体而言,这项研究的发现提供了对体内宿主对MVV感染反应的全面理解,并为自然宿主发病机制的基因调控网络提供了新的视角.
    BACKGROUND: Maedi-visna virus (MVV) is a lentivirus that infects monocyte/macrophage lineage cells in sheep, goats, and wild ruminants and causes pneumonia, mastitis, arthritis, and encephalitis. The immune response to MVV infection is complex, and a complete understanding of its infection and pathogenesis is lacking. This study investigated the in vivo transcriptomic patterns of lung tissues in sheep exposed to MVV using the RNA sequencing technology.
    RESULTS: The results indicated that 2,739 genes were significantly differentially expressed, with 1,643 downregulated genes and 1,096 upregulated genes. Many variables that could be unique to MVV infections were discovered. Gene Ontology analysis revealed that a significant proportion of genes was enriched in terms directly related to the immune system and biological responses to viral infections. Kyoto Encyclopedia of Genes and Genomes analysis revealed that the most enriched pathways were related to virus-host cell interactions and inflammatory responses. Numerous immune-related genes, including those encoding several cytokines and interferon regulatory factors, were identified in the protein-protein interaction network of differentially expressed genes (DEGs). The expression of DEGs was evaluated using real-time polymerase chain reaction and western blot analysis. CXCL13, CXCL6, CXCL11, CCR1, CXCL8, CXCL9, CXCL10, TNFSF8, TNFRSF8, IL7R, IFN-γ, CCL2, and MMP9 were upregulated. Immunohistochemical analysis was performed to identify the types of immune cells that infiltrated MVV-infected tissues. B cells, CD4+ and CD8+ T cells, and macrophages were the most prevalent immune cells correlated with MVV infection in the lungs.
    CONCLUSIONS: Overall, the findings of this study provide a comprehensive understanding of the in vivo host response to MVV infection and offer new perspectives on the gene regulatory networks that underlie pathogenesis in natural hosts.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    RNA测序(RNA-seq)已成为评估全基因组基因表达的强大工具。彻底改变了生物学的各个领域。然而,分析大型RNA-seq数据集可能具有挑战性,特别是对于缺乏生物信息学经验的学生或研究人员。为了应对这些挑战,我们提供了一个全面的指南,以提供分析RNA-seq数据的分步工作流程,从原始读数到功能富集分析,从实验设计的考虑开始。这是为了帮助学生和研究人员与任何有机体一起工作,无论组装的基因组是否可用。在本指南中,我们使用各种公认的生物信息学工具来浏览RNA-seq分析,并讨论相同任务的不同工具的优缺点。我们的协议侧重于清晰度,再现性,和实用性,使用户能够轻松浏览RNA-seq数据分析的复杂性,并从数据集中获得有价值的生物学见解。此外,GitHub存储库中提供了所有脚本和示例数据集,以促进分析管道的实施。©2024作者WileyPeriodicalsLLC出版的当前协议。基本方案1:来自具有可用参考基因组的模型植物的数据分析基本方案2:基因本体论富集分析基本方案3:来自非模型植物的数据的从头组装。
    RNA sequencing (RNA-seq) has emerged as a powerful tool for assessing genome-wide gene expression, revolutionizing various fields of biology. However, analyzing large RNA-seq datasets can be challenging, especially for students or researchers lacking bioinformatics experience. To address these challenges, we present a comprehensive guide to provide step-by-step workflows for analyzing RNA-seq data, from raw reads to functional enrichment analysis, starting with considerations for experimental design. This is designed to aid students and researchers working with any organism, irrespective of whether an assembled genome is available. Within this guide, we employ various recognized bioinformatics tools to navigate the landscape of RNA-seq analysis and discuss the advantages and disadvantages of different tools for the same task. Our protocol focuses on clarity, reproducibility, and practicality to enable users to navigate the complexities of RNA-seq data analysis easily and gain valuable biological insights from the datasets. Additionally, all scripts and a sample dataset are available in a GitHub repository to facilitate the implementation of the analysis pipeline. © 2024 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Analysis of data from a model plant with an available reference genome Basic Protocol 2: Gene ontology enrichment analysis Basic Protocol 3: De novo assembly of data from non-model plants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号