high-throughput nucleotide sequencing

高通量核苷酸测序
  • 文章类型: Journal Article
    高通量技术的快速发展,特别是下一代测序(NGS),通过研究遗传变异如SNP,彻底改变了癌症研究,拷贝数变化,基因表达,和蛋白质水平。这些技术提升了精准肿瘤学的重要性,创造了对生物标志物识别和验证的需求。这篇综述探讨了肿瘤学的复杂相互作用,癌症生物学,和生物信息学工具,强调统计学习中的挑战,实验验证,数据处理,和质量控制支撑着这个变革领域。本文概述了生物信息学工具在癌症基因组学研究中的方法和应用。包含数据结构工具,途径分析,网络分析,用于分析生物标志物签名的工具,体细胞变体解释,基因组数据分析,和可视化工具。开源工具和存储库,如癌症基因组图谱(TCGA),基因组数据共用空间(GDC),cBioPortal,UCSC基因组浏览器,ArrayExpress,和基因表达综合(GEO)已经出现,以简化癌症组学数据分析。生物信息学对癌症研究产生了重大影响,发现新的生物标志物,驱动突变,致癌途径,和治疗目标。整合多组数据,网络分析,和晚期ML将是未来生物标志物发现和患者预后预测的关键。
    The rapid advancement of high-throughput technologies, particularly next-generation sequencing (NGS), has revolutionized cancer research by enabling the investigation of genetic variations such as SNPs, copy number variations, gene expression, and protein levels. These technologies have elevated the significance of precision oncology, creating a demand for biomarker identification and validation. This review explores the complex interplay of oncology, cancer biology, and bioinformatics tools, highlighting the challenges in statistical learning, experimental validation, data processing, and quality control that underpin this transformative field. This review outlines the methodologies and applications of bioinformatics tools in cancer genomics research, encompassing tools for data structuring, pathway analysis, network analysis, tools for analyzing biomarker signatures, somatic variant interpretation, genomic data analysis, and visualization tools. Open-source tools and repositories like The Cancer Genome Atlas (TCGA), Genomic Data Commons (GDC), cBioPortal, UCSC Genome Browser, Array Express, and Gene Expression Omnibus (GEO) have emerged to streamline cancer omics data analysis. Bioinformatics has significantly impacted cancer research, uncovering novel biomarkers, driver mutations, oncogenic pathways, and therapeutic targets. Integrating multi-omics data, network analysis, and advanced ML will be pivotal in future biomarker discovery and patient prognosis prediction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Case Reports
    背景:2B型血管性血友病(VWD)是一种不太常见的亚型,难以诊断。该病例报告和文献综述重点介绍了罕见的2B型VWD新生儿发病,最初误诊为新生儿同种免疫性血小板减少症(NAIT)。
    方法:新生儿出现严重血小板减少症,对NAIT治疗无反应。由于血小板减少症家族史不清楚,因此进行了基因检测。
    结果:下一代测序显示p.Arg1306TrpvonWillebrand因子变体,确认2BVWD类型。
    结论:这项研究强调了基因检测在诊断具有挑战性的新生儿血小板减少症病例中的关键作用。不管家族史,目的阐明新生儿发病2B型VWD的临床表现和病程。
    BACKGROUND: Type 2B von Willebrand disease (VWD) is a less common subtype and is difficult to diagnose. This case report and literature review highlights a rare neonatal onset of type 2B VWD initially misdiagnosed as neonatal alloimmune thrombocytopenia (NAIT).
    METHODS: The neonate presented with severe thrombocytopenia and was unresponsive to NAIT treatments. Genetic testing was conducted because of the unclear family history of thrombocytopenia.
    RESULTS: Next-generation sequencing revealed a p.Arg1306Trp von Willebrand factor variant, confirming type 2B VWD.
    CONCLUSIONS: This study underscores the critical role of genetic testing in diagnosing challenging cases of neonatal thrombocytopenia, irrespective of family history, and aims to elucidate the clinical manifestations and course of neonatal onset type 2B VWD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    生物医学技术的最新进展和高维下一代测序(NGS)数据集的激增导致数据的体积和密度显着增长。NGS高维数据,以大量基因组学为特征,转录组学,蛋白质组学,和相对于生物样本数量的宏基因组学特征,对降低特征维数提出了重大挑战。NGS数据的高维度对数据分析提出了重大挑战,包括增加的计算负担,潜在的过拟合,以及解释结果的困难。特征选择和特征提取是通过降低数据的维数来解决这些挑战的两种关键技术,从而提高模型性能,可解释性,和计算效率。特征选择和特征提取可以分为统计和机器学习方法。本研究对各种统计数据进行了全面和比较的审查,机器学习,以及专门为人类的NGS和微阵列数据解释量身定制的基于深度学习的特征选择和提取技术。进行了彻底的文献检索,以收集有关这些技术的信息,专注于基于阵列和NGS的数据分析。各种技术,包括深度学习架构,机器学习算法,和统计方法,已经探索了微阵列,批量RNA-Seq,和单细胞,此处调查的基于单细胞RNA-Seq(scRNA-Seq)技术的数据集。该研究概述了这些技术,突出它们的应用,优势,以及高维NGS数据上下文中的局限性。这篇评论为读者应用特征选择和特征提取技术来增强预测模型的性能提供了更好的见解,揭示潜在的生物模式,并获得对庞大而复杂的NGS和微阵列数据的更深入的见解。
    Recent advancements in biomedical technologies and the proliferation of high-dimensional Next Generation Sequencing (NGS) datasets have led to significant growth in the bulk and density of data. The NGS high-dimensional data, characterized by a large number of genomics, transcriptomics, proteomics, and metagenomics features relative to the number of biological samples, presents significant challenges for reducing feature dimensionality. The high dimensionality of NGS data poses significant challenges for data analysis, including increased computational burden, potential overfitting, and difficulty in interpreting results. Feature selection and feature extraction are two pivotal techniques employed to address these challenges by reducing the dimensionality of the data, thereby enhancing model performance, interpretability, and computational efficiency. Feature selection and feature extraction can be categorized into statistical and machine learning methods. The present study conducts a comprehensive and comparative review of various statistical, machine learning, and deep learning-based feature selection and extraction techniques specifically tailored for NGS and microarray data interpretation of humankind. A thorough literature search was performed to gather information on these techniques, focusing on array-based and NGS data analysis. Various techniques, including deep learning architectures, machine learning algorithms, and statistical methods, have been explored for microarray, bulk RNA-Seq, and single-cell, single-cell RNA-Seq (scRNA-Seq) technology-based datasets surveyed here. The study provides an overview of these techniques, highlighting their applications, advantages, and limitations in the context of high-dimensional NGS data. This review provides better insights for readers to apply feature selection and feature extraction techniques to enhance the performance of predictive models, uncover underlying biological patterns, and gain deeper insights into massive and complex NGS and microarray data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Case Reports
    背景:广州管圆线虫病是由广州管圆线虫病幼虫引起的严重但罕见的寄生虫感染。人类这种食源性疾病的主要特征是嗜酸性粒细胞性脑膜炎。最近,全球报告的病例逐渐增加。由于缺乏典型的临床症状,标志,和特定的实验室测试,这种疾病的早期诊断带来了重大挑战。未能及时诊断和治疗这种情况可能导致死亡。
    方法:我们介绍一例13岁男性患者,最初出现发热和头痛。患者初步诊断为细菌性脑膜炎,并接受抗菌药物治疗。然而,病人的病情恶化了,他发展了进行性意识障碍。最终,脑脊液样本的宏基因组下一代测序(mNGS)测试表明广州管圆线虫感染。阿苯达唑和泼尼松治疗后,病人完全康复了。我们将此病例报告作为文献综述的一部分,以强调mNGS在广州管圆线虫病早期诊断中的潜在应用。
    结论:mNGS技术在广州管圆线虫病的诊断中起着至关重要的作用。随着这项技术的不断发展和应用,我们相信它将在诊断中发挥越来越重要的作用,治疗,并监测广州管圆线虫病。
    BACKGROUND: Angiostrongyliasis cantonensis is a severe yet rare parasitic infection caused by the larvae of Angiostrongylus cantonensis. The primary characteristic feature of this foodborne illness in humans is eosinophilic meningitis. Recently, there has been a gradual increase in reported cases globally. Due to the lack of typical clinical symptoms, signs, and specific laboratory tests, early diagnosis of this disease poses significant challenges. Failure to diagnose and treat this condition promptly can result in fatalities.
    METHODS: We present the case of a 13-year-old male patient who initially presented with fever and headache. The patient was preliminarily diagnosed with bacterial meningitis and received treatment with antibacterial drugs. However, the patient\'s condition worsened, and he developed progressive consciousness disturbances. Eventually, metagenomic next-generation sequencing (mNGS) testing of cerebrospinal fluid samples indicated Angiostrongylus cantonensis infection. Following treatment with albendazole and prednisone, the patient made a full recovery. We include this case report as part of a literature review to emphasize the potential applications of mNGS in the early diagnosis of Angiostrongyliasis cantonensis.
    CONCLUSIONS: mNGS technology plays a crucial role in the diagnosis of angiostrongyliasis cantonensis. As this technology continues to evolve and be applied, we believe it will play an increasingly important role in diagnosing, treating, and monitoring angiostrongyliasis cantonensis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Systematic Review
    研究口腔内的个体生态位是理解抗菌素耐药性基因(ARGs)分布的合理的第一步;然而,它不能代表整个口服耐药组。我们系统评价的目的是通过审查个体生态位的组成来提供口服耐药组的图谱。在2015年1月至2023年8月期间,从所有英语出版物中检索到了580篇论文,这些论文调查了五个电子数据库中口头ARG的存在。本综述包括15项研究[10项PCR和5项下一代测序(NGS)]。方法的异质性排除了荟萃分析。ARGs存在于整个口腔中,在6个位置上确定了158个独特的ARGs-上和牙龈下生物膜,粘膜,口咽,根管系统(RCS)和唾液。牙龈上生物膜具有最高的抗性丰富度,而RCS最少。四环素是发现的主要抗菌素耐药性(AMR)类别。确定了三个核心基因-tet(M),tet(O)和ermB。这篇综述强调了NGS研究全面表征口腔耐药组的必要性。这是未来组学研究的逻辑基础,以真正理解耐药组的范围及其对AMR的贡献。
    Studying individual ecological niches within the oral cavity is a logical first step to understanding the distribution of antimicrobial resistance genes (ARGs); however, it is not representative of the whole oral resistome. The aim of our systematic review was to provide a map of the oral resistome by reviewing the composition of individual niches. A total of 580 papers were retrieved from a search of all English language publications investigating the presence of oral ARGs in five electronic databases between January 2015 and August 2023. Fifteen studies [10 PCR and 5 next-generation sequencing (NGS)] were included in this review. The heterogeneity of methods precluded meta-analysis. ARGs are present throughout the oral cavity with 158 unique ARGs identified across 6 locations - supra and sub-gingival biofilm, mucosa, oropharynx, root canal system (RCS) and saliva. The supragingival biofilm had the highest resistome richness, while the RCS had the least. Tetracycline was the dominant antimicrobial resistance (AMR) class found. Three core genes were identified - tet(M), tet(O) and ermB.This review highlights the necessity of NGS studies to comprehensively characterize the oral resistome in its entirety. This is the logical foundation for future \'omics studies to truly understand the scope of the resistome and its contribution to AMR.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    人畜共患病毒被广泛视为未来大流行的主要威胁。蝙蝠是最多样化的哺乳动物,有1400多个物种分布在地球上的大多数栖息地。到目前为止,31个已知的病毒家族与蝙蝠有关,尽管对大多数病毒的了解不足。不断努力去发现,了解并监控这些蝙蝠病毒,因此是一个公共卫生领域。此系统综述旨在对报告PubMed中发现的新型蝙蝠病毒的出版物进行分类,Scopus,和WebofScience数据库,在2018-2022年的5年内。各种实验参数,包括采样位置,方法论,蝙蝠物种多样性,与已知病毒相似,新病毒的物种划分,和基因组测序策略,从41种出版物中提取并分析。总的来说,在2018年至2022年之间,从19个病毒家族中鉴定出72种新病毒,特别是来自基因组病毒科(DNA病毒)和冠状病毒科(RNA病毒)。那就是说,只有有限数量的蝙蝠家族出现了广泛的特点,尽管在不同的采样方法中明显转向了下一代测序方法和用于病毒鉴定的宏基因组学流程.这篇综述旨在全面分析过去五年来全球为识别和表征蝙蝠物种中的新兴病毒所做的努力,并提供这些研究中使用的当前技术和方法的详细概述。
    Zoonotic viruses are widely seen as the primary threat for future pandemics. Bats are the most diverse group of mammals, with more than 1400 species distributed across most habitats on Earth. So far, 31 known virus families were associated with bats, although the understanding of most viruses were insufficient. Continuous efforts to discover, understand and monitor these bats viruses, is thereby an area of public health interest. This systematic review was designed to catalogue publications reporting novel bat virus discoveries within PubMed, SCOPUS, and Web of Science databases, within a 5-year period from 2018 to 2022. Various experimental parameters, including sampling locations, methodology, bat species diversity, similarity to known viruses, species demarcation of new viruses, and genomic sequencing strategies, were extracted from 41 publications and analyzed. In total, 72 novel viruses from 19 virus families were identified between 2018 and 2022, particularly from Genomoviridae (DNA viruses) and Coronaviridae (RNA viruses). That said, only a limited number of bat families featured extensively despite noticeable shift towards next generation sequencing methods and metagenomics pipeline for virus identification across different sampling methods. This review aims to provide a comprehensive analysis of the global efforts made over the past five years to identify and characterize emerging viruses in bat species, and to provide a detailed overview of the current technologies and methodologies used in these studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    福尔马林固定石蜡包埋(FFPE)组织代表了转化癌症研究的宝贵来源。然而,各种下游方法的广泛应用仍然具有挑战性。这里,我们旨在评估使用FFPE乳腺癌(BC)组织的基因组和基因表达分析工作流程的可行性.我们进行了系统的文献综述,以评估FFPE和来自BC患者的新鲜冷冻匹配组织样本之间的一致性,用于DNA和RNA下游应用。比较了三种不同的核酸提取试剂盒对FFPEBC临床样品的分析性能。我们还在同时提取的DNA和RNA上应用了新开发的靶向DNA下一代测序(NGS)370基因面板和nCounterBC360®平台,分别,使用来自II期临床试验的FFPE组织。在3701个初始搜索结果中,系统评价40篇。在各种下游应用平台中观察到高度的一致性。此外,通过靶向DNANGS和基因表达谱分析证明了同时提取DNA/RNA试剂盒的性能.排除低于5%变体等位基因频率的变体对于克服FFPE诱导的伪影是必要的。从FFPE材料中同时提取DNA/RNA的靶向基因组分析是可行的,为其在临床试验/队列中的实施提供见解。
    Formalin-fixed paraffin-embedded (FFPE) tissue represents a valuable source for translational cancer research. However, the widespread application of various downstream methods remains challenging. Here, we aimed to assess the feasibility of a genomic and gene expression analysis workflow using FFPE breast cancer (BC) tissue. We conducted a systematic literature review for the assessment of concordance between FFPE and fresh-frozen matched tissue samples derived from patients with BC for DNA and RNA downstream applications. The analytical performance of three different nucleic acid extraction kits on FFPE BC clinical samples was compared. We also applied a newly developed targeted DNA Next-Generation Sequencing (NGS) 370-gene panel and the nCounter BC360® platform on simultaneously extracted DNA and RNA, respectively, using FFPE tissue from a phase II clinical trial. Of the 3701 initial search results, 40 articles were included in the systematic review. High degree of concordance was observed in various downstream application platforms. Moreover, the performance of simultaneous DNA/RNA extraction kit was demonstrated with targeted DNA NGS and gene expression profiling. Exclusion of variants below 5% variant allele frequency was essential to overcome FFPE-induced artefacts. Targeted genomic analyses were feasible in simultaneously extracted DNA/RNA from FFPE material, providing insights for their implementation in clinical trials/cohorts.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    糖尿病肾病(DKD)是终末期肾病(ESRD)的主要原因,其发病机制尚未明确。目前的研究表明,DKD涉及多种细胞类型和肾外因素,阐明发病机制和确定新的治疗靶点尤为重要。单细胞RNA测序(scRNA-seq)技术是在单细胞水平上对单个细胞的转录组进行高通量测序,这是一种通过比较遗传信息来探索疾病发展的有效技术,反映了细胞之间遗传信息的差异,识别不同的细胞亚群。越来越多的证据支持scRNA-seq在揭示糖尿病发病机制和加强我们对DKD分子机制的理解中的作用。这次我们回顾了scRNA-seq数据。然后,我们分析和讨论了scRNA-seq技术在DKD研究中的应用,包括细胞类型的注释,新细胞类型(或亚型)的鉴定,细胞间通讯的识别,细胞分化轨迹分析,基因表达检测,和分析基因调控网络,最后,我们探讨了scRNA-seq技术在DKD研究中的未来前景。
    Diabetic kidney disease (DKD) is the leading cause of end-stage renal disease (ESRD), and its pathogenesis has not been clarified. Current research suggests that DKD involves multiple cell types and extra-renal factors, and it is particularly important to clarify the pathogenesis and identify new therapeutic targets. Single-cell RNA sequencing (scRNA-seq) technology is high-throughput sequencing of the transcriptomes of individual cells at the single-cell level, which is an effective technology for exploring the development of diseases by comparing genetic information, reflecting the differences in genetic information between cells, and identifying different cell subpopulations. Accumulating evidence supports the role of scRNA-seq in revealing the pathogenesis of diabetes and strengthening our understanding of the molecular mechanisms of DKD. We reviewed the scRNA-seq data this time. Then, we analyzed and discussed the applications of scRNA-seq technology in DKD research, including annotation of cell types, identification of novel cell types (or subtypes), identification of intercellular communication, analysis of cell differentiation trajectories, gene expression detection, and analysis of gene regulatory networks, and lastly, we explored the future perspectives of scRNA-seq technology in DKD research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:循环肿瘤DNA(ctDNA)检测已成为指导晚期非小细胞肺癌(aNSCLC)一线(1L)靶向治疗的有前景的工具。本研究旨在评估基于ctDNA的下一代测序(NGS)对致癌驱动突变的临床有效性(CV)和临床实用性(CU),以通过系统的文献回顾和荟萃分析为aNSCLC的1L治疗决策提供信息。
    方法:在PubMed/MEDLINE和Embase进行了系统的文献检索,以确定报道ctDNA检测CV/CU在aNSCLC患者中的随机对照试验或观察性研究。使用双变量随机效应模型进行Meta分析,以估计合并的敏感性和特异性。对CU研究的无进展/总生存期(PFS/OS)进行了总结。
    结果:总共确定了20项研究:仅17CV,仅限2个CU,和1,13项研究被纳入多基因检测的荟萃分析。任何突变的ctDNA检测的总体敏感性和特异性分别为0.69(95%CI0.63-0.74)和0.99(95%CI0.97-1.00),分别。然而,灵敏度因驱动基因而异,ROS1的0.29(95%CI0.13-0.53)至KRAS的0.77(95%CI0.63-0.86)。两项研究比较了PFS和ctDNA与基于组织的测试,然后进行1L靶向治疗,没有发现显着差异。一项研究报告了ctDNA匹配和组织匹配疗法的OS曲线,但未提供风险比。
    结论:ctDNA检测显示aNSCLC患者的总体诊断准确性可接受,然而,灵敏度因驱动突变而变化很大。需要进一步的研究,特别是对于不常见的驱动突变,为了更好地理解ctDNA检测的CU在指导aNSCLC靶向治疗中的作用。
    OBJECTIVE: Circulating tumor DNA (ctDNA) testing has become a promising tool to guide first-line (1L) targeted treatment for advanced non-small cell lung cancer (aNSCLC). This study aims to estimate the clinical validity (CV) and clinical utility (CU) of ctDNA-based next-generation sequencing (NGS) for oncogenic driver mutations to inform 1L treatment decisions in aNSCLC through a systematic literature review and meta-analysis.
    METHODS: A systematic literature search was conducted in PubMed/MEDLINE and Embase to identify randomized control trials or observational studies reporting CV/CU on ctDNA testing in patients with aNSCLC. Meta-analyses were performed using bivariate random-effects models to estimate pooled sensitivity and specificity. Progression-free/overall survival (PFS/OS) was summarized for CU studies.
    RESULTS: A total of 20 studies were identified: 17 CV only, 2 CU only, and 1 both, and 13 studies were included for the meta-analysis on multi-gene detection. The overall sensitivity and specificity for ctDNA detection of any mutation were 0.69 (95% CI 0.63-0.74) and 0.99 (95% CI 0.97-1.00), respectively. However, sensitivity varied greatly by driver gene, ranging from 0.29 (95% CI 0.13-0.53) for ROS1 to 0.77 (95% CI 0.63-0.86) for KRAS. Two studies that compared PFS with ctDNA versus tissue-based testing followed by 1L targeted therapy found no significant differences. One study reported OS curves on ctDNA-matched and tissue-matched therapies but no hazard ratios were provided.
    CONCLUSIONS: ctDNA testing demonstrated an overall acceptable diagnostic accuracy in patients with aNSCLC, however, sensitivity varied greatly by driver mutation. Further research is needed, especially for uncommon driver mutations, to better understand the CU of ctDNA testing in guiding targeted treatments for aNSCLC.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    胃癌(GC)是一种具有显著表型和遗传变异的庞杂异质性疾病。传统的分类系统主要依赖于临床病理特征和常规生物标志物的评估,并且可能无法捕获个体GC的不同临床过程。组学技术的最新发现,如下一代测序,蛋白质组学和代谢组学为GC中潜在的遗传改变和生物学事件提供了重要的见解。识别GC亚型的聚类策略可能通过开发针对特定亚型的疗法来提供改善GC治疗和临床试验结果的新工具。然而,在临床实践中实施GC分子分类的可行性和治疗意义需要解决。本综述审查了当前的分子分类,描绘了临床相关分子特征的主流景观,分析了它们与传统GC分类的相关性,并讨论了潜在的临床应用。
    Gastric cancer (GC) is a complex and heterogeneous disease with significant phenotypic and genetic variation. Traditional classification systems rely mainly on the evaluation of clinical pathological features and conventional biomarkers and might not capture the diverse clinical processes of individual GCs. The latest discoveries in omics technologies such as next‑generation sequencing, proteomics and metabolomics have provided crucial insights into potential genetic alterations and biological events in GC. Clustering strategies for identifying subtypes of GC might offer new tools for improving GC treatment and clinical trial outcomes by enabling the development of therapies tailored to specific subtypes. However, the feasibility and therapeutic significance of implementing molecular classifications of GC in clinical practice need to addressed. The present review examines the current molecular classifications, delineates the prevailing landscape of clinically relevant molecular features, analyzes their correlations with traditional GC classifications, and discusses potential clinical applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号