Long-read sequencing (LRS)

  • 文章类型: Journal Article
    背景:肌营养不良蛋白病是最常见的X连锁遗传性肌肉疾病,致病基因是DMD.外显子重复是DMD基因中常见的致病变异类型,然而,5\'末端包含外显子1的外显子重复不太常见。当评估DMD基因外显子重复的致病性时,必须考虑它们对阅读框架的影响。传统的分子方法,如多重连接依赖性探针扩增(MLPA)和下一代测序(NGS),常用于诊所。然而,它们不能区分断点的精确物理位置和基因组重排的结构特征。长读取测序(LRS)可以有效地克服这种限制。
    结果:我们使用LRS技术对三个家族进行了全基因组测序,并分析了DMD基因的结构变异,涉及外显子1和/或外显子2的重复。鉴定了DMDDp427m同种型和/或Dp427c同种型中包含外显子1的两种不同变体类型,以前很少报道。在谱系1中,在DMD规范转录本(Dp427m)和Dp427c转录本中外显子1的连续外显子1-2重复变异的男性个体是正常的,表明该变体可能是良性的。在谱系3中,患者携带涉及DMDDp427c转录物的外显子1的复杂SV,显示明显的表型。断点的位置和结构变体(SV)的特征由LRS识别,能够对变种的致病性进行分类。
    结论:我们的研究揭示了包含Dp427c/Dp427m启动子区域的DMD变体的复杂性,并强调了在评估DMD5末端外显子重复的致病性时谨慎解释的重要性。特别是在没有受影响的先证者的承运人筛选方案中。
    BACKGROUND: Dystrophinopathies are the most common X-linked inherited muscle diseases, and the disease-causing gene is DMD. Exonic duplications are a common type of pathogenic variants in the DMD gene, however, 5\' end exonic duplications containing exon 1 are less common. When assessing the pathogenicity of exonic duplications in the DMD gene, consideration must be given to their impact on the reading frame. Traditional molecular methods, such as multiplex ligation-dependent probe amplification (MLPA) and next-generation sequencing (NGS), are commonly used in clinics. However, they cannot discriminate the precise physical locations of breakpoints and structural features of genomic rearrangement. Long-read sequencing (LRS) can effectively overcome this limitation.
    RESULTS: We used LRS technology to perform whole genome sequencing on three families and analyze the structural variations of the DMD gene, which involves the duplications of exon 1 and/or exon 2. Two distinct variant types encompassing exon 1 in the DMD Dp427m isoform and/or Dp427c isoform are identified, which have been infrequently reported previously. In pedigree 1, the male individuals harboring duplication variant of consecutive exons 1-2 in the DMD canonical transcript (Dp427m) and exon 1 in the Dp427c transcript are normal, indicating the variant is likely benign. In pedigree 3, the patient carries complex SVs involving exon 1 of the DMD Dp427c transcript showing an obvious phenotype. The locations of the breakpoints and the characteristics of structural variants (SVs) are identified by LRS, enabling the classification of the variants\' pathogenicity.
    CONCLUSIONS: Our research sheds light on the complexity of DMD variants encompassing Dp427c/Dp427m promoter regions and emphasizes the importance of cautious interpretation when assessing the pathogenicity of DMD 5\' end exonic duplications, particularly in carrier screening scenarios without an affected proband.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    地中海贫血是世界上分布最广泛的单基因疾病之一,影响人数最多。它可以表现出从无症状到致命的广泛表型,这与α-和β-珠蛋白链之间的不平衡程度有关。因此,具有不同基因型的个体可以表现出相似的表型。基因分析总是需要做出正确的诊断。然而,在中国人群中使用的地中海贫血的常规遗传分析仅识别出23种常见变异,导致许多病例未被诊断或被误诊。在这项研究中,我们对30例血液学筛查结果不能用常规基因检测结果解释的受试者应用了基于长读数测序的方法,称为地中海贫血等位基因综合分析(CATSA).另外的变体的鉴定和基因型的校正允许对24名受试者的临床表型进行解释。已被独立实验证实是正确的。此外,我们发现了一个包含整个HBB和HBD基因以及部分HBBP1基因的新型8.4kb缺失,扩大β-地中海贫血的基因型谱。CATSA在诊断由罕见变异引起的地中海贫血方面比其他基因测试具有很大的优势。
    Thalassemia is one of the most widely distributed monogenic disorders in the world and affects the largest number of people. It can manifest a wide spectrum of phenotypes from asymptomatic to fatal, which is associated with the degree of imbalance between α- and β-globin chains. Therefore, individuals with different genotypes could present with a similar phenotype. Genetic analysis is always needed to make a correct diagnosis. However, routine genetic analysis of thalassemia used in the Chinese population identifies only 23 common variants, resulting in many cases undiagnosed or being misdiagnosed. In this study, we applied a long-read sequencing-based approach termed comprehensive analysis of thalassemia alleles (CATSA) to 30 subjects whose hematologic screening results could not be explained by the routine genetic test results. The identification of additional variants and the correction of genotypes allowed the interpretation of the clinical phenotype in 24 subjects, which have been confirmed to be correct by independent experiments. Moreover, we identified a novel 8.4-kb deletion containing the entire HBB and HBD genes as well as part of the HBBP1 gene, expanding the genotype spectrum of β-thalassemia. CATSA showed a great advantage over other genetic tests in the diagnosis of thalassemia caused by rare variants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    先天性肾上腺增生(CAH)是一组常染色体隐性遗传疾病,主要特征是皮质类固醇合成受损。临床表型包括肾上腺皮质功能减退,电解质干扰,性腺发育异常,身材矮小,其中严重的轻度肾上腺皮质和盐浪费可能危及生命。遗传分析有助于CAH的临床诊断。然而,引起21-OHD的基因CYP21A2与高度同源的CYP21A1P假基因串联排列,这使得使用多重连接依赖性探针扩增(MLPA)加Sanger测序或下一代测序(NGS)的传统方法难以确定确切的基因型。我们将基于长读数测序的方法称为CAH综合分析(CACAH),对48例通过临床特征诊断为CAH的新生儿和传统的MLPA加Sanger测序方法进行回顾性分析。评价其在新生儿CAH临床诊断中的疗效。与MLPA加Sanger测序法相比,CACAH在检测位于CAH相关基因的外显子和外显子-内含子边界区域的SNV/indel变体方面表现出100%的一致性。它可以直接确定顺反关系,而无需分析亲本基因型,缩短了诊断时间.此外,CACAH能够区分不同的CYP21A1P/CYP21A2和TNXA/TNXB嵌合体,并检测其他变体(CYP21A2变体c.-121C>T,c.*13G>A,c.*52C>T,c.*440C>T,c.*443T>C,和TNXB变体c.12463+2T>C,c.12204+5G>A)。我们还在两个新生儿中单独鉴定了TNXB变体c.11435_11524+30del,而不是作为TNXA/TNXB-CH-1嵌合体的一部分,这可能是通过基因转换引入的。所有这些特征使临床医生能够更好地解释受试者的表型并更有效地管理它们。与传统的MLPA和Sanger测序方法相比,CACAH具有很大的优势,在新生儿CAH的基因诊断和筛查中显示出巨大的潜力。
    Congenital adrenal hyperplasia (CAH) is a group of autosomal recessive disorders predominantly characterized by impaired corticosteroid synthesis. Clinical phenotypes include hypoadrenocorticism, electrolyte disturbances, abnormal gonadal development, and short stature, of which severe hyponadrenocorticism and salt wasting can be life-threatening. Genetic analysis can help in the clinical diagnosis of CAH. However, the 21-OHD-causing gene CYP21A2 is arranged in tandem with the highly homologous CYP21A1P pseudogene, making it difficult to determine the exact genotypes using the traditional method of multiplex ligation-dependent probe amplification (MLPA) plus Sanger sequencing or next-generation sequencing (NGS). We applied a long-read sequencing-based approach termed comprehensive analysis of CAH (CACAH) to 48 newborns with CAH that were diagnosed by clinical features and the traditional MLPA plus Sanger sequencing method for retrospective analysis, to evaluate its efficacy in the clinical diagnosis of neonatal CAH. Compared with the MLPA plus Sanger sequencing method, CACAH showed 100 % consistency in detecting SNV/indel variants located in exons and exon-intron boundary regions of CAH-related genes. It can directly determine the cis-trans relationship without the need to analyze parental genotypes, which reduces the time to diagnosis. Moreover, CACAH was able to distinguish different CYP21A1P/CYP21A2 and TNXA/TNXB chimeras, and detect additional variants (CYP21A2 variants c.-121C > T, c.*13G > A, c.*52C > T, c.*440C > T, c.*443 T > C, and TNXB variants c.12463 + 2 T > C, c.12204 + 5G > A). We also identified the TNXB variant c.11435_11524 + 30del alone instead of as a part of the TNXA/TNXB-CH-1 chimera in two newborns, which might be introduced by gene conversion. All of these characteristics enabled clinicians to better explain the phenotype of subjects and manage them more effectively. CACAH has a great advantage over the traditional MLPA and Sanger sequencing methods, showing substantial potential in the genetic diagnosis and screening of neonatal CAH.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:发作性共济失调是罕见的神经系统疾病,其特征是反复发作的失衡和协调困难。获得明确的分子诊断带来了挑战,由于临床表现高度异质性,关于潜在遗传学的文献是有限的。虽然高通量测序技术的出现对孟德尔疾病遗传学做出了重大贡献,对具有不确定意义的变体的解释以及个别方法固有的其他局限性仍然使许多患者无法确诊.这项研究旨在研究在一组具有偶发性表现的共济失调的复杂病例中鉴定和验证分子候选物的实用性。方法:根据标准基因检测,招募了8例尽管进行了广泛的临床检查但仍缺乏分子诊断的患者。对从外周血单核细胞分离的样品进行全基因组和RNA测序。表达和剪接数据的整合促进了基因组变体的优先化。随后,长读数测序在这些候选变异体的验证中起着至关重要的作用.结果:全基因组测序发现四个基因(SPG7,ATXN2,ELOVL4,PMPCB)的致病变异。一种错觉和无稽之谈,两者都是以前报道的可能致病的,在个体#1中以反式配置(SPG7:c.2228T>C/p。I743T,c.1861C>T/p。Q621*)。ATXN2微卫星扩增(CAG32)在另一个晚发性病例中。在两个独立的个体中,预测剪接位点附近的内含子变体(ELOVL4:c.541+5G>A;PMPCB:c.1154+5G>C)诱导功能缺失剪接,但从未被报道为致病。长读测序证实了复合杂合变体的构型,重复扩展长度,以及这些致病变异的剪接景观。在ZFYVE26中发现了ATXN2扩增的潜在遗传修饰剂(c.3022C>T/p。R1008*)。结论:尽管通过临床基因检测未能确定致病变异,多组学方法使50%的患者能够进行分子诊断,还为剩余案例中的变体优先级排序提供了有价值的见解。这些发现证明了长读数测序对于在各种情况下验证候选变体的价值。我们的研究证明了利用互补组学技术来解开共济失调等未解决的罕见疾病患者的潜在遗传学的有效性。分子诊断不仅在改善患者护理管理方面具有重要的前景,而且还减轻了诊断错误的负担,更广泛地提高生活质量。
    Background: Episodic ataxias are rare neurological disorders characterized by recurring episodes of imbalance and coordination difficulties. Obtaining definitive molecular diagnoses poses challenges, as clinical presentation is highly heterogeneous, and literature on the underlying genetics is limited. While the advent of high-throughput sequencing technologies has significantly contributed to Mendelian disorders genetics, interpretation of variants of uncertain significance and other limitations inherent to individual methods still leaves many patients undiagnosed. This study aimed to investigate the utility of multi-omics for the identification and validation of molecular candidates in a cohort of complex cases of ataxia with episodic presentation. Methods: Eight patients lacking molecular diagnosis despite extensive clinical examination were recruited following standard genetic testing. Whole genome and RNA sequencing were performed on samples isolated from peripheral blood mononuclear cells. Integration of expression and splicing data facilitated genomic variants prioritization. Subsequently, long-read sequencing played a crucial role in the validation of those candidate variants. Results: Whole genome sequencing uncovered pathogenic variants in four genes (SPG7, ATXN2, ELOVL4, PMPCB). A missense and a nonsense variant, both previously reported as likely pathogenic, configured in trans in individual #1 (SPG7: c.2228T>C/p.I743T, c.1861C>T/p.Q621*). An ATXN2 microsatellite expansion (CAG32) in another late-onset case. In two separate individuals, intronic variants near splice sites (ELOVL4: c.541 + 5G>A; PMPCB: c.1154 + 5G>C) were predicted to induce loss-of-function splicing, but had never been reported as disease-causing. Long-read sequencing confirmed the compound heterozygous variants configuration, repeat expansion length, as well as splicing landscape for those pathogenic variants. A potential genetic modifier of the ATXN2 expansion was discovered in ZFYVE26 (c.3022C>T/p.R1008*). Conclusion: Despite failure to identify pathogenic variants through clinical genetic testing, the multi-omics approach enabled the molecular diagnosis in 50% of patients, also giving valuable insights for variant prioritization in remaining cases. The findings demonstrate the value of long-read sequencing for the validation of candidate variants in various scenarios. Our study demonstrates the effectiveness of leveraging complementary omics technologies to unravel the underlying genetics in patients with unresolved rare diseases such as ataxia. Molecular diagnoses not only hold significant promise in improving patient care management, but also alleviates the burden of diagnostic odysseys, more broadly enhancing quality of life.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Case Reports
    地中海贫血是一种常见的遗传性血红蛋白疾病,由一种或多种珠蛋白亚基缺乏引起。HBB基因的替代变异和缺失是β-地中海贫血的主要原因,其中大片段缺失很少见,并且难以通过常规的基于聚合酶链反应(PCR)的方法检测到。
    在这项研究中,我们报道了一个26岁的汉族男子,发现血常规异常。对先证者及其家人进行了血红蛋白测试,其中只有先证者的母亲有正常参数。地中海贫血等位基因(CATSA,基于长读取测序的方法)来鉴定致病变异。我们最终发现了一个新的10.8kb缺失,包括先证者及其父亲和兄弟的β-珠蛋白(HBB)基因(Chr11:5216601-5227407,GRch38/hg38),这与他们的血红蛋白检测结果一致。通过多重连接依赖性探针扩增(MLPA)和缺口聚合酶链反应(Gap-PCR)以及Sanger测序确认了缺失的拷贝数和确切断点。分别。
    在中国的HBB基因中发现了这种新的大缺失,我们扩展了β-地中海贫血的基因型谱,并显示了长读数测序(LRS)在全面和精确检测地中海贫血变体方面的优势。
    UNASSIGNED: Thalassemia is a common inherited hemoglobin disorder caused by a deficiency of one or more globin subunits. Substitution variants and deletions in the HBB gene are the major causes of β-thalassemia, of which large fragment deletions are rare and difficult to be detected by conventional polymerase chain reaction (PCR)-based methods.
    UNASSIGNED: In this study, we reported a 26-year-old Han Chinese man, whose routine blood parameters were found to be abnormal. Hemoglobin testing was performed on the proband and his family members, of whom only the proband\'s mother had normal parameters. The comprehensive analysis of thalassemia alleles (CATSA, a long-read sequencing-based approach) was performed to identify the causative variants. We finally found a novel 10.8-kb deletion including the β-globin (HBB) gene (Chr11:5216601-5227407, GRch38/hg38) of the proband and his father and brother, which were consistent with their hemoglobin testing results. The copy number and exact breakpoints of the deletion were confirmed by multiplex ligation-dependent probe amplification (MLPA) and gap-polymerase chain reaction (Gap-PCR) as well as Sanger sequencing, respectively.
    UNASSIGNED: With this novel large deletion found in the HBB gene in China, we expand the genotype spectrum of β-thalassemia and show the advantages of long-read sequencing (LRS) for comprehensive and precise detection of thalassemia variants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    长读测序(LRS)技术为探索基因组提供了非常强大的工具。虽然在早期,这些方法受到技术限制,他们最近在阅读长度方面取得了重大进展,吞吐量,和准确性和生物信息学工具有了很大的提高。这里,我们的目标是回顾LRS技术的现状,新方法的发展,以及对基因组学研究的影响。我们将探索这些技术带来的最有影响力的最新发现,重点是基因组和转录组的高分辨率测序以及DNA和RNA修饰的直接检测。我们还将讨论LRS方法如何更全面地了解人类遗传变异,转录组学,和未来几年的表观遗传学。
    Long-read sequencing (LRS) technologies have provided extremely powerful tools to explore genomes. While in the early years these methods suffered technical limitations, they have recently made significant progress in terms of read length, throughput, and accuracy and bioinformatics tools have strongly improved. Here, we aim to review the current status of LRS technologies, the development of novel methods, and the impact on genomics research. We will explore the most impactful recent findings made possible by these technologies focusing on high-resolution sequencing of genomes and transcriptomes and the direct detection of DNA and RNA modifications. We will also discuss how LRS methods promise a more comprehensive understanding of human genetic variation, transcriptomics, and epigenetics for the coming years.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    长读测序(LRS)可以解析重复性区域,短读取(SR)数据的限制。成本和仪器尺寸的降低导致诊断和研究中LRS的稳定增长。这里,我们对2018年至2021年之间测序的FAST5数据进行了基础分析,并分析了跨越广泛GC含量(25-67%)的大型数据集(n=200)中与gDNA相关的数据.我们检查了重新建立基础的数据是否会改善混合组件,and,对于一个较小的队列,在抗菌素耐药性(AMR)基因和可移动遗传元件的背景下,比较了长读(LR)组装。在比较SR和LR仪器时,我们包括了成本分析。我们比较了R9和R10的化学性质,并报道了R9流动池不仅具有更大的产量,而且提高了读取质量。LR组件中ARG的存在/不存在和/或变体检测通常存在差异。基于Flye的组装体通常在检测染色体和质粒上ARG的存在方面是有效的。Raven的表现更快,但恢复的小质粒不一致,特别是携带blaKPC的15kbCol样质粒。卡努议会是最分散的,基因组大小大于预期。LR程序集无法一致地确定Unicycler参考所标识的同一ARG的多个副本。即使对ONT化学和碱基判定进行了改进,长时间读取的程序集可能导致对数据的误解。如果当前依赖LR数据,有必要执行多个组件,虽然这是资源(计算)密集型,但尚不容易获得/使用。
    Long-read sequencing (LRS) can resolve repetitive regions, a limitation of short read (SR) data. Reduced cost and instrument size has led to a steady increase in LRS across diagnostics and research. Here, we re-basecalled FAST5 data sequenced between 2018 and 2021 and analyzed the data in relation to gDNA across a large dataset (n = 200) spanning a wide GC content (25-67%). We examined whether re-basecalled data would improve the hybrid assembly, and, for a smaller cohort, compared long read (LR) assemblies in the context of antimicrobial resistance (AMR) genes and mobile genetic elements. We included a cost analysis when comparing SR and LR instruments. We compared the R9 and R10 chemistries and reported not only a larger yield but increased read quality with R9 flow cells. There were often discrepancies with ARG presence/absence and/or variant detection in LR assemblies. Flye-based assemblies were generally efficient at detecting the presence of ARG on both the chromosome and plasmids. Raven performed more quickly but inconsistently recovered small plasmids, notably a ∼15-kb Col-like plasmid harboring bla KPC . Canu assemblies were the most fragmented, with genome sizes larger than expected. LR assemblies failed to consistently determine multiple copies of the same ARG as identified by the Unicycler reference. Even with improvements to ONT chemistry and basecalling, long-read assemblies can lead to misinterpretation of data. If LR data are currently being relied upon, it is necessary to perform multiple assemblies, although this is resource (computing) intensive and not yet readily available/useable.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号