Long-read sequencing (LRS)

  • 文章类型: Journal Article
    背景:肌营养不良蛋白病是最常见的X连锁遗传性肌肉疾病,致病基因是DMD.外显子重复是DMD基因中常见的致病变异类型,然而,5\'末端包含外显子1的外显子重复不太常见。当评估DMD基因外显子重复的致病性时,必须考虑它们对阅读框架的影响。传统的分子方法,如多重连接依赖性探针扩增(MLPA)和下一代测序(NGS),常用于诊所。然而,它们不能区分断点的精确物理位置和基因组重排的结构特征。长读取测序(LRS)可以有效地克服这种限制。
    结果:我们使用LRS技术对三个家族进行了全基因组测序,并分析了DMD基因的结构变异,涉及外显子1和/或外显子2的重复。鉴定了DMDDp427m同种型和/或Dp427c同种型中包含外显子1的两种不同变体类型,以前很少报道。在谱系1中,在DMD规范转录本(Dp427m)和Dp427c转录本中外显子1的连续外显子1-2重复变异的男性个体是正常的,表明该变体可能是良性的。在谱系3中,患者携带涉及DMDDp427c转录物的外显子1的复杂SV,显示明显的表型。断点的位置和结构变体(SV)的特征由LRS识别,能够对变种的致病性进行分类。
    结论:我们的研究揭示了包含Dp427c/Dp427m启动子区域的DMD变体的复杂性,并强调了在评估DMD5末端外显子重复的致病性时谨慎解释的重要性。特别是在没有受影响的先证者的承运人筛选方案中。
    BACKGROUND: Dystrophinopathies are the most common X-linked inherited muscle diseases, and the disease-causing gene is DMD. Exonic duplications are a common type of pathogenic variants in the DMD gene, however, 5\' end exonic duplications containing exon 1 are less common. When assessing the pathogenicity of exonic duplications in the DMD gene, consideration must be given to their impact on the reading frame. Traditional molecular methods, such as multiplex ligation-dependent probe amplification (MLPA) and next-generation sequencing (NGS), are commonly used in clinics. However, they cannot discriminate the precise physical locations of breakpoints and structural features of genomic rearrangement. Long-read sequencing (LRS) can effectively overcome this limitation.
    RESULTS: We used LRS technology to perform whole genome sequencing on three families and analyze the structural variations of the DMD gene, which involves the duplications of exon 1 and/or exon 2. Two distinct variant types encompassing exon 1 in the DMD Dp427m isoform and/or Dp427c isoform are identified, which have been infrequently reported previously. In pedigree 1, the male individuals harboring duplication variant of consecutive exons 1-2 in the DMD canonical transcript (Dp427m) and exon 1 in the Dp427c transcript are normal, indicating the variant is likely benign. In pedigree 3, the patient carries complex SVs involving exon 1 of the DMD Dp427c transcript showing an obvious phenotype. The locations of the breakpoints and the characteristics of structural variants (SVs) are identified by LRS, enabling the classification of the variants\' pathogenicity.
    CONCLUSIONS: Our research sheds light on the complexity of DMD variants encompassing Dp427c/Dp427m promoter regions and emphasizes the importance of cautious interpretation when assessing the pathogenicity of DMD 5\' end exonic duplications, particularly in carrier screening scenarios without an affected proband.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:发作性共济失调是罕见的神经系统疾病,其特征是反复发作的失衡和协调困难。获得明确的分子诊断带来了挑战,由于临床表现高度异质性,关于潜在遗传学的文献是有限的。虽然高通量测序技术的出现对孟德尔疾病遗传学做出了重大贡献,对具有不确定意义的变体的解释以及个别方法固有的其他局限性仍然使许多患者无法确诊.这项研究旨在研究在一组具有偶发性表现的共济失调的复杂病例中鉴定和验证分子候选物的实用性。方法:根据标准基因检测,招募了8例尽管进行了广泛的临床检查但仍缺乏分子诊断的患者。对从外周血单核细胞分离的样品进行全基因组和RNA测序。表达和剪接数据的整合促进了基因组变体的优先化。随后,长读数测序在这些候选变异体的验证中起着至关重要的作用.结果:全基因组测序发现四个基因(SPG7,ATXN2,ELOVL4,PMPCB)的致病变异。一种错觉和无稽之谈,两者都是以前报道的可能致病的,在个体#1中以反式配置(SPG7:c.2228T>C/p。I743T,c.1861C>T/p。Q621*)。ATXN2微卫星扩增(CAG32)在另一个晚发性病例中。在两个独立的个体中,预测剪接位点附近的内含子变体(ELOVL4:c.541+5G>A;PMPCB:c.1154+5G>C)诱导功能缺失剪接,但从未被报道为致病。长读测序证实了复合杂合变体的构型,重复扩展长度,以及这些致病变异的剪接景观。在ZFYVE26中发现了ATXN2扩增的潜在遗传修饰剂(c.3022C>T/p。R1008*)。结论:尽管通过临床基因检测未能确定致病变异,多组学方法使50%的患者能够进行分子诊断,还为剩余案例中的变体优先级排序提供了有价值的见解。这些发现证明了长读数测序对于在各种情况下验证候选变体的价值。我们的研究证明了利用互补组学技术来解开共济失调等未解决的罕见疾病患者的潜在遗传学的有效性。分子诊断不仅在改善患者护理管理方面具有重要的前景,而且还减轻了诊断错误的负担,更广泛地提高生活质量。
    Background: Episodic ataxias are rare neurological disorders characterized by recurring episodes of imbalance and coordination difficulties. Obtaining definitive molecular diagnoses poses challenges, as clinical presentation is highly heterogeneous, and literature on the underlying genetics is limited. While the advent of high-throughput sequencing technologies has significantly contributed to Mendelian disorders genetics, interpretation of variants of uncertain significance and other limitations inherent to individual methods still leaves many patients undiagnosed. This study aimed to investigate the utility of multi-omics for the identification and validation of molecular candidates in a cohort of complex cases of ataxia with episodic presentation. Methods: Eight patients lacking molecular diagnosis despite extensive clinical examination were recruited following standard genetic testing. Whole genome and RNA sequencing were performed on samples isolated from peripheral blood mononuclear cells. Integration of expression and splicing data facilitated genomic variants prioritization. Subsequently, long-read sequencing played a crucial role in the validation of those candidate variants. Results: Whole genome sequencing uncovered pathogenic variants in four genes (SPG7, ATXN2, ELOVL4, PMPCB). A missense and a nonsense variant, both previously reported as likely pathogenic, configured in trans in individual #1 (SPG7: c.2228T>C/p.I743T, c.1861C>T/p.Q621*). An ATXN2 microsatellite expansion (CAG32) in another late-onset case. In two separate individuals, intronic variants near splice sites (ELOVL4: c.541 + 5G>A; PMPCB: c.1154 + 5G>C) were predicted to induce loss-of-function splicing, but had never been reported as disease-causing. Long-read sequencing confirmed the compound heterozygous variants configuration, repeat expansion length, as well as splicing landscape for those pathogenic variants. A potential genetic modifier of the ATXN2 expansion was discovered in ZFYVE26 (c.3022C>T/p.R1008*). Conclusion: Despite failure to identify pathogenic variants through clinical genetic testing, the multi-omics approach enabled the molecular diagnosis in 50% of patients, also giving valuable insights for variant prioritization in remaining cases. The findings demonstrate the value of long-read sequencing for the validation of candidate variants in various scenarios. Our study demonstrates the effectiveness of leveraging complementary omics technologies to unravel the underlying genetics in patients with unresolved rare diseases such as ataxia. Molecular diagnoses not only hold significant promise in improving patient care management, but also alleviates the burden of diagnostic odysseys, more broadly enhancing quality of life.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Case Reports
    地中海贫血是一种常见的遗传性血红蛋白疾病,由一种或多种珠蛋白亚基缺乏引起。HBB基因的替代变异和缺失是β-地中海贫血的主要原因,其中大片段缺失很少见,并且难以通过常规的基于聚合酶链反应(PCR)的方法检测到。
    在这项研究中,我们报道了一个26岁的汉族男子,发现血常规异常。对先证者及其家人进行了血红蛋白测试,其中只有先证者的母亲有正常参数。地中海贫血等位基因(CATSA,基于长读取测序的方法)来鉴定致病变异。我们最终发现了一个新的10.8kb缺失,包括先证者及其父亲和兄弟的β-珠蛋白(HBB)基因(Chr11:5216601-5227407,GRch38/hg38),这与他们的血红蛋白检测结果一致。通过多重连接依赖性探针扩增(MLPA)和缺口聚合酶链反应(Gap-PCR)以及Sanger测序确认了缺失的拷贝数和确切断点。分别。
    在中国的HBB基因中发现了这种新的大缺失,我们扩展了β-地中海贫血的基因型谱,并显示了长读数测序(LRS)在全面和精确检测地中海贫血变体方面的优势。
    UNASSIGNED: Thalassemia is a common inherited hemoglobin disorder caused by a deficiency of one or more globin subunits. Substitution variants and deletions in the HBB gene are the major causes of β-thalassemia, of which large fragment deletions are rare and difficult to be detected by conventional polymerase chain reaction (PCR)-based methods.
    UNASSIGNED: In this study, we reported a 26-year-old Han Chinese man, whose routine blood parameters were found to be abnormal. Hemoglobin testing was performed on the proband and his family members, of whom only the proband\'s mother had normal parameters. The comprehensive analysis of thalassemia alleles (CATSA, a long-read sequencing-based approach) was performed to identify the causative variants. We finally found a novel 10.8-kb deletion including the β-globin (HBB) gene (Chr11:5216601-5227407, GRch38/hg38) of the proband and his father and brother, which were consistent with their hemoglobin testing results. The copy number and exact breakpoints of the deletion were confirmed by multiplex ligation-dependent probe amplification (MLPA) and gap-polymerase chain reaction (Gap-PCR) as well as Sanger sequencing, respectively.
    UNASSIGNED: With this novel large deletion found in the HBB gene in China, we expand the genotype spectrum of β-thalassemia and show the advantages of long-read sequencing (LRS) for comprehensive and precise detection of thalassemia variants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    长读测序(LRS)可以解析重复性区域,短读取(SR)数据的限制。成本和仪器尺寸的降低导致诊断和研究中LRS的稳定增长。这里,我们对2018年至2021年之间测序的FAST5数据进行了基础分析,并分析了跨越广泛GC含量(25-67%)的大型数据集(n=200)中与gDNA相关的数据.我们检查了重新建立基础的数据是否会改善混合组件,and,对于一个较小的队列,在抗菌素耐药性(AMR)基因和可移动遗传元件的背景下,比较了长读(LR)组装。在比较SR和LR仪器时,我们包括了成本分析。我们比较了R9和R10的化学性质,并报道了R9流动池不仅具有更大的产量,而且提高了读取质量。LR组件中ARG的存在/不存在和/或变体检测通常存在差异。基于Flye的组装体通常在检测染色体和质粒上ARG的存在方面是有效的。Raven的表现更快,但恢复的小质粒不一致,特别是携带blaKPC的15kbCol样质粒。卡努议会是最分散的,基因组大小大于预期。LR程序集无法一致地确定Unicycler参考所标识的同一ARG的多个副本。即使对ONT化学和碱基判定进行了改进,长时间读取的程序集可能导致对数据的误解。如果当前依赖LR数据,有必要执行多个组件,虽然这是资源(计算)密集型,但尚不容易获得/使用。
    Long-read sequencing (LRS) can resolve repetitive regions, a limitation of short read (SR) data. Reduced cost and instrument size has led to a steady increase in LRS across diagnostics and research. Here, we re-basecalled FAST5 data sequenced between 2018 and 2021 and analyzed the data in relation to gDNA across a large dataset (n = 200) spanning a wide GC content (25-67%). We examined whether re-basecalled data would improve the hybrid assembly, and, for a smaller cohort, compared long read (LR) assemblies in the context of antimicrobial resistance (AMR) genes and mobile genetic elements. We included a cost analysis when comparing SR and LR instruments. We compared the R9 and R10 chemistries and reported not only a larger yield but increased read quality with R9 flow cells. There were often discrepancies with ARG presence/absence and/or variant detection in LR assemblies. Flye-based assemblies were generally efficient at detecting the presence of ARG on both the chromosome and plasmids. Raven performed more quickly but inconsistently recovered small plasmids, notably a ∼15-kb Col-like plasmid harboring bla KPC . Canu assemblies were the most fragmented, with genome sizes larger than expected. LR assemblies failed to consistently determine multiple copies of the same ARG as identified by the Unicycler reference. Even with improvements to ONT chemistry and basecalling, long-read assemblies can lead to misinterpretation of data. If LR data are currently being relied upon, it is necessary to perform multiple assemblies, although this is resource (computing) intensive and not yet readily available/useable.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号