Oxford Nanopore technology (ONT)

牛津纳米孔技术 (ONT)
  • 文章类型: Journal Article
    背景和目的:亲生父母的某些染色体结构变异(SV)可导致复发性自然流产(RSAs)。减数分裂过程中的不等交叉会导致配子染色体的不平衡重排,例如复制或缺失。不幸的是,常规技术,如核型分析,荧光原位杂交(FISH),染色体微阵列分析(CMA),拷贝数变异测序(CNV-seq)不能检测所有类型的SV。在这项研究中,我们证明光学基因组作图(OGM)能够以高分辨率快速准确地检测RSA患者的SVs,并在基因水平上提供有关断点区域的更多信息.方法:招募7对因流产胚胎染色体重排不平衡而遭受RSA的夫妇,并从其外周血中分离出超高分子量(UHMW)DNA。在BionanoSolve数据分析软件上通过从头组装创建共识基因组图谱。通过参考基因组GRCh38/hg38的比对鉴定SV和断点。使用牛津纳米孔测序或Sanger测序验证确切的断点序列。结果:OGM成功检测到招募夫妇中的各种SV。此外,发现了其他复杂的染色体重排(CCR)和四个隐蔽的平衡相互易位(BRT),进一步完善RSA的潜在遗传原因。在这项研究中发现了两个被破坏的基因,FOXK2[46,XY,t(7;17)(q31.3;q25)]和PLXDC2[46,XX,t(10;16)(p12.31;q23.1)],先前已证明与男性生育力和胚胎运输有关。结论:OGM能准确检测染色体SVs,尤其是隐秘的BRT和CCR。它是人类常规基因诊断的有用补充,例如核型分析,并且比常规遗传诊断更准确地检测隐匿性BRT和CCR。
    Background and aims: Certain chromosomal structural variations (SVs) in biological parents can lead to recurrent spontaneous abortions (RSAs). Unequal crossing over during meiosis can result in the unbalanced rearrangement of gamete chromosomes such as duplication or deletion. Unfortunately, routine techniques such as karyotyping, fluorescence in situ hybridization (FISH), chromosomal microarray analysis (CMA), and copy number variation sequencing (CNV-seq) cannot detect all types of SVs. In this study, we show that optical genome mapping (OGM) quickly and accurately detects SVs for RSA patients with a high resolution and provides more information about the breakpoint regions at gene level. Methods: Seven couples who had suffered RSA with unbalanced chromosomal rearrangements of aborted embryos were recruited, and ultra-high molecular weight (UHMW) DNA was isolated from their peripheral blood. The consensus genome map was created by de novo assembly on the Bionano Solve data analysis software. SVs and breakpoints were identified via alignments of the reference genome GRCh38/hg38. The exact breakpoint sequences were verified using either Oxford Nanopore sequencing or Sanger sequencing. Results: Various SVs in the recruited couples were successfully detected by OGM. Also, additional complex chromosomal rearrangement (CCRs) and four cryptic balanced reciprocal translocations (BRTs) were revealed, further refining the underlying genetic causes of RSA. Two of the disrupted genes identified in this study, FOXK2 [46,XY,t(7; 17)(q31.3; q25)] and PLXDC2 [46,XX,t(10; 16)(p12.31; q23.1)], had been previously shown to be associated with male fertility and embryo transit. Conclusion: OGM accurately detects chromosomal SVs, especially cryptic BRTs and CCRs. It is a useful complement to routine human genetic diagnostics, such as karyotyping, and detects cryptic BRTs and CCRs more accurately than routine genetic diagnostics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    石杉树是一种传统的中草药植物,近年来因其产品HupA可用作乙酰胆碱酯酶抑制剂(AChEI)而受到特别关注。尽管已经研究了H.crispata的叶绿体(cp)基因组,由于先前报道的H.squarrosa已被修订为Phlegmariurussquarrosus,因此没有关于Humperzia线粒体(mt)基因组的报道。使用长读数纳米孔和Illumina测序平台的组合对H.crispata的mt基因组进行测序。整个H.crispatamt基因组被组装成一个长度为412,594bp的圆形,共有91个基因,包括45个tRNA,6个rRNAs,37个蛋白质编码基因(PCGs),还有3个伪基因.值得注意的是,rps8基因存在于squarrosus中,而假基因rps8存在于刺梨中,这是大多数蕨类植物和裸子植物所缺乏的。内含子编码的成熟酶(mat-atp9i85和mat-cobi787)基因存在于沙棘和沙棘中。但是在其他检查过的鱼黄植物中丢失了,蕨类植物,和裸子植物。共线性分析表明,与其他蕨类植物相比,鳞片和四叶草的mt基因组高度保守。相对同义密码子使用(RSCU)分析表明,最常见的氨基酸是苯丙氨酸(Phe)(4.77%),异亮氨酸(Ile)(4.71%),赖氨酸(Lys)(4.26%),而精氨酸(Arg)(0.32%),很少发现组氨酸(His)(0.42%)。简单序列重复(SSR)分析显示,在条状芽孢杆菌的mt基因组中总共鉴定出114个SSR,占整个mt基因组的0.35%。单体重复是SSR的大多数类型,占SSR总数的91.89%。此外,总共有1948次穿插重复(158次向前,147回文,在H.crispatamt基因组中鉴定出长度从30bp到14,945bp的5个反向重复序列),并且30-39bp的重复序列是最丰富的类型。基因转移分析表明,在条件成熟的cp和mt基因组之间共发现了12个同源片段。占总cp和mt基因组的0.93%和2.48%,分别。系统进化树显示,柳树是柳树的姐妹。Ka/Ks分析结果表明,大多数PCG,除了atp6基因,在进化过程中进行纯化选择。我们的研究提供了关于H.crispatamt基因组特征的广泛信息,并将有助于解开进化关系,以及黄质植物中的分子鉴定。
    Huperzia crispata is a traditional Chinese herb plant and has attracted special attention in recent years for its products Hup A can serve as an acetylcholinesterase inhibitor (AChEI). Although the chloroplast (cp) genome of H. crispata has been studied, there are no reports regarding the Huperzia mitochondrial (mt) genome since the previously reported H. squarrosa has been revised as Phlegmariurus squarrosus. The mt genome of H. crispata was sequenced using a combination of long-read nanopore and Illumina sequencing platforms. The entire H. crispata mt genome was assembled in a circular with a length of 412,594 bp and a total of 91 genes, including 45 tRNAs, 6 rRNAs, 37 protein-coding genes (PCGs), and 3 pseudogenes. Notably, the rps8 gene was present in P. squarrosus and a pseudogene rps8 was presented in H. crispata, which was lacking in most of Pteridophyta and Gymnospermae. Intron-encoded maturase (mat-atp9i85 and mat-cobi787) genes were present in H. crispata and P. squarrosus, but lost in other examined lycophytes, ferns, and Gymnospermae plants. Collinearity analysis showed that the mt genome of H. crispata and P. squarrossus is highly conservative compared to other ferns. Relative synonymous codon usage (RSCU) analysis showed that the amino acids most frequently found were phenylalanine (Phe) (4.77%), isoleucine (Ile) (4.71%), lysine (Lys) (4.26%), while arginine (Arg) (0.32%), and histidine (His) (0.42%) were rarely found. Simple sequence repeats (SSR) analysis revealed that a total of 114 SSRs were identified in the mt genome of H. crispata and account for 0.35% of the whole mt genome. Monomer repeats were the majority types of SSRs and represent 91.89% of the total SSRs. In addition, a total of 1948 interspersed repeats (158 forward, 147 palindromic, and 5 reverse repeats) with a length ranging from 30 bp to 14,945 bp were identified in the H. crispata mt genome and the 30-39-bp repeats were the most abundant type. Gene transfer analysis indicated that a total of 12 homologous fragments were discovered between the cp and mt genomes of H. crispata, accounting for 0.93% and 2.48% of the total cp and mt genomes, respectively. The phylogenetic trees revealed that H. crispata was the sister of P. squarrosus. The Ka/Ks analysis results suggested that most PCGs, except atp6 gene, were subject to purification selection during evolution. Our study provides extensive information on the features of the H. crispata mt genome and will help unravel evolutionary relationships, and molecular identification within lycophytes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    长读测序(LRS)可以解析重复性区域,短读取(SR)数据的限制。成本和仪器尺寸的降低导致诊断和研究中LRS的稳定增长。这里,我们对2018年至2021年之间测序的FAST5数据进行了基础分析,并分析了跨越广泛GC含量(25-67%)的大型数据集(n=200)中与gDNA相关的数据.我们检查了重新建立基础的数据是否会改善混合组件,and,对于一个较小的队列,在抗菌素耐药性(AMR)基因和可移动遗传元件的背景下,比较了长读(LR)组装。在比较SR和LR仪器时,我们包括了成本分析。我们比较了R9和R10的化学性质,并报道了R9流动池不仅具有更大的产量,而且提高了读取质量。LR组件中ARG的存在/不存在和/或变体检测通常存在差异。基于Flye的组装体通常在检测染色体和质粒上ARG的存在方面是有效的。Raven的表现更快,但恢复的小质粒不一致,特别是携带blaKPC的15kbCol样质粒。卡努议会是最分散的,基因组大小大于预期。LR程序集无法一致地确定Unicycler参考所标识的同一ARG的多个副本。即使对ONT化学和碱基判定进行了改进,长时间读取的程序集可能导致对数据的误解。如果当前依赖LR数据,有必要执行多个组件,虽然这是资源(计算)密集型,但尚不容易获得/使用。
    Long-read sequencing (LRS) can resolve repetitive regions, a limitation of short read (SR) data. Reduced cost and instrument size has led to a steady increase in LRS across diagnostics and research. Here, we re-basecalled FAST5 data sequenced between 2018 and 2021 and analyzed the data in relation to gDNA across a large dataset (n = 200) spanning a wide GC content (25-67%). We examined whether re-basecalled data would improve the hybrid assembly, and, for a smaller cohort, compared long read (LR) assemblies in the context of antimicrobial resistance (AMR) genes and mobile genetic elements. We included a cost analysis when comparing SR and LR instruments. We compared the R9 and R10 chemistries and reported not only a larger yield but increased read quality with R9 flow cells. There were often discrepancies with ARG presence/absence and/or variant detection in LR assemblies. Flye-based assemblies were generally efficient at detecting the presence of ARG on both the chromosome and plasmids. Raven performed more quickly but inconsistently recovered small plasmids, notably a ∼15-kb Col-like plasmid harboring bla KPC . Canu assemblies were the most fragmented, with genome sizes larger than expected. LR assemblies failed to consistently determine multiple copies of the same ARG as identified by the Unicycler reference. Even with improvements to ONT chemistry and basecalling, long-read assemblies can lead to misinterpretation of data. If LR data are currently being relied upon, it is necessary to perform multiple assemblies, although this is resource (computing) intensive and not yet readily available/useable.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:meleagridis是一种原生动物寄生虫,是组织造口病的病原体,一种重要的家禽疾病,其重要性因缺乏任何治疗和预防而得到强调。最近成功的寄生虫体外衰减引发了有关潜在机制的问题。
    结果:使用OxfordNanoporeTechnology(ONT)和Illumina平台招募了来自H.meleagridis相同亲本谱系的强毒和减毒菌株的全基因组序列数据。将其组合以生成具有高碱基精度的兆碱基大小的重叠群。检查基因组的差异鉴定了减毒菌株的编码序列内的两个实质性缺失。此外,一个单核苷酸多态性(SNP)和indel靶向编码序列导致过早终止密码子的形成,这导致减毒菌株中两个基因的截短。此外,H.meleagridis的基因组用于表征寄生虫的临床相关蛋白质类别。与阴道毛滴虫基因组的比较分析,毛滴虫胎儿和溶组织内阿米巴在H.meleagridis谱系中鉴定出〜2700谱系特异性基因丢失和9个基因家族扩增。
    结论:作为一个整体,获得的数据为了解H.meleagridis减毒的分子基础提供了初步线索,并为未来对这一重要家禽病原体的研究构成了基因组学平台。
    BACKGROUND: Histomonas meleagridis is a protozoan parasite and the causative agent of histomonosis, an important poultry disease whose significance is underlined by the absence of any treatment and prophylaxis. The recent successful in vitro attenuation of the parasite urges questions about the underlying mechanisms.
    RESULTS: Whole genome sequence data from a virulent and an attenuated strain originating from the same parental lineage of H. meleagridis were recruited using Oxford Nanopore Technology (ONT) and Illumina platforms, which were combined to generate megabase-sized contigs with high base-level accuracy. Inspecting the genomes for differences identified two substantial deletions within a coding sequence of the attenuated strain. Additionally, one single nucleotide polymorphism (SNP) and indel targeting coding sequences caused the formation of premature stop codons, which resulted in the truncation of two genes in the attenuated strain. Furthermore, the genome of H. meleagridis was used for characterizing protein classes of clinical relevance for parasitic protists. The comparative analysis with the genomes of Trichomonas vaginalis, Tritrichomonas foetus and Entamoeba histolytica identified ~ 2700 lineage-specific gene losses and 9 gene family expansions in the H. meleagridis lineage.
    CONCLUSIONS: Taken as a whole, the obtained data provide the first hints to understand the molecular basis of attenuation in H. meleagridis and constitute a genomics platform for future research on this important poultry pathogen.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    长读测序的出现提供了一种新的评估方法,可以检测许多罕见遗传疾病中的基因组结构变异(SV)。对于自闭症谱系障碍(ASD)病例,在染色体上的蛋白质编码基因区域中未能发现致病变异,我们提出了一个可扩展的工作流程来表征影响基因组非编码元件的SVs风险因子.我们使用长和短阅读测序技术对有三个患有ASD的孩子的阿联酋家庭进行了全基因组测序。建立了一系列分析管道以鉴定具有高灵敏度和特异性的一组SV。在15倍的覆盖率下,我们观察到,与使用短读取技术检测到的变异体(509个变异体)相比,长读取测序技术(987个变异体)检测到的SV数量显著更高(p值<1.1020×10-57).进一步的比较显示97.9%的长读取测序变体跨越1-100kb大小范围内(p值<9.080×10-67)并影响超过5000个基因。此外,长读段变异体检测到604个非编码RNA(p值<9.02×10-9),包含58%的microRNA,31.9%lncRNA,和9.1%的snoRNA。即使覆盖率很低,长读测序已被证明是检测影响基因组复杂元件的SV的可靠技术。
    The advent of long-read sequencing offers a new assessment method of detecting genomic structural variation (SV) in numerous rare genetic diseases. For autism spectrum disorders (ASD) cases where pathogenic variants fail to be found in the protein-coding genic regions along chromosomes, we proposed a scalable workflow to characterize the risk factor of SVs impacting non-coding elements of the genome. We applied whole-genome sequencing on an Emirati family having three children with ASD using long and short-read sequencing technology. A series of analytical pipelines were established to identify a set of SVs with high sensitivity and specificity. At 15-fold coverage, we observed that long-read sequencing technology (987 variants) detected a significantly higher number of SVs when compared to variants detected using short-read technology (509 variants) (p-value < 1.1020 × 10-57). Further comparison showed 97.9% of long-read sequencing variants were spanning within the 1-100 kb size range (p-value < 9.080 × 10-67) and impacting over 5000 genes. Moreover, long-read variants detected 604 non-coding RNAs (p-value < 9.02 × 10-9), comprising 58% microRNA, 31.9% lncRNA, and 9.1% snoRNA. Even at low coverage, long-read sequencing has shown to be a reliable technology in detecting SVs impacting complex elements of the genome.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号