Oxford nanopore

牛津纳米孔
  • 文章类型: Journal Article
    禽致病性大肠杆菌(APEC)引起禽大肠杆菌病,准确区分感染性分离株对于控制其传播至关重要。多位点序列分型(MLST)是一种准确、高效的流行病学监测毒株鉴定方法。这项研究旨在开发一种快速和高通量的工作流程,使用牛津纳米孔技术(ONT)平台对多个大肠杆菌分离株的Achtman分型方案的7个管家基因进行测序,以进行大规模的APEC研究。大肠杆菌菌株从家禽养殖场中分离出来,管家基因被扩增,和扩增子在R9.4MinION流动池上使用NanoporeGridION测序仪(ONT,牛津,英国)遵循初始工作流程(ONT-MLST)。此外,通过在ONT-MLST工作流程中引入大规模DNA提取和多重PCR来修改工作流程,并将其应用于242个新分离株,从以前的工作流程中分离出18个,和5个ATCC参考菌株在NanoporeMinIONMk1C测序仪上使用Flongle流动池(ONT,牛津,英国)。最后,报告并分析了从受感染的鸡和家禽养殖场环境中收集的308个分离株的序列类型(ST)结果.数据表明,属于ST159、ST8578和ST355的大肠杆菌具有感染肉鸡的多个器官的潜力。此外,人畜共患STs,从家禽养殖场检测到ST69、ST10、ST38和ST131。凭借ONT高吞吐量的优势,这项研究为大规模大肠杆菌分型提供了一个快速的工作流程,并确定了与家禽中APEC感染相关的频繁分离的序列类型。
    Avian pathogenic Escherichia coli (APEC) cause avian colibacillosis and accurately distinguishing infectious isolates is critical for controlling its transmission. Multilocus sequence typing (MLST) is an accurate and efficient strain identification method for epidemiological surveillance. This research aimed to develop a fast and high-throughput workflow that simultaneously sequences the Achtman typing scheme\'s 7 housekeeping genes of multiple E. coli isolates using the Oxford Nanopore Technologies (ONT) platform for large-scale APEC study. E. coli strains were isolated from poultry farms, the housekeeping genes were amplified, and amplicons were sequenced on an R9.4 MinION flow cell using the Nanopore GridION sequencer (ONT, Oxford, UK) following the initial workflow (ONT-MLST). Moreover, the workflow was revised by introducing large-scale DNA extraction and multiplex PCR into the ONT-MLST workflow and applied to 242 new isolates, 18 isolates from the previous workflow, and 5 ATCC reference strains using Flongle flow cell on the Nanopore MinION Mk1C sequencer (ONT, Oxford, UK). Finally, the sequence type (ST) results of the 308 isolates collected from infected chickens and poultry farm environments were reported and analyzed. Data indicated that E. coli belonging to ST159, ST8578, and ST355 have the potential to infect multiple organs in broiler. In addition, zoonotic STs, ST69, ST10, ST38, and ST131, were detected from poultry farms. With the advantages of the high throughput of ONT, this study provides a rapid workflow for large-scale E. coli typing and identified frequently isolated sequence types related to APEC infection in poultry.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    欧洲蜜蜂(EFB)是美国欧洲蜜蜂(Apismellifera)中的一种流行疾病,这可能导致殖民地的衰落和崩溃。对EFB的细菌成分进行了充分的研究,但是感染菌落内病毒感染的多样性尚未被探索。在这项研究中,我们使用12个蜜蜂蜂巢的元转录组学测序,症状(+,n=6)和无症状(-,n=6)对于EFB,调查与疾病相关的病毒感染。我们组装了41个病毒基因组,属于三个家族(伊夫病毒科,逆生病毒科,和Sinhaliviridae),所有以前报道的蜜蜂,包括西奈湖病毒,畸形的翅膀病毒,sambrood病毒,黑皇后细胞病毒,和以色列急性瘫痪病毒。在患有严重EFB的菌落中,我们观察到病毒基因组(34个基因组)的发生率较高,而从健康菌落中回收的病毒基因组(7个基因组)较少,并且完全不存在双壳病毒科基因组.我们观察到特定的西奈湖病毒进化枝与EFB+或EFB-菌落相关,除了受EFB影响的菌落外,这些菌落表现出相对丰度的增加。多变量分析强调,位点和EFB疾病状态的组合影响RNA病毒组成,虽然EFB状态本身并没有显着影响它,为不同院子里的殖民地之间的比较提出了挑战。这些发现有助于了解受EFB损害的蜜蜂菌落中的病毒动力学,并强调在研究EFB时需要进行未来的研究以考虑病毒组成。重要意义这项关于受欧洲细菌育种病(EFB)影响的蜜蜂菌落病毒系的研究揭示了在流行的细菌性育种病的背景下蜜蜂菌落中病毒种群的动态。与受严重EFB影响的菌落相关的不同西奈湖病毒和sambrood病毒进化枝的鉴定表明病毒组成与疾病状态之间存在潜在的联系。强调需要进一步研究病毒在EFB感染过程中的作用。在EFB感染期间观察到的sambrod病毒的增加表明潜在的病毒生态失调,对蜜蜂育苗健康有潜在影响。这些发现为养蜂实践提供了宝贵的见解,为未来的研究奠定了基础,旨在了解和减轻细菌和病毒感染对商业蜜蜂运营和EFB管理的影响。
    European foulbrood (EFB) is a prevalent disease in the European honey bee (Apis mellifera) in the United States, which can lead to colony decline and collapse. The bacterial components of EFB are well-studied, but the diversity of viral infections within infected colonies has not been explored. In this study, we use meta-transcriptomics sequencing of 12 honey bee hives, symptomatic (+, n = 6) and asymptomatic (-, n = 6) for EFB, to investigate viral infection associated with the disease. We assembled 41 viral genomes, belonging to three families (Iflaviridae, Dicistroviridae, and Sinhaliviridae), all previously reported in honey bees, including Lake Sinai virus, deformed wing virus, sacbrood virus, Black queen cell virus, and Israeli acute paralysis virus. In colonies with severe EFB, we observed a higher occurrence of viral genomes (34 genomes) in contrast to fewer recovered from healthy colonies (seven genomes) and a complete absence of Dicistroviridae genomes.We observed specific Lake Sinai virus clades associated exclusively with EFB + or EFB - colonies, in addition to EFB-afflicted colonies that exhibited an increase in relative abundance of sacbrood viruses. Multivariate analyses highlighted that a combination of site and EFB disease status influenced RNA virome composition, while EFB status alone did not significantly impact it, presenting a challenge for comparisons between colonies kept in different yards. These findings contribute to the understanding of viral dynamics in honey bee colonies compromised by EFB and underscore the need for future investigations to consider viral composition when investigating EFB.IMPORTANCEThis study on the viromes of honey bee colonies affected by European foulbrood (EFB) sheds light on the dynamics of viral populations in bee colonies in the context of a prevalent bacterial brood disease. The identification of distinct Lake Sinai virus and sacbrood virus clades associated with colonies affected by severe EFB suggests a potential connection between viral composition and disease status, emphasizing the need for further investigation into the role of viruses during EFB infection. The observed increase in sacbrood viruses during EFB infection suggests a potential viral dysbiosis, with potential implications for honey bee brood health. These findings contribute valuable insights related to beekeeping practices, offering a foundation for future research aimed at understanding and mitigating the impact of bacterial and viral infection in commercial honey bee operations and the management of EFB.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    我们的目标是评估下一代测序(NGS)与Sanger相比的准确性。我们对来自两个HIV+个体的提取组织DNA进行了HIV-1gp160的单基因组扩增(SGA)。用Sanger对扩增子(n=30)进行测序,或用条形码引物重新扩增并在使用牛津纳米孔技术[ONT]和太平洋生物科学[PB]进行测序之前合并。对于每个扩增子,NGS读段的共有序列是通过(1)将读段映射到Sanger序列(“基于参考的”)或(2)将读段映射到“伪参考”序列获得的,即,NGS读段子集的共有序列(“无参考”)。基于遗传相似性对PB读数进行聚类。获得23/30扩增子的Sanger共有序列,与Sanger相比,所有NGS共有序列相同[n=9]或几乎相同[n=14]。对于Sanger/NGS之间的九个不匹配,NGS序列中的核苷酸与该患者的所有其他序列相匹配。在没有Sanger序列的7/30扩增子中,NGS序列在五个扩增子中有35个模糊的调用,两个扩增子中的模糊度和0。电泳图分析显示后两个扩增子的单个测序引物失败(与单个模板一致),和其他五个的重叠峰(与多个模板一致)。聚类结果紧随Sanger/NGS共识结果,其中来自单个模板的扩增子也具有单个簇,反之亦然(除了一个例外,这可能是条形码错误识别的结果)。来自簇的代表性序列与Sanger/NGS相比包含2-13个差异。总之,我们表明,与Sanger相比,ONT和PB都可以产生具有相似或更高准确性的扩增子共有序列,而且重要的是,不需要已知的参考序列。在某些情况下,聚类可能有助于预测或确认多个起始模板的存在。
    Our goal was to assess the accuracy of next generation sequencing (NGS) compared with Sanger. We performed single genome amplification (SGA) of HIV-1 gp160 on extracted tissue DNA from two HIV+ individuals. Amplicons (n = 30) were sequenced with Sanger or reamplified with barcoded primers and pooled before sequencing using Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PB). For each amplicon, a consensus sequence for NGS reads was obtained by (1) mapping reads to the Sanger sequence when available (\"reference-based\") or (2) mapping reads to a \"pseudo-reference\" sequence, i.e., a consensus sequence of a subset of NGS reads (\"reference-free\"). PB reads were clustered based on genetic similarity. A Sanger consensus sequence was obtained for 23/30 amplicons, for which all NGS consensus sequences were identical (n = 9) or nearly identical (n = 14) compared with Sanger. For the nine mismatches between Sanger/NGS, the nucleotide in the NGS sequence matched all other sequences from that patient. Of the 7/30 amplicons without a Sanger sequence, NGS sequences had ≥35 ambiguous calls in five amplicons and 0 ambiguities in two amplicons. Analysis of the electropherograms showed failure of a single sequencing primer for the latter two amplicons (consistent with a single template) and overlapping peaks for the other five (consistent with multiple templates). Clustering results closely followed the Sanger/NGS consensus results, where amplicons derived from a single template also had a single cluster and vice versa (with one exception, which could be the result of barcode misidentification). Representative sequences from the clusters contained 2-13 differences compared with Sanger/NGS. In summary, we show that both ONT and PB can produce amplicon consensus sequences with similar or higher accuracy compared with Sanger and, importantly, without the need for a known reference sequence. Clustering could be useful in some circumstances to predict or confirm the presence of multiple starting templates.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    海马弧菌是革兰氏阳性细菌,偶尔会从明显健康的母马(Equuscaballus)的生殖道或生殖道异常的母马中分离出来。除了一些基于16SrRNA基因的GenBank序列和一份关于完整基因组组装的最新报告外,该细菌没有详细的基因组序列和临床实验数据。最近,我们观察到,在加拿大大西洋地区,从与母马繁殖失败相关的样本中检测到的生物体异常增加。两种菌落形态(即,小,和大)在培养基中检测到,通过MALDI-TOF质谱和16SrRNA基因测序鉴定为海马A。这里,我们报告了形态型变异的全基因组测序和表征。年夜表型的基因组长度在2.42~2.43之间,小表型为1.99Mbs。大菌落表型之间的直系同源核苷酸同一性为〜99%,大菌落和小菌落表型在77.86和78.52%之间,这可能保证将两种形态类型分类为不同的物种。基于16SrRNA基因或串联的管家基因的系统发育分析将小菌落变体和大菌落变体分为两个不同的基因型簇。UvrA蛋白,它是核苷酸切除修复(NER)系统的一部分,和3-异丙酰基苹果酸脱水酶小亚基蛋白表达的leuD基因被鉴定为潜在的毒力因子在大和小的菌落形态,分别。然而,需要进行详细的功能研究,以确定这些和其他已鉴定的假设蛋白质在细胞代谢中的确切作用以及海马A.
    Arcanobacterium hippocoleae is a Gram-positive fastidious bacterium and is occasionally isolated from the reproductive tract of apparently healthy mares (Equus caballus) or from mares with reproductive tract abnormalities. Apart from a few 16S rRNA gene-based GenBank sequences and one recent report on complete genome assembly, detailed genomic sequence and clinical experimental data are not available on the bacterium. Recently, we observed an unusual increase in the detection of the organism from samples associated with mare reproductive failures in Atlantic Canada. Two colony morphotypes (i.e., small, and large) were detected in culture media, which were identified as A. hippocoleae by MALDI-TOF mass spectrometry and 16S rRNA gene sequencing. Here, we report the whole genome sequencing and characterization of the morphotype variants. The genome length of the large phenotypes was between 2.42 and 2.43, and the small phenotype was 1.99 Mbs. The orthologous nucleotide identity between the large colony phenotypes was ~99%, and the large and small colony phenotypes was between 77.86 and 78.52%, which may warrant the classification of the two morphotypes into different species. Phylogenetic analysis based on 16S rRNA genes or concatenated housekeeping genes grouped the small and large colony variants into two different genotypic clusters. The UvrA protein, which is part of the nucleotide excision repair (NER) system, and 3-isopropoylmalate dehydratase small subunit protein expressed by the leuD gene were identified as potential virulence factors in the large and small colony morphotypes, respectively. However, detailed functional studies will be required to determine the exact roles of these and other identified hypothetical proteins in the cellular metabolism and potential pathogenicity of A. hippocoleae in mares.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    我们提供了一种名为UNO-G1W1的新型荧光假单胞菌噬菌体的完整基因组序列。从单个冰盖取样中分离该噬菌体。基因组在纳米孔Minion上测序,用直接末端重复噬菌体管道产生,并用Illumina短读数抛光。序列同一性将噬菌体分类为耳病毒。
    We provide the complete genome sequence for a novel Pseudomonas fluorescens bacteriophage named UNO-G1W1. This phage was isolated from a single ice cover sampling. The genome was sequenced on the Nanopore MinION, generated with the direct terminal repeat-phage-pipeline and polished with Illumina short reads. Sequence identity classifies the phage as an otagovirus.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    越来越明显的是,RNA采用的结构是构象动态的;RNA样品的各种结构状态控制着它们与其他核酸的相互作用,蛋白质,和配体来调节无数的生物过程。尽管已经开发了几种生物物理方法并用于研究结构化RNA的动态景观,由于可变的大小和灵活性,技术限制限制了它们在所有类型的RNA中的应用。将化学探测实验与下一代和直接测序相结合的最新进展已成为探索RNA构象动力学的替代方法。在这次审查中,我们提供了用于研究RNA构象动力学的基于测序的技术的方法学概述.我们讨论了不同的技术如何使我们能够更好地理解来自各种不同类别的RNA对多个构象状态进行采样的倾向。最后,我们提供了这些技术如何重塑我们对RNA结构的看法的例子。
    It has become increasingly evident that the structures RNAs adopt are conformationally dynamic; the various structured states that RNAs sample govern their interactions with other nucleic acids, proteins, and ligands to regulate a myriad of biological processes. Although several biophysical approaches have been developed and used to study the dynamic landscape of structured RNAs, technical limitations have limited their application to all classes of RNA due to variable size and flexibility. Recent advances combining chemical probing experiments with next-generation- and direct sequencing have emerged as an alternative approach to exploring the conformational dynamics of RNA. In this review, we provide a methodological overview of the sequencing-based techniques used to study RNA conformational dynamics. We discuss how different techniques have enabled us to better understand the propensity of RNAs from a variety of different classes to sample multiple conformational states. Finally, we present examples of the ways these techniques have reshaped how we think about RNA structure.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:子囊出芽酵母是普遍存在的环境微生物,在食品生产和医药中很重要。由于最近密集的基因组研究,基于单系分类群的鉴定,酵母的分类学变得更加有条理。这包括对人类重要的属,比如Kazachstania.直到现在,Hazachstaniahumilis(以前是Humilis念珠菌)被认为是酸面团特异性酵母。此外,任何抗菌活性与该物种无关。
    结果:以前,我们分离了一个酵母菌株,该菌株在黑暗发酵生物反应器中损害了生物氢的产生,并抑制了革兰氏阳性和革兰氏阴性细菌的生长。这里,使用下一代测序技术,我们对这个名为K.humilisMAW1的菌株的基因组进行了测序。这是不源自发酵食品的胡叶木分离物的第一个基因组。我们使用了新的系统发育方法,采用了18S-ITS-D1-D2区域,以显示哈木树MAW1在Kazachstania属其他成员中的位置。通过全球表型分析检查了该菌株,包括利用的碳源和胁迫条件对生长的影响。使用公认的细菌模型大肠杆菌AB1157,我们表明在酸性培养基中培养的K.humilisMAW1通过细胞分裂的干扰抑制细菌生长,表现为细丝的形成。为了更深入地了解胡志明市MAW1的抑制作用,我们选择了23种具有公认的细菌毒性活性的酵母蛋白,并将其用于胡志明市MAW1基因组组装的Blast搜索。所得到的存在于胡枝子MAW1基因组中的一组基因包括编码可能干扰细菌细胞包膜结构的1,3-β-葡聚糖糖苷酶和1,3-β-葡聚糖合成抑制剂的基因。
    结论:我们表征了一种非酸面团衍生的树枝菌菌株,包括其基因组序列和生理方面。MAW1,以及其他humilis菌株,显示了交配型基因座的新组织。此处揭示的pH依赖性抑制细菌生长的能力先前尚未在该物种中得到认可。我们的研究有助于建立基于基因组序列的分类系统;更好地理解K.humilis作为发酵过程中的细胞工厂,并探索微生物群落中的细菌-酵母相互作用。
    BACKGROUND: Ascomycetous budding yeasts are ubiquitous environmental microorganisms important in food production and medicine. Due to recent intensive genomic research, the taxonomy of yeast is becoming more organized based on the identification of monophyletic taxa. This includes genera important to humans, such as Kazachstania. Until now, Kazachstania humilis (previously Candida humilis) was regarded as a sourdough-specific yeast. In addition, any antibacterial activity has not been associated with this species.
    RESULTS: Previously, we isolated a yeast strain that impaired bio-hydrogen production in a dark fermentation bioreactor and inhibited the growth of Gram-positive and Gram-negative bacteria. Here, using next generation sequencing technologies, we sequenced the genome of this strain named K. humilis MAW1. This is the first genome of a K. humilis isolate not originating from a fermented food. We used novel phylogenetic approach employing the 18 S-ITS-D1-D2 region to show the placement of the K. humilis MAW1 among other members of the Kazachstania genus. This strain was examined by global phenotypic profiling, including carbon sources utilized and the influence of stress conditions on growth. Using the well-recognized bacterial model Escherichia coli AB1157, we show that K. humilis MAW1 cultivated in an acidic medium inhibits bacterial growth by the disturbance of cell division, manifested by filament formation. To gain a greater understanding of the inhibitory effect of K. humilis MAW1, we selected 23 yeast proteins with recognized toxic activity against bacteria and used them for Blast searches of the K. humilis MAW1 genome assembly. The resulting panel of genes present in the K. humilis MAW1 genome included those encoding the 1,3-β-glucan glycosidase and the 1,3-β-glucan synthesis inhibitor that might disturb the bacterial cell envelope structures.
    CONCLUSIONS: We characterized a non-sourdough-derived strain of K. humilis, including its genome sequence and physiological aspects. The MAW1, together with other K. humilis strains, shows the new organization of the mating-type locus. The revealed here pH-dependent ability to inhibit bacterial growth has not been previously recognized in this species. Our study contributes to the building of genome sequence-based classification systems; better understanding of K.humilis as a cell factory in fermentation processes and exploring bacteria-yeast interactions in microbial communities.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    非洲猪瘟病毒(ASFV)是一种核质大DNA病毒(NCLDV),编码其自身的宿主样RNA聚合酶(RNAP)和产生成熟mRNA所需的因子。通过ASFVRNAP形成准确的mRNA3'末端取决于转录终止,可能是由序列基序和转录因子的组合实现的,虽然人们对这些知之甚少。任何RNAP的终止很少是100%有效的,终止子处的转录“readthrough”可以产生长的mRNAs,这可能会干扰下游基因的表达。ASFV转录组分析揭示了异质mRNA3'末端的景观,可能是真正终止位点和mRNA降解和加工的结果的组合。虽然短读测序(SRS)如3'RNA-seq表明mRNA3'末端在特定位点的积累,它不能告知哪些启动子和转录起始位点(TSS)指导它们的合成,即,关于核苷酸分辨率下完整和未加工的mRNA的信息。
    这里,我们报告了使用长读数测序(LRS)对全长ASFV转录本进行的严格分析.我们系统地比较了在早期和晚期感染期间从SRS3'RNA-seq预测的转录终止位点与LRS绘制的3'末端。
    使用体外转录测定,我们显示重组ASFVRNAP在非模板链的polyT延伸处终止转录,类似于古细菌RNAP或真核RNAPIII,没有二级RNA结构或预测的病毒终止因子的帮助。我们的结果巩固了这种富含T的基序(RNA中富含U)作为ASFV中的通用转录终止信号。许多基因使用相同的终止子,而基因也可以使用一系列终止子来产生长度变化很大的转录同种型。后一种现象的一个关键因素是我们观察到的高度丰富的终止子读通,与早期感染相比,晚期感染更为普遍。
    这表明在晚期基因启动子控制下的ASFVmRNAs利用与早期启动子不同的终止机制和因子,和/或细胞因子在感染晚期对病毒转录组景观的影响不同。
    African swine fever virus (ASFV) is a nucleocytoplasmic large DNA virus (NCLDV) that encodes its own host-like RNA polymerase (RNAP) and factors required to produce mature mRNA. The formation of accurate mRNA 3\' ends by ASFV RNAP depends on transcription termination, likely enabled by a combination of sequence motifs and transcription factors, although these are poorly understood. The termination of any RNAP is rarely 100% efficient, and the transcriptional \"readthrough\" at terminators can generate long mRNAs which may interfere with the expression of downstream genes. ASFV transcriptome analyses reveal a landscape of heterogeneous mRNA 3\' termini, likely a combination of bona fide termination sites and the result of mRNA degradation and processing. While short-read sequencing (SRS) like 3\' RNA-seq indicates an accumulation of mRNA 3\' ends at specific sites, it cannot inform about which promoters and transcription start sites (TSSs) directed their synthesis, i.e., information about the complete and unprocessed mRNAs at nucleotide resolution.
    Here, we report a rigorous analysis of full-length ASFV transcripts using long-read sequencing (LRS). We systematically compared transcription termination sites predicted from SRS 3\' RNA-seq with 3\' ends mapped by LRS during early and late infection.
    Using in-vitro transcription assays, we show that recombinant ASFV RNAP terminates transcription at polyT stretches in the non-template strand, similar to the archaeal RNAP or eukaryotic RNAPIII, unaided by secondary RNA structures or predicted viral termination factors. Our results cement this T-rich motif (U-rich in the RNA) as a universal transcription termination signal in ASFV. Many genes share the usage of the same terminators, while genes can also use a range of terminators to generate transcript isoforms varying enormously in length. A key factor in the latter phenomenon is the highly abundant terminator readthrough we observed, which is more prevalent during late compared with early infection.
    This indicates that ASFV mRNAs under the control of late gene promoters utilize different termination mechanisms and factors to early promoters and/or that cellular factors influence the viral transcriptome landscape differently during the late stages of infection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Preprint
    超长牛津纳米孔(ONT)测序读数与长,准确的PacBioHiFi读数使人类基因组得以完成,并刺激了类似的努力来完成许多其他物种的基因组。然而,这种方法是完整的,“端粒到端粒”基因组组装依赖于多个测序平台,限制其可达性。ONT\"Duplex\"测序读取,DNA的两条链都被读取以提高质量,承诺高每基地的精度。要评估此新数据类型,我们为三个广泛研究的基因组生成了ONTDuplex数据:人类HG002,番茄1706(番茄),和玉米B73(玉米)。对于二倍体,杂合子HG002基因组,我们还使用“Pore-C”染色质接触作图来完全定相单倍型。我们发现Duplex数据的准确性与HiFi测序相似,但是读取长度要长几十千,和Pore-C数据与现有的二倍体组装算法兼容。读取长度和准确性的这种组合使得能够构建高质量的初始组件,然后可以使用超长读取进一步解决,最后用Pore-C分期成染色体规模的单倍型。产生的组件具有超过99.999%(Q50)的基本精度和接近完美的连续性,大多数染色体组装成单个重叠群。我们得出的结论是,ONT测序是HiFi测序的一种可行的替代方法,用于从头基因组组装,并有可能为完整基因组的重建提供单一仪器的解决方案。
    The combination of ultra-long Oxford Nanopore (ONT) sequencing reads with long, accurate PacBio HiFi reads has enabled the completion of a human genome and spurred similar efforts to complete the genomes of many other species. However, this approach for complete, \"telomere-to-telomere\" genome assembly relies on multiple sequencing platforms, limiting its accessibility. ONT \"Duplex\" sequencing reads, where both strands of the DNA are read to improve quality, promise high per-base accuracy. To evaluate this new data type, we generated ONT Duplex data for three widely-studied genomes: human HG002, Solanum lycopersicum Heinz 1706 (tomato), and Zea mays B73 (maize). For the diploid, heterozygous HG002 genome, we also used \"Pore-C\" chromatin contact mapping to completely phase the haplotypes. We found the accuracy of Duplex data to be similar to HiFi sequencing, but with read lengths tens of kilobases longer, and the Pore-C data to be compatible with existing diploid assembly algorithms. This combination of read length and accuracy enables the construction of a high-quality initial assembly, which can then be further resolved using the ultra-long reads, and finally phased into chromosome-scale haplotypes with Pore-C. The resulting assemblies have a base accuracy exceeding 99.999% (Q50) and near-perfect continuity, with most chromosomes assembled as single contigs. We conclude that ONT sequencing is a viable alternative to HiFi sequencing for de novo genome assembly, and has the potential to provide a single-instrument solution for the reconstruction of complete genomes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    非洲猪瘟病毒导致家猪致命的出血性疾病。NAMP1/1995分离株最初被描述为B646L基因型XVIII;然而,全基因组测序显示此分配不正确.
    African swine fever virus causes a lethal hemorrhagic disease of domestic pigs. The NAM P1/1995 isolate was originally described as B646L genotype XVIII; however, full genome sequencing revealed that this assignment was incorrect.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号