关键词: Arabidopsis ecotypes long-read sequencing natural variation pan-genome structural variation

Mesh : Arabidopsis / genetics Ecotype Genome, Plant Chromosomes, Plant / genetics Molecular Sequence Annotation Genetic Variation

来  源:   DOI:10.1093/gbe/evae169   PDF(Pubmed)

Abstract:
The plant Arabidopsis thaliana is a model system used by researchers through much of plant research. Recent efforts have focused on discovering the genomic variation found in naturally occurring ecotypes isolated from around the world. These ecotypes have come from diverse climates and therefore have faced and adapted to a variety of abiotic and biotic stressors. The sequencing and comparative analysis of these genomes can offer insight into the adaptive strategies of plants. While there are a large number of ecotype genome sequences available, the majority were created using short-read technology. Mapping of short-reads containing structural variation to a reference genome bereft of that variation leads to incorrect mapping of those reads, resulting in a loss of genetic information and introduction of false heterozygosity. For this reason, long-read de novo sequencing of genomes is required to resolve structural variation events. In this article, we sequenced the genomes of eight natural variants of A. thaliana using nanopore sequencing. This resulted in highly contiguous assemblies with >95% of the genome contained within five contigs. The sequencing results from this study include five ecotypes from relict and African populations, an area of untapped genetic diversity. With this study, we increase the knowledge of diversity we have across A. thaliana ecotypes and contribute to ongoing production of an A. thaliana pan-genome.
摘要:
植物拟南芥是研究人员通过许多植物研究使用的模型系统。最近的努力集中在发现从世界各地分离的天然存在的生态型中发现的基因组变异。这些生态型来自不同的气候,因此面临并适应了各种非生物和生物应激源。这些基因组的测序和比较分析可以提供对植物适应性策略的洞察。虽然有大量的生态型基因组序列可用,大多数是使用短读技术创建的。将包含结构变异的短读段映射到缺少该变异的参考基因组会导致这些读段的错误映射,导致遗传信息的丢失和假杂合性的引入。出于这个原因,需要基因组的长读从头测序来解决结构变异事件。在本文中,我们使用纳米孔测序对8种自然变种的基因组进行了测序。这导致高度连续的组装,其中>95%的基因组包含在5个重叠群内。这项研究的测序结果包括来自遗留物和非洲人群的5种生态型,尚未开发的遗传多样性领域。通过这项研究,我们增加了对拟南芥生态型多样性的认识,并有助于拟南芥泛基因组的持续生产。
公众号