Mesh : Eukaryota / genetics RNA, Ribosomal, 18S / genetics Databases, Genetic Databases, Nucleic Acid Animals Genes, rRNA / genetics Phylogeny

来  源:   DOI:10.1093/database/baae043   PDF(Pubmed)

Abstract:
Molecular identification of micro- and macroorganisms based on nuclear markers has revolutionized our understanding of their taxonomy, phylogeny and ecology. Today, research on the diversity of eukaryotes in global ecosystems heavily relies on nuclear ribosomal RNA (rRNA) markers. Here, we present the research community-curated reference database EUKARYOME for nuclear ribosomal 18S rRNA, internal transcribed spacer (ITS) and 28S rRNA markers for all eukaryotes, including metazoans (animals), protists, fungi and plants. It is particularly useful for the identification of arbuscular mycorrhizal fungi as it bridges the four commonly used molecular markers-ITS1, ITS2, 18S V4-V5 and 28S D1-D2 subregions. The key benefits of this database over other annotated reference sequence databases are that it is not restricted to certain taxonomic groups and it includes all rRNA markers. EUKARYOME also offers a number of reference long-read sequences that are derived from (meta)genomic and (meta)barcoding-a unique feature that can be used for taxonomic identification and chimera control of third-generation, long-read, high-throughput sequencing data. Taxonomic assignments of rRNA genes in the database are verified based on phylogenetic approaches. The reference datasets are available in multiple formats from the project homepage, http://www.eukaryome.org.
摘要:
基于核标记的微生物和大型生物的分子鉴定彻底改变了我们对其分类学的理解,系统发育和生态学。今天,全球生态系统中真核生物多样性的研究在很大程度上依赖于核核糖体RNA(rRNA)标记。这里,我们提出了研究社区策划的参考数据库,用于核核糖体18SrRNA,所有真核生物的内部转录间隔区(ITS)和28SrRNA标记,包括后生动物(动物),原生生物,真菌和植物。它对于识别丛枝菌根真菌特别有用,因为它桥接了四个常用的分子标记ITS1,ITS2,18SV4-V5和28SD1-D2子区域。该数据库相对于其他注释的参考序列数据库的关键优点是它不限于某些分类组,并且包括所有rRNA标记。EUKARYOME还提供了许多来自(元)基因组和(元)条形码的参考长读序列,这是一个独特的特征,可用于分类学鉴定和第三代嵌合体控制,长读,高通量测序数据。基于系统发育方法验证了数据库中rRNA基因的分类分配。参考数据集可从项目主页获得多种格式,http://www。真核生物.org.
公众号