关键词: Hadruridae arachnid nanopore pore-c reference genome scorpion

Mesh : Animals Scorpions / genetics Genome Chromosomes / genetics Phylogeny Molecular Sequence Annotation Evolution, Molecular

来  源:   DOI:10.1093/gbe/evae097   PDF(Pubmed)

Abstract:
Over 400 million years old, scorpions represent an ancient group of arachnids and one of the first animals to adapt to life on land. Presently, the lack of available genomes within scorpions hinders research on their evolution. This study leverages ultralong nanopore sequencing and Pore-C to generate the first chromosome-level assembly and annotation for the desert hairy scorpion, Hadrurus arizonensis. The assembled genome is 2.23 Gb in size with an N50 of 280 Mb. Pore-C scaffolding reoriented 99.6% of bases into nine chromosomes and BUSCO identified 998 (98.6%) complete arthropod single copy orthologs. Repetitive elements represent 54.69% of the assembled bases, including 872,874 (29.39%) LINE elements. A total of 18,996 protein-coding genes and 75,256 transcripts were predicted, and extracted protein sequences yielded a BUSCO score of 97.2%. This is the first genome assembled and annotated within the family Hadruridae, representing a crucial resource for closing gaps in genomic knowledge of scorpions, resolving arachnid phylogeny, and advancing studies in comparative and functional genomics.
摘要:
超过4亿年的历史,蝎子代表着一群古老的蜘蛛,也是第一批适应陆地生活的动物之一。目前,蝎子缺乏可用的基因组阻碍了对它们进化的研究。这项研究利用超长纳米孔测序和Pore-C来生成沙漠多毛蝎子的第一个染色体水平组装和注释,阿拉伯哈德鲁.组装的基因组大小为2.23Gb,N50为280Mb。Pore-C支架将99.6%的碱基重新定向到9条染色体中,BUSCO鉴定出998(98.6%)完整的节肢动物单拷贝直系同源物。重复元素占组装底座的54.69%,包括872,874(29.39%)线元素。共预测了18,996个蛋白质编码基因和75,256个转录本,提取的蛋白质序列获得了97.2%的BUSCO评分。这是哈氏科家族中第一个组装和注释的基因组,代表了缩小蝎子基因组知识差距的关键资源,解决蜘蛛系统发育,并推进比较和功能基因组学的研究。
公众号