关键词: Chinese funnel-web spider Macrothele yani Spider genome Toxins Venom-gland transcritome and proteome

Mesh : Animals Genome Molecular Sequence Annotation Phylogeny Proteome Spider Venoms / genetics chemistry Spiders / genetics classification

来  源:   DOI:10.1016/j.ijbiomac.2024.131780

Abstract:
Macrothelidae is a family of mygalomorph spiders containing the extant genera Macrothele and Vacrothele. China is an important center of diversity for Macrothele with 65 % of the known species occurring there. Previous work on Macrothele was able to uncover several important toxin compounds including Raventoxin which may have applications in biomedicine and agricultural chemistry. Despite the importance of Macrothele spiders, high-quality reference genomes are still lacking, which hinders our understanding and application of the toxin compounds. In this study, we assembled the genome of the Macrothele yani to help fill gaps in our understanding of toxin biology in this lineage of spiders to encourage the future study and applications of these compounds. The final assembled genome was 6.79 Gb in total length, had a contig N50 of 21.44 Mb, and scaffold N50 of 156.16 Mb. Hi-C scaffolding assigned 98.19 % of the genome to 46 pseudo-chromosomes with a BUSCO score of 95.7 % for the core eukaryotic gene set. The assembled genome was found to contain 75.62 % repetitive DNA and a total of 39,687 protein-coding genes were annotated making it the spider genome with highest number of genes. Through integrated analysis of venom gland transcriptomics and venom proteomics, a total of 194 venom toxins were identified, including 38 disulfide-rich peptide neurotoxins, among which 12 were ICK knottin peptides. In summary, we present the first high-quality genome assembly at the chromosomal level for any Macrothelidae spider, filling an important gap in our knowledge of these spiders. Such high-quality genomic data will be invaluable as a reference in resolving Araneae spider phylogenies and in screening different spider species for novel compounds applicable to numerous medical and agricultural applications.
摘要:
巨细胞科是一个mygalomorph蜘蛛家族,包含现存的巨细胞和瓦克罗特科。中国是宏观物种多样性的重要中心,有65%的已知物种发生在中国。先前关于Macrothele的工作能够发现几种重要的毒素化合物,包括Raventoxin,它们可能在生物医学和农业化学中应用。尽管Macrothele蜘蛛很重要,仍然缺乏高质量的参考基因组,这阻碍了我们对毒素化合物的理解和应用。在这项研究中,我们组装了Macrotheleyani的基因组,以帮助填补我们对这种蜘蛛谱系中毒素生物学理解的空白,以鼓励这些化合物的未来研究和应用。最终组装的基因组总长度为6.79Gb,重叠群N50为21.44Mb,脚手架N50为156.16Mb。Hi-C支架将98.19%的基因组分配给46个假染色体,核心真核基因组的BUSCO评分为95.7%。发现组装的基因组包含75.62%的重复DNA,总共注释了39,687个蛋白质编码基因,使其成为基因数量最多的蜘蛛基因组。通过对毒腺转录组学和毒液蛋白质组学的综合分析,共鉴定出194种毒液毒素,包括38个富含二硫化物的肽类神经毒素,其中12种为ICK结蛋白肽。总之,我们提出了第一个高质量的染色体水平的基因组组装,填补了我们对这些蜘蛛知识的重要空白。这种高质量的基因组数据将作为解决Araneae蜘蛛系统发育和筛选不同蜘蛛物种以寻找适用于许多医学和农业应用的新型化合物的参考。
公众号