关键词: Lepidium apetalum Genome assembly Hi-C PacBio sequencing Transcriptome

Mesh : Plants, Medicinal / genetics Genome, Plant Lepidium / genetics Molecular Sequence Annotation Chromosomes, Plant / genetics Genomics / methods High-Throughput Nucleotide Sequencing Phylogeny

来  源:   DOI:10.1186/s12863-024-01243-9   PDF(Pubmed)

Abstract:
OBJECTIVE: As a traditional Chinese medicine, Lepidium apetalum is commonly used for purging the lung, relieving dyspnea, alleviating edema, and has the significant pharmacological effects on cardiovascular disease, hyperlipidemia, etc. In addition, the seeds of L. apetalum are rich in unsaturated fatty acids, sterols, glucosinolates and have a variety of biological activity compounds. To facilitate genomics, phylogenetic and secondary metabolite biosynthesis studies of L. apetalum, we assembled the high-resolution genome of L. apetalum.
METHODS: We completed chromosome-level genome assembly of the L. apetalum genome (2n = 32), using Illumina HiSeq and PacBio Sequel sequencing platform as well as high-throughput chromosome conformation capture (Hi-C) technique. The assembled genome was 296.80 Mb in size, 34.41% in GC content, and 23.89% in repeated sequence content, including 316 contigs with a contig N50 of 16.31 Mb. Hi-C scaffolding resulted in 16 chromosomes occupying 99.79% of the assembled genome sequences. A total of 46 584 genes and 105 pseudogenes were predicted, 98.37% of which can be annotated to Nr, GO, KEGG, TrEMBL, SwissPort, Pfam and KOG databases. The high-quality reference genome generated by this study will provide accurate genetic information for the molecular biology research of L. apetalum.
摘要:
目的:作为中药,apetalumpepidium通常用于净化肺部,缓解呼吸困难,缓解水肿,对心血管疾病有显著的药理作用,高脂血症,等。此外,L.apetalum的种子富含不饱和脂肪酸,固醇,芥子油苷和具有多种生物活性的化合物。为了促进基因组学,阿比塔兰乳杆菌的系统发育和次生代谢产物生物合成研究,我们组装了阿佩塔拉氏菌的高分辨率基因组。
方法:我们完成了apetalumL.petalum基因组的染色体水平基因组组装(2n=32),使用IlluminaHiSeq和PacBioSequel测序平台以及高通量染色体构象捕获(Hi-C)技术。组装的基因组大小为296.80Mb,GC含量34.41%,重复序列含量为23.89%,包括316个重叠群,重叠群N50为16.31Mb。Hi-C支架导致16条染色体占据组装的基因组序列的99.79%。共预测46584个基因和105个假基因,其中98.37%可以注释为Nr,GO,KEGG,TrEMBL,SwissPort,Pfam和KOG数据库。本研究产生的高质量参考基因组将为阿贝塔兰的分子生物学研究提供准确的遗传信息。
公众号