Transposable elements

转座因子
  • 文章类型: Journal Article
    骨髓增生异常综合征(MDS)的突变谱已经确定,相对少量的遗传畸变,包括SF3B1和SRSF2剪接体突变,导致特定的表型和预后亚群。我们对两个已发表的骨髓单核细胞(BMMNCs)和CD34+细胞的MDS队列进行了多组学因子分析(MOFA),具有三种数据模式(临床,基因型,和转录组学)。七种不同的观点,包括免疫特征,炎症/衰老,反转录转座子(RTE)表达,和细胞类型组成,从这些方法中得出,以确定对MDS预后有重大影响的潜在因素。SF3B1是BMMNC队列中13个突变中唯一的突变,表明与高炎症显著相关。在CD34+组群中也观察到较小程度的这种趋势。有趣的是,代表炎症的MOFA因子对高度炎症的MDS患者显示良好的预后。相比之下,SRSF2突变病例显示粒细胞-单核细胞祖细胞(GMP)模式和高水平的衰老,免疫衰老,和恶性骨髓细胞,与他们的不良预后一致。此外,MOFA将RTE表达确定为MDS的危险因素。这项工作阐明了我们评估MDS风险的综合方法的有效性,该方法超出了迄今为止针对MDS描述的所有评分系统。
    Mutational profiles of myelodysplastic syndromes (MDS) have established that a relatively small number of genetic aberrations, including SF3B1 and SRSF2 spliceosome mutations, lead to specific phenotypes and prognostic subgrouping. We performed a multi-omics factor analysis (MOFA) on two published MDS cohorts of bone marrow mononuclear cells (BMMNCs) and CD34 + cells with three data modalities (clinical, genotype, and transcriptomics). Seven different views, including immune profile, inflammation/aging, retrotransposon (RTE) expression, and cell-type composition, were derived from these modalities to identify the latent factors with significant impact on MDS prognosis. SF3B1 was the only mutation among 13 mutations in the BMMNC cohort, indicating a significant association with high inflammation. This trend was also observed to a lesser extent in the CD34 + cohort. Interestingly, the MOFA factor representing the inflammation shows a good prognosis for MDS patients with high inflammation. In contrast, SRSF2 mutant cases show a granulocyte-monocyte progenitor (GMP) pattern and high levels of senescence, immunosenescence, and malignant myeloid cells, consistent with their poor prognosis. Furthermore, MOFA identified RTE expression as a risk factor for MDS. This work elucidates the efficacy of our integrative approach to assess the MDS risk that goes beyond all the scoring systems described thus far for MDS.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    卫星DNA重复序列是在真核基因组中发现的重复DNA序列,通常由串联阵列中重复的短DNA基序组成。尽管有大量关于其他分类群卫星DNA重复的文献,专门针对Tettigoniidae的调查仍然明显缺乏。我们的研究旨在填补我们对形成Tettigoniidae基因组的卫星进化过程的理解的关键空白。重复体分析显示,地中海地中海的基因组占92%,白灵芝的最低值为34%,在其他Tettigoniidae物种中平均占67%。分析揭示了Tettigoniidae家族物种之间卫星DNA重复序列的显着差异,地中海分枝杆菌数量最高,246,迄今为止在昆虫中报告,数量最低,10,在灰翅目中。鲁斯波利亚·杜比亚和鲁斯波利亚·云南,是同属物种,展示104个和84个家庭的不同数量,分别。R.dubia中的卫星DNA重复表现出最高的丰度,占总基因组的17.2%,虽然在灰翅目中报道的丰度最低,5.65%。基因组大小与卫星DNA家族计数弱相关(rs=0.42,p=0.29),但是卫星丰度与家庭数量之间存在很强的相关性(rs=0.73,p=0.03)。此外,卫星DNA增益和损失模式的分析提供了对基因组中卫星DNA家族的扩增和均质化的见解,物种特异性重复序列表现出积极的扩增趋势。地中海分枝杆菌的染色体分布表明,在Chr12,Chr01和Chr04上观察到最高的积累,占17.79%,17.4%,占总染色体大小的17.22%,分别。卫星DNA家族的染色体特异性繁殖很明显,MthSat01仅在1号染色体上,MthSat170在2号染色体上,共享1.64%和2.33%。观察到的卫星DNA数量和丰度的保守性和变化,以及不同的得失模式,表明潜在的不同进化过程对这些昆虫基因组景观的影响,这需要进一步调查。此外,卫星DNA在特定染色体上的差异积累意味着潜在的染色体特异性功能或结构特征会影响卫星序列的保留和增殖。
    Satellite DNA repeats are repetitive DNA sequences found in eukaryotic genomes, typically consisting of short DNA motifs repeated in tandem arrays. Despite the vast body of literature on satellite DNA repeats in other taxa, investigations specifically targeting Tettigoniidae remain conspicuously absent. Our study aims to fill a critical gap in our understanding of satellitome evolutionary processes shaping Tettigoniidae genomes. Repeatome analysis revealed that the Meconema thalassinum genome comprises 92%, and Phryganogryllacris superangulata had the lowest value of 34%, with an average of 67% in other Tettigoniidae species. The analysis reveals significant variation in the number of satellite DNA repeats across species of the Tettigoniidae family, with M. thalassinum exhibiting the highest count, 246, reported in insects to date and the lowest count, 10, in Pholidoptera griseoptera. Ruspolia dubia and Ruspolia yunnana, which are congeneric species, showcase distinct counts of 104 and 84 families, respectively. Satellite DNA repeats in R. dubia exhibit the highest abundance, constituting 17.2% of the total genome, while the lowest abundance was reported in P. griseoptera, at 5.65%. The genome size correlates weakly with the satellite DNA family count (rs = 0.42, p = 0.29), but a strong correlation exists between satellite abundance and family number (rs = 0.73, p = 0.03). Moreover, the analysis of satellite DNA gain and loss patterns provides insights into the amplification and homogenization of satellite DNA families within the genome, with species-specific repeats exhibiting a positive trend toward amplification. The chromosomal distribution in M. thalassinum displayed that the highest accumulation was observed on Chr12, Chr01, and Chr04, constituting 17.79%, 17.4%, and 17.22% of the total chromosome size, respectively. The chromosome-specific propagation of satellite DNA families was evident, with MthSat01 solely on chromosome 1 and MthSat170 on chromosome 2, sharing 1.64% and 2.33%. The observed conservation and variations in satellite DNA number and abundances, along with distinct patterns of gain and loss, indicate the influence of potentially diverse evolutionary processes shaping the genomic landscape of these insects, which requires further investigation. Furthermore, the differential accumulation of satellite DNA on specific chromosomes implies that potential chromosome-specific functions or structural features influence the retention and proliferation of satellite sequences.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    多倍体植物的基因组存档了导致其当前形式的进化事件。然而,植物多倍体基因组对多倍体类型分类和探索基因组动力学的基因组比较算法提出了许多障碍。
    这里,用于检查多倍体基因组的基因组内和基因组间比较问题被重新定义为宏基因组问题,支持使用快速且可扩展的MinHashing方法。为了确定如何通过这种宏基因组方法描述多倍体的类型,从多倍体光谱中检查植物基因组的k聚体组成和频率,并具有一定范围的k聚体大小。在这种方法中,没有鉴定出亚基因组特异性k-mer;相反,利用全染色体k-mer亚空间。
    给定具有足够亚基因组特异性重复元件含量的染色体尺度基因组组装,揭示了文献验证的亚基因组和基因组进化关系,包括区分自体多倍体和推定的祖先基因组分配。负责的序列是转座因子迅速发展的景观。对MinHashing参数的研究表明,下采样的k-mer空间(基因组签名)产生了极好的序列相似性近似值。此外,对用于比较基因组特征的聚类方法进行了审查,以确保基于宏基因组学的方法的适用性.
    基于MinHashing的易于实施且计算效率高的序列比较策略可实现大型和复杂多倍体植物基因组的比较亚基因组学和基因组学。这样的比较为多倍体型亚基因组分配提供了证据。在给定染色体全局k-mer谱的亚基因组特异性重复信号可能不够的情况下,更具体但计算更复杂的替代方法胜过这种方法。
    UNASSIGNED: The genomes of polyploid plants archive the evolutionary events leading to their present forms. However, plant polyploid genomes present numerous hurdles to the genome comparison algorithms for classification of polyploid types and exploring genome dynamics.
    UNASSIGNED: Here, the problem of intra- and inter-genome comparison for examining polyploid genomes is reframed as a metagenomic problem, enabling the use of the rapid and scalable MinHashing approach. To determine how types of polyploidy are described by this metagenomic approach, plant genomes were examined from across the polyploid spectrum for both k-mer composition and frequency with a range of k-mer sizes. In this approach, no subgenome-specific k-mers are identified; rather, whole-chromosome k-mer subspaces were utilized.
    UNASSIGNED: Given chromosome-scale genome assemblies with sufficient subgenome-specific repetitive element content, literature-verified subgenomic and genomic evolutionary relationships were revealed, including distinguishing auto- from allopolyploidy and putative progenitor genome assignment. The sequences responsible were the rapidly evolving landscape of transposable elements. An investigation into the MinHashing parameters revealed that the downsampled k-mer space (genomic signatures) produced excellent approximations of sequence similarity. Furthermore, the clustering approach used for comparison of the genomic signatures is scrutinized to ensure applicability of the metagenomics-based method.
    UNASSIGNED: The easily implementable and highly computationally efficient MinHashing-based sequence comparison strategy enables comparative subgenomics and genomics for large and complex polyploid plant genomes. Such comparisons provide evidence for polyploidy-type subgenomic assignments. In cases where subgenome-specific repeat signal may not be adequate given a chromosomes\' global k-mer profile, alternative methods that are more specific but more computationally complex outperform this approach.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    转座因子占真核生物基因组的很大比例,许多被认为是古代病毒感染的残留物。目前的研究已经开始强调转座因子在免疫系统对感染的反应中的作用。然而,我们对感染过程中转座因子表达的大部分知识都受到来自每个研究的特定宿主和病原体因子的限制,这使得很难对转座因子在感染过程中的作用进行比较研究和开发更广泛的模式。这里,我们使用模型中可用的工具和资源,黑腹果蝇,分析受细菌影响的果蝇的多个基因表达数据集,真菌,和病毒感染。我们分析了病原体种类的差异,宿主基因型,宿主组织,和性别,以了解这些因素如何影响感染期间的转座因子表达。我们的结果突出了病原体之间共享和独特的转座因子表达模式,并表明病原体因子对宿主因子影响转座因子表达的作用更大。
    Transposable elements make up substantial proportions of eukaryotic genomes and many are thought to be remnants of ancient viral infections. Current research has begun to highlight the role transposable elements can play in the immune system response to infections. However, most of our knowledge about transposable element expression during infection is limited by the specific host and pathogen factors from each study, making it difficult to compare studies and develop broader patterns regarding the role of transposable elements during infection. Here, we use the tools and resources available in the model, Drosophila melanogaster, to analyze multiple gene expression datasets of flies subject to bacterial, fungal, and viral infections. We analyzed differences in pathogen species, host genotype, host tissue, and sex to understand how these factors impact transposable element expression during infection. Our results highlight both shared and unique transposable element expression patterns between pathogens and suggest a larger effect of pathogen factors over host factors for influencing transposable element expression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    对复杂性状的强烈选择会导致性状均值偏斜和种群性状变异性降低。这种现象的一个例子可以在驯化物种中持续的人为选择压力驱动的等位基因频率变化和扭曲的性状分布中得到证明。狗的驯化与几种基因组变异有关;然而,当在基因组的非编码区中发现时,这些变体的功能影响可能并不总是简单的.在犬CFA6上5Mb区域的非编码位点中发现的四个多态性转座元件(TE)已经进化,这是由于定向选择与驯养犬中人类指导的超社交能力增强相关。我们发现犬GTF2I基因内含子17中的多态性TE,先前报道与犬科动物人类主导的超社交能力呈负相关,与改变的染色质循环有关,因此与不同的顺式调节景观有关。我们报道了E2F1-DNA结合峰与改变的环和GTF2I外显子18的更高表达一致的支持证据,表明选择性剪接。全球范围内,我们发现调节细胞外基质的途径在TE拷贝数方面存在差异.总的来说,我们报道的证据表明,狗的超社会行为的出现和相同的基因之间有一个有趣的分子趋同,当半合子时,产生以颅面缺陷和社交行为增强为特征的人类威廉姆斯·贝伦综合症。我们的结果还强调了染色质结构在社会进化中经常被忽视的潜在作用。
    Strong selection on complex traits can lead to skewed trait means and reduced trait variability in populations. An example of this phenomenon can be evidenced in allele frequency changes and skewed trait distributions driven by persistent human-directed selective pressures in domesticated species. Dog domestication is linked to several genomic variants; however, the functional impacts of these variants may not always be straightforward when found in non-coding regions of the genome. Four polymorphic transposable elements (TE) found within non-coding sites along a 5 Mb region on canine CFA6 have evolved due to directional selection associated with heightened human-directed hyper-sociability in domesticated dogs. We found that the polymorphic TE in intron 17 of the canine GTF2I gene, which was previously reported to be negatively correlated with canid human-directed hyper-sociability, is associated with altered chromatin looping and hence distinct cis-regulatory landscapes. We reported supporting evidence of an E2F1-DNA binding peak concordant with the altered loop and higher expression of GTF2I exon 18, indicative of alternative splicing. Globally, we discovered differences in pathways regulating the extra-cellular matrix with respect to TE copy number. Overall, we reported evidence suggesting an intriguing molecular convergence between the emergence of hypersocial behaviors in dogs and the same genes that, when hemizygous, produce human Williams Beuren Syndrome characterized by cranio-facial defects and heightened social behaviors. Our results additionally emphasize the often-overlooked potential role of chromatin architecture in social evolution.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    全基因组复制(WGD)循环的分子基础和后果以及随后通过亚基因组分级分离的基因丢失在很大程度上仍然难以捉摸。内生驱动因素,如转座因子,已被假定为塑造全基因组优势和偏倚分级分离,从而导致保守的最小分级分离(LF)和退化的最大分级分离(MF)亚基因组。相比之下,外源性因素的作用,例如由环境压力引起的,被忽视了。大约1100万年前经历了WGD事件的高山巴克勒芥末(Biscutellalaevigata;十字花科)的染色体规模组装与对热的转录反应相结合,冷,干旱和食草动物来评估基因表达如何与MF和LF亚基因组中的差异基因保留相关。抵消转座因子在减少MF亚基因组中附近基因的表达和保留方面的影响,剂量平衡被强调为在纯化选择下保留重复基因产物的主要内源性启动子。与“把爱好变成工作”模式一致,大约三分之一的环境响应重复显示新的表达模式,一个拷贝通常保持有条件表达,而另一个拷贝已经进化了组成型表达,强调外源因素是基因保留的主要驱动因素。显示不均匀的分馏模式,区域保持不偏不倚,而其他区域则显示出高偏倚和环境响应基因的显着富集,这个中多倍体基因组呈现的进化特征与内源性和外源性因素的相互作用相一致,这些因素在WGD分级分离周期后具有驱动的基因含量.
    The molecular underpinnings and consequences of cycles of whole-genome duplication (WGD) and subsequent gene loss through subgenome fractionation remain largely elusive. Endogenous drivers, such as transposable elements (TEs), have been postulated to shape genome-wide dominance and biased fractionation, leading to a conserved least-fractionated (LF) subgenome and a degenerated most-fractionated (MF) subgenome. In contrast, the role of exogenous factors, such as those induced by environmental stresses, has been overlooked. In this study, a chromosome-scale assembly of the alpine buckler mustard (Biscutella laevigata; Brassicaceae) that underwent a WGD event about 11 million years ago is coupled with transcriptional responses to heat, cold, drought, and herbivory to assess how gene expression is associated with differential gene retention across the MF and LF subgenomes. Counteracting the impact of TEs in reducing the expression and retention of nearby genes across the MF subgenome, dosage balance is highlighted as a main endogenous promoter of the retention of duplicated gene products under purifying selection. Consistent with the \"turn a hobby into a job\" model, about one-third of environment-responsive duplicates exhibit novel expression patterns, with one copy typically remaining conditionally expressed, whereas the other copy has evolved constitutive expression, highlighting exogenous factors as a major driver of gene retention. Showing uneven patterns of fractionation, with regions remaining unbiased, but with others showing high bias and significant enrichment in environment-responsive genes, this mesopolyploid genome presents evolutionary signatures consistent with an interplay of endogenous and exogenous factors having driven gene content following WGD-fractionation cycles.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:转座因子(TE)是真核生物基因组的主要组成部分。大量的证据表明,尽管它们曾经被认为是“基因组寄生虫”,转座子及其转录本执行特定功能,如早期胚胎发育的调节。了解TEs在吸虫等寄生虫中的作用变得至关重要。肝片吸虫,一种影响人类和牲畜的寄生虫,在不同的环境和主机中经历复杂的生命周期,到目前为止,关于其生命周期调节的知识还很少。
    方法:我们总结了有关肝肝菌中重复元件的数据,并在其生命周期阶段进行了大量RNA-seq分析。TE表达谱进行了分析,关注与先前描述的长链非编码RNA(lncRNAs)的差异表达和潜在同源性。
    结果:差异表达分析揭示了阶段特异性TE转录模式,尤其是在卵和囊虫阶段达到峰值。一些TE与已知的lncRNAs具有同源性,并含有推定的转录因子结合位点。有趣的是,与成虫相比,卵和尾虫中的TE转录水平最高,提示吸虫生命周期转变中的调节作用。
    结论:这些研究结果表明,TEs可能在调节吸虫生命周期转变中发挥作用。此外,TE与lncRNAs的同源性强调了它们在基因调控中的重要性。
    BACKGROUND: Transposable elements (TEs) are major components of eukaryotic genomes. The extensive body of evidence suggests that although they were once considered \"genomic parasites\", transposons and their transcripts perform specific functions, such as regulation of early embryo development. Understanding the role of TEs in such parasites as trematodes is becoming critically important. Fasciola hepatica, a parasite affecting humans and livestock, undergoes a complex life cycle in diverse environments and hosts, and knowledge about its life cycle regulation is scarce so far.
    METHODS: We summarized the data regarding the repetitive elements in F. hepatica and conducted bulk RNA-seq analysis across its life cycle stages. TE expression profiles were analyzed, focusing on differential expression and potential homology with previously described long non-coding RNAs (lncRNAs).
    RESULTS: Differential expression analysis revealed stage-specific TE transcription patterns, notably peaking during egg and metacercariae stages. Some TEs showed homology with known lncRNAs and contained putative transcription factor binding sites. Interestingly, TE transcription levels were highest in eggs and metacercariae compared to adults, suggesting regulatory roles in trematode life cycle transitions.
    CONCLUSIONS: These findings suggest that TEs may play roles in regulating trematode life cycle transitions. Moreover, TE homology with lncRNAs underscores their significance in gene regulation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    转座因子(TE)存在于几乎每个真核生物基因组中,并且对于产生从头遗传变异是重要的。然而,在昂贵且耗时的全基因组测序方法之外,在非模型物种中研究TE多态性的一组可用方法非常有限.转座子展示(TD)是一种简单而有效的技术,可通过使用靶向特定TE家族的引物鉴定扩增的片段长度多态性来表征样品中的多态性。到目前为止,这种技术几乎只用于植物。这里,我们为具有小基因组的昆虫物种(例如蚂蚁)提出了优化的TD协议(约200-600Mb)。我们表征了两种不同的遗传谱系之间的TE多态性,以及新世界血统的邻近人群之间。我们发现了活跃的LTR/Ty3反转录转座子,这导致了该物种种群的遗传多样化。
    Transposable elements (TEs) are found in virtually every eukaryotic genome and are important for generating de novo genetic variation. However, outside of costly and time-consuming whole-genome sequencing approaches, the set of available methods to study TE polymorphisms in non-model species is very limited. The Transposon Display (TD) is a simple yet effective technique to characterize polymorphisms across samples by identifying amplified fragment length polymorphisms using primers targeting specific TE families. So far, this technique has almost exclusively been used in plants. Here, we present an optimized TD protocol for insect species with small genomes such as ants (ca. 200-600 Mb). We characterized TE polymorphisms between two distinct genetic lineages of the invasive ant Cardiocondyla obscurior, as well as between neighboring populations of the New World lineage. We found active LTR/Ty3 retrotransposons, that contributed to the genetic diversification of populations in this species.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    表观基因组是一组相互作用的化学标记和分子,有助于塑造发育模式,表型可塑性和基因调控,部分原因是它对环境刺激的反应。在生态现实条件下,人们越来越有兴趣了解这种敏感性的功能和进化重要性。观察到自然种群中大量存在表观遗传变异,这促使人们猜测它可能促进对快速环境扰动的进化反应,比如在气候变化下发生的。一个常见的争论点是表观遗传变异是反映遗传变异还是独立于遗传变异。基因组和表观基因组通常表现为紧密相连和相互依赖。虽然许多表观遗传变化是由遗传决定的,反之亦然,DNA序列的变化受到表观遗传标记的影响。了解表观基因组,因此,基因组和环境相互作用是解释表观基因组变异更广泛的进化后果的重要步骤。借鉴在不同植物和动物物种中进行的实验和比较研究的结果,我们综合了我们目前对这些因素如何相互作用以形成自然种群中的表型变异的理解,重点是确定分类群体之间的异同。我们描述了表观基因组的主要组成部分,以及它们在分类单元内和分类单元之间的变化。我们回顾了表观基因组中的变异如何与遗传特征和环境决定因素相互作用,重点关注转座因子(TE)在整合表观基因组中的作用,基因组和环境。我们看看最近的研究,调查这些相互作用的功能和进化后果。尽管自然界中的表观遗传分化通常可能是随机表观突变的漂移或选择的结果,越来越多的证据表明,其中很大一部分可以稳定遗传,因此可以独立于遗传变化而促进进化。
    The epigenome is the suite of interacting chemical marks and molecules that helps to shape patterns of development, phenotypic plasticity and gene regulation, in part due to its responsiveness to environmental stimuli. There is increasing interest in understanding the functional and evolutionary importance of this sensitivity under ecologically realistic conditions. Observations that epigenetic variation abounds in natural populations have prompted speculation that it may facilitate evolutionary responses to rapid environmental perturbations, such as those occurring under climate change. A frequent point of contention is whether epigenetic variants reflect genetic variation or are independent of it. The genome and epigenome often appear tightly linked and interdependent. While many epigenetic changes are genetically determined, the converse is also true, with DNA sequence changes influenced by the presence of epigenetic marks. Understanding how the epigenome, genome and environment interact with one another is therefore an essential step in explaining the broader evolutionary consequences of epigenomic variation. Drawing on results from experimental and comparative studies carried out in diverse plant and animal species, we synthesize our current understanding of how these factors interact to shape phenotypic variation in natural populations, with a focus on identifying similarities and differences between taxonomic groups. We describe the main components of the epigenome and how they vary within and between taxa. We review how variation in the epigenome interacts with genetic features and environmental determinants, with a focus on the role of transposable elements (TEs) in integrating the epigenome, genome and environment. And we look at recent studies investigating the functional and evolutionary consequences of these interactions. Although epigenetic differentiation in nature is likely often a result of drift or selection on stochastic epimutations, there is growing evidence that a significant fraction of it can be stably inherited and could therefore contribute to evolution independently of genetic change.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    短枝草种已被选为草类作物功能基因组学的模型植物,并阐明单子叶植物中异源多倍体和多年性的起源,由于它们的基因组尺寸小和培养的可行性。然而,二倍体或多倍体Brachypodium谱系之间的基因组大小差异很大。我们已经使用基因组略读测序数据来揭示成分,丰度,和44个主要的Brachypodium谱系和细胞类型代表中重复元素的系统发育值。我们还旨在测试“多倍体基因组休克假说”(PGSH)在Brachypodium异源多倍体重复和基因组大小变异的三种不同进化情景下的可能机制和后果。我们的数据表明,在Brachypodium物种中,重复组覆盖的基因组比例在墨西哥芽孢杆菌4x的最高含量(67.97%)和最低的B.stacei-2x(20.77%)之间显示出3.3倍的差异,基因组大小的变化是重复元素得失的结果。LTR-Retand和Tekay逆转录转座子是Brachypodium基因组中最常见的重复元件,而食人魔反转录转座子只在墨西哥芽孢杆菌中发现。重复组系统发育网络显示出与质体和核rDNA以及转录组树的高度拓扑一致性,将祖先的外核谱系与最近进化的核心多年生谱系区分开来。5SrDNA图拓扑与Brachypodium多倍体的倍性水平和亚基因组的性质具有很强的匹配性。核心多年生B.sylvaticum具有很大的重复性和潜在的多倍体后二倍体起源的特征。我们的研究证明,重复组中的扩张和收缩是对PGSH的三种相反反应的原因。祖先异源四倍体墨西哥芽孢杆菌的基因组扩展加剧是TEs而不是WGD的全染色体增殖的结果,稳定后WGD基因组进化的年轻异源四倍体杂种芽孢杆菌的加性重复序列模式,和最近多年生核心多倍体的基因组学(B.pinnatum,phoenicoides)通过重组这些高度杂交的谱系而重复丢失。我们的分析有助于揭示Brachypodium草模型中重复组的进化和基因组大小变化。
    Brachypodium grass species have been selected as model plants for functional genomics of grass crops, and to elucidate the origins of allopolyploidy and perenniality in monocots, due to their small genome sizes and feasibility of cultivation. However, genome sizes differ greatly between diploid or polyploid Brachypodium lineages. We have used genome skimming sequencing data to uncover the composition, abundance, and phylogenetic value of repetitive elements in 44 representatives of the major Brachypodium lineages and cytotypes. We also aimed to test the possible mechanisms and consequences of the \"polyploid genome shock hypothesis\" (PGSH) under three different evolutionary scenarios of variation in repeats and genome sizes of Brachypodium allopolyploids. Our data indicated that the proportion of the genome covered by the repeatome in the Brachypodium species showed a 3.3-fold difference between the highest content of B. mexicanum-4x (67.97%) and the lowest of B. stacei-2x (20.77%), and that changes in the sizes of their genomes were a consequence of gains or losses in their repeat elements. LTR-Retand and Tekay retrotransposons were the most frequent repeat elements in the Brachypodium genomes, while Ogre retrotransposons were found exclusively in B. mexicanum. The repeatome phylogenetic network showed a high topological congruence with plastome and nuclear rDNA and transcriptome trees, differentiating the ancestral outcore lineages from the recently evolved core-perennial lineages. The 5S rDNA graph topologies had a strong match with the ploidy levels and nature of the subgenomes of the Brachypodium polyploids. The core-perennial B. sylvaticum presents a large repeatome and characteristics of a potential post-polyploid diploidized origin. Our study evidenced that expansions and contractions in the repeatome were responsible for the three contrasting responses to the PGSH. The exacerbated genome expansion of the ancestral allotetraploid B. mexicanum was a consequence of chromosome-wide proliferation of TEs and not of WGD, the additive repeatome pattern of young allotetraploid B. hybridum of stabilized post-WGD genome evolution, and the genomecontraction of recent core-perennials polyploids (B. pinnatum, B. phoenicoides) of repeat losses through recombination of these highly hybridizing lineages. Our analyses have contributed to unraveling the evolution of the repeatome and the genome size variation in model Brachypodium grasses.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号