Target sequence capture

靶序列捕获
  • 文章类型: Journal Article
    目的:支持丰富和独特的新喀里多尼亚植物区系的地理起源和进化机制仍然知之甚少。这归因于该岛复杂的地质历史以及缺乏解决良好的物种级系统发育。这里,我们推断新喀里多尼亚棕榈树的系统发育关系和发散时间,包括40种。我们使用此框架来阐明新喀里多尼亚棕榈谱系的生物地理学,并探索现存物种如何形成。
    方法:从通过靶向测序获得的151个核基因中,推断了一个包括37个新喀里多尼亚棕榈物种和77个近缘种的系统进化树。估算了化石校准的发散时间,并推断了祖先的范围。就海拔而言,祖先和现存的生态偏好,比较了新喀里多尼亚姐妹物种之间的降水和底物,以探索它们作为物种形成驱动因素的可能作用。
    结果:新喀里多尼亚手掌形成四个支撑良好的进化枝,其中关系得到很好的解决。我们的结果支持当前的分类,但建议应将Veillonia和Campecarpus复活,并且无法澄清Rhopalostylidinae是姐妹还是嵌套在Basseliniinae中。新喀里多尼亚的棕榈血统来自新英格兰和澳大利亚的祖先,它通过始新世和上新世之间的至少三个独立的扩散事件到达了该岛。棕榈随后至少五次从新喀里多尼亚散出,主要是太平洋岛屿。与物种形成事件相关的地理和生态转变因时间和属而异。与较年轻的事件相比,底物转换与较老的事件更频繁地相关。
    结论:邻近地区和当地栖息地的马赛克形成了新喀里多尼亚的棕榈植物区系,该岛在整个太平洋地区产生棕榈多样性方面发挥了重要作用。这种新的时空框架将使种群水平的生态和遗传研究能够解开支撑新喀里多尼亚棕榈特有的机制。
    OBJECTIVE: The geographical origin and evolutionary mechanisms underpinning the rich and distinctive New Caledonian flora remain poorly understood. This is attributable to the complex geological past of the island and to the scarcity of well-resolved species-level phylogenies. Here, we infer phylogenetic relationships and divergence times of New Caledonian palms, which comprise 40 species. We use this framework to elucidate the biogeography of New Caledonian palm lineages and to explore how extant species might have formed.
    METHODS: A phylogenetic tree including 37 New Caledonian palm species and 77 relatives from tribe Areceae was inferred from 151 nuclear genes obtained by targeted sequencing. Fossil-calibrated divergence times were estimated and ancestral ranges inferred. Ancestral and extant ecological preferences in terms of elevation, precipitation and substrate were compared between New Caledonian sister species to explore their possible roles as drivers of speciation.
    RESULTS: New Caledonian palms form four well-supported clades, inside which relationships are well resolved. Our results support the current classification but suggest that Veillonia and Campecarpus should be resurrected and fail to clarify whether Rhopalostylidinae is sister to or nested in Basseliniinae. New Caledonian palm lineages are derived from New Guinean and Australian ancestors, which reached the island through at least three independent dispersal events between the Eocene and Miocene. Palms then dispersed out of New Caledonia at least five times, mainly towards Pacific islands. Geographical and ecological transitions associated with speciation events differed across time and genera. Substrate transitions were more frequently associated with older events than with younger ones.
    CONCLUSIONS: Neighbouring areas and a mosaic of local habitats shaped the palm flora of New Caledonia, and the island played a significant role in generating palm diversity across the Pacific region. This new spatio-temporal framework will enable population-level ecological and genetic studies to unpick the mechanisms underpinning New Caledonian palm endemism.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Astereae(菊科)部落包括36个亚部落和252个属,分布在世界各地的温带和热带地区。其中一个亚部落,CelmisiinaeSaldivia,最近被限制为包括六个属和ca。160种,仅限于澳大利亚东部,新西兰,和新几内亚。该物种表现出令人印象深刻的生长习性,从小草药和类灌木到中等大小的树木。它们生活在广泛的栖息地中,通常在亚高山和高山植被中占主导地位。尽管Celmisiinae的限制得到了很好的支持,在属和种水平上,它们的内部关系和分类仍然存在不确定性。这项研究利用了高通量测序的最新进展,为Celmisiinae亚部落建立了强大的多基因系统发育。目标富集Angiosperms353诱饵集以及hybpiper-nf和paragone-nf管道用于检索,推断,并从75个分类单元中组装直系同源基因座,这些分类单元代表了该部落中所有主要的推定进化枝。由于Celmisiinae中的二倍体倍性水平,以及程序集中缺少的数据,围绕正交检测推断的不确定性仍然存在。然而,基于多种基因家族集合,基于合并和级联的系统发育重建恢复了相似的拓扑结构。基因家族中的类联和缺失数据引起了一些问题,但是估计的系统发育得到了很好的支持和解决。系统发育学证据支持Celmisiinae和三个主要进化枝:胸膜叶进化枝(胸膜叶,巨带和Damnamenia),大部分在新西兰亚南极群岛,新西兰和澳大利亚大陆的Celmisia,和新西兰的Shawia(包括“Oleariaparte”和Pachystegia),澳大利亚和巴布亚新几内亚。此处显示的结果增加了对Angiosperms353诱饵集的积累支持,该诱饵集作为记录植物多样性的有效方法。
    The tribe Astereae (Asteraceae) includes 36 subtribes and 252 genera, and is distributed worldwide in temperate and tropical regions. One of the subtribes, Celmisiinae Saldivia, has been recently circumscribed to include six genera and ca. 160 species, and is restricted to eastern Australia, New Zealand, and New Guinea. The species show an impressive range of growth habit, from small herbs and ericoid subshrubs to medium-sized trees. They live in a wide range of habitats and are often dominant in subalpine and alpine vegetation. Despite the well-supported circumscription of Celmisiinae, uncertainties have remained about their internal relationships and classification at genus and species levels. This study exploited recent advances in high-throughput sequencing to build a robust multi-gene phylogeny for the subtribe Celmisiinae. The target enrichment Angiosperms353 bait set and the hybpiper-nf and paragone-nf pipelines were used to retrieve, infer, and assemble orthologous loci from 75 taxa representing all the main putative clades within the subtribe. Because of the diploidised ploidy level in Celmisiinae, as well as missing data in the assemblies, uncertainty remains surrounding the inference of orthology detection. However, based on a variety of gene-family sets, coalescent and concatenation-based phylogenetic reconstructions recovered similar topologies. Paralogy and missing data in the gene-families caused some problems, but the estimated phylogenies were well-supported and well-resolved. The phylogenomic evidence supported Celmisiinae and three main clades: the Pleurophyllum clade (Pleurophyllum, Macrolearia and Damnamenia), mostly in the New Zealand Subantarctic Islands, Celmisia of mainland New Zealand and Australia, and Shawia (including \'Olearia pro parte\' and Pachystegia) of New Zealand, Australia and New Guinea. The results presented here add to the accumulating support for the Angiosperms353 bait set as an efficient method for documenting plant diversity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Eugenia是开花植物在分类学上最具挑战性的谱系之一,在过去的几年中,由于最近基于分子数据的系统发育研究,形态定界发生了变化。努力,直到现在,仅限于主要是质体标记的Sanger测序。这些系统发育研究表明,有11个进化枝形式化为完整的群体。然而,这些进化枝之间的关系在关键节点上支持不力,研究之间不一致,特别是在骨干和Eugenia教派内。伞形涵盖约。700种。为了解决和更好地理解系统的不一致,使用353个低拷贝核基因对54个Eugenia分类群进行了系统基因组Hyb-Seq。使用合并和串联方法回收了基于核和质体数据集的编码和非编码基因座的20种树。通过比较树木景观来评估一致和冲突的拓扑结构,拓扑测试,以及基因和位点一致性因子。除了核和质体数据集之间的拓扑结构是相似的。合并树更好地适应内含子数据集中的差异,其中包含更多的简约信息网站,当级联树恢复更保守的拓扑时,因为它们在树木景观中的分布较窄。这表明先前研究中确定的高度支持的系统发育关系并不一定表明压倒性的一致信号。必须仔细解释一致性,尤其是在连接的数据集中。尽管如此,在这里发现的多物种合并(MSC)方法和级联树拓扑之间的一致性是值得注意的。我们的分析不支持Eugeniasubg。假的或教派。卵磷脂,根据目前的限制,建议进行必要的分类学重新评估。在Eugenia教派中进一步讨论了五个分支。伞形朝着将其划分为可行的分支的方向发展。虽然靶向测序提供了大量的数据,提高了Eugenia的系统发育分辨率,Eugenia教派仍然存在不确定性。雨伞。在Eugenia的主链中,位点系数因子(CF)比基因CF高的一般模式表明有限信号的随机误差。树木景观与一致性因子得分相结合,正如在这里实施的,提供了一个综合的方法,结合了几个系统发育假设。我们相信这里采用的协议将用于未来对桃金娘科进化史的研究。
    Eugenia is one of the most taxonomically challenging lineages of flowering plants, in which morphological delimitation has changed over the last few years resulting from recent phylogenetic study based on molecular data. Efforts, until now, have been limited to Sanger sequencing of mostly plastid markers. These phylogenetic studies indicate 11 clades formalized as infrageneric groups. However, relationships among these clades are poorly supported at key nodes and inconsistent between studies, particularly along the backbone and within Eugenia sect. Umbellatae encompasses ca. 700 species. To resolve and better understand systematic discordance, 54 Eugenia taxa were subjected to phylogenomic Hyb-Seq using 353 low-copy nuclear genes. Twenty species trees based on coding and non-coding loci of nuclear and plastid datasets were recovered using coalescent and concatenated approaches. Concordant and conflicting topologies were assessed by comparing tree landscapes, topology tests, and gene and site concordance factors. The topologies are similar except between nuclear and plastid datasets. The coalescent trees better accommodate disparity in the intron dataset, which contains more parsimony informative sites, while concatenated trees recover more conservative topologies, as they have narrower distribution in the tree landscape. This suggests that highly supported phylogenetic relationships determined in previous studies do not necessarily indicate overwhelming concordant signal. Congruence must be interpreted carefully especially in concatenated datasets. Despite this, the congruence between the multi-species coalescent (MSC) approach and concatenated tree topologies found here is notable. Our analysis does not support Eugenia subg. Pseudeugenia or sect. Pilothecium, as currently circumscribed, suggesting necessary taxonomic reassessment. Five clades are further discussed within Eugenia sect. Umbellatae progress toward its division into workable clades. While targeted sequencing provides a massive quantity of data that improves phylogenetic resolution in Eugenia, uncertainty still remains in Eugenia sect. Umbellatae. The general pattern of higher site coefficient factor (CF) than gene CF in the backbone of Eugenia suggests stochastic error from limited signal. Tree landscapes in combination with concordance factor scores, as implemented here, provide a comprehensive approach that incorporates several phylogenetic hypotheses. We believe the protocols employed here will be of use for future investigations on the evolutionary history of Myrtaceae.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    短读全基因组测序(WGS)的日益普及为研究生态和进化过程提供了前所未有的机会。尽管感兴趣的基因座可以从WGS数据中提取并与靶序列数据组合,这需要合适的生物信息工作流程。这里,我们测试了不同的组装和轨迹提取策略,并将它们实现到Secapr中,将短读数据处理为多基因座比对以进行系统遗传学和分子生态学分析的管道。我们将低覆盖率WGS(<30×)和目标序列捕获的数据处理集成到一个灵活的框架中,同时优化从头重叠群组装和基因座提取。具体来说,我们测试不同的组装策略,通过对比它们从靶向蝴蝶蛋白编码基因中恢复基因座的能力,使用四个数据集:跨不同平均覆盖范围的WGS数据集(10倍,5×和2×)和在通过靶序列捕获测序之前富集这些基因座的数据集。使用产生的denovo重叠群,我们考虑了重叠群中的潜在错误,并推断了系统发育树,以评估每种组装策略恢复物种关系的能力。我们证明,同时选择多个大小的kmer进行组装会导致从头组装的重叠群中提取的基因座的产量最高,而来自测序读取深度低至5倍的数据集可以恢复系统发育树中的预期物种关系。通过使经过测试的装配方法在secapr管道中可用,我们希望激发未来的研究,以纳入补充数据,并对最佳装配策略做出明智的选择。
    The increasing availability of short-read whole genome sequencing (WGS) provides unprecedented opportunities to study ecological and evolutionary processes. Although loci of interest can be extracted from WGS data and combined with target sequence data, this requires suitable bioinformatic workflows. Here, we test different assembly and locus extraction strategies and implement them into secapr, a pipeline that processes short-read data into multilocus alignments for phylogenetics and molecular ecology analyses. We integrate the processing of data from low-coverage WGS (<30×) and target sequence capture into a flexible framework, while optimizing de novo contig assembly and loci extraction. Specifically, we test different assembly strategies by contrasting their ability to recover loci from targeted butterfly protein-coding genes, using four data sets: a WGS data set across different average coverages (10×, 5× and 2×) and a data set for which these loci were enriched prior to sequencing via target sequence capture. Using the resulting de novo contigs, we account for potential errors within contigs and infer phylogenetic trees to evaluate the ability of each assembly strategy to recover species relationships. We demonstrate that choosing multiple sizes of kmer simultaneously for assembly results in the highest yield of extracted loci from de novo assembled contigs, while data sets derived from sequencing read depths as low as 5× recovers the expected species relationships in phylogenetic trees. By making the tested assembly approaches available in the secapr pipeline, we hope to inspire future studies to incorporate complementary data and make an informed choice on the optimal assembly strategy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    胡萝卜科(Apiaceae)包括466属,其中包括许多著名的作物(例如,茴香,caraway,胡萝卜,芹菜,香菜,孜然,dill,茴香,欧芹,和欧洲防风草)。亚科之间的高级系统发育关系,部落,和其他主要的矢车菊科没有完全解决。本研究旨在解决这一重要的知识差距。
    使用通用Angiosperms353探针集捕获目标序列,以检查234个天麻属的系统发育关系,代表所有四个目前公认的亚家族(Apioideae,亚速藻,Mackinlayoideae,和Saniculoideae)。使用多物种聚结和串联方法分析恢复的核基因。
    我们甚至从古老而劣质的植物标本室标本中回收了数百个核基因。特别值得注意的是,我们在大力支持下放置了三个incertaesedis属(Platysace,Klotzchia,和Hermas);所有三个人都占据孤立的位置,与Platysace决定成为所有剩余的伞形科的姐妹。我们放置了九个属(Apodicarpum,Bonannia,Grafia,单倍体,Microsciavium,Physotrichia,产前检查,Tricholaser,Xatardia)以前从未被包括在任何分子系统发育研究中。
    我们提供支持,以维持现有的四个木科的亚科,在认识到Hermas的同时,Klotzschia,并且Platysace进化枝可能每个都需要容纳在其他子家庭中(有待改进的采样)。可以通过扩展Saniculoideae亚科来适应当前的Apioid属的放置,尽管尚未定义此分组的足够形态突触。这是第一个使用高通量测序方法对伞形科进行的系统发育研究,代表了该群体前所未有的进化框架。
    The carrot family (Apiaceae) comprises 466 genera, which include many well-known crops (e.g., aniseed, caraway, carrots, celery, coriander, cumin, dill, fennel, parsley, and parsnips). Higher-level phylogenetic relationships among subfamilies, tribes, and other major clades of Apiaceae are not fully resolved. This study aims to address this important knowledge gap.
    Target sequence capture with the universal Angiosperms353 probe set was used to examine phylogenetic relationships in 234 genera of Apiaceae, representing all four currently recognized subfamilies (Apioideae, Azorelloideae, Mackinlayoideae, and Saniculoideae). Recovered nuclear genes were analyzed using both multispecies coalescent and concatenation approaches.
    We recovered hundreds of nuclear genes even from old and poor-quality herbarium specimens. Of particular note, we placed with strong support three incertae sedis genera (Platysace, Klotzchia, and Hermas); all three occupy isolated positions, with Platysace resolved as sister to all remaining Apiaceae. We placed nine genera (Apodicarpum, Bonannia, Grafia, Haplosciadium, Microsciadium, Physotrichia, Ptychotis, Tricholaser, Xatardia) that have never previously been included in any molecular phylogenetic study.
    We provide support for the maintenance of the four existing subfamilies of Apiaceae, while recognizing that Hermas, Klotzschia, and the Platysace clade may each need to be accommodated in additional subfamilies (pending improved sampling). The placement of the currently apioid genus Phlyctidocarpa can be accommodated by the expansion of subfamily Saniculoideae, although adequate morphological synapomorphies for this grouping are yet to be defined. This is the first phylogenetic study of the Apiaceae using high-throughput sequencing methods and represents an unprecedented evolutionary framework for the group.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    解决Commelinales命令内的关系提出了相当大的挑战,这反映在其不稳定的家族分类中。因此,我们调查了(1)Commelinales家族和属之间的关系;(2)从未测序过的属的系统发育位置;(3)脱靶质体数据如何与其他基于质体的数据集整合;(4)新推断与家族分类如何一致。
    我们使用Angiosperms353探针集通过靶序列捕获生成了两个大数据集(核和质体),具有从公开可用的转录组和完整质体中开采的其他序列。考虑了第三个扩展质体数据集,包括所有在公共存储库中有序列的物种。在多物种合并框架下从单个基因树推断树种,并使用来自串联和分区数据的最大似然分析推断树种。
    核,质体,扩展质体数据集包括52、53和58属,分别,和多达290种Commelinales,代表了迄今为止最全面的分子采样,其中包括七个从未测序过的属。
    我们推断了支持Commelinales及其五个组成家族的单系的强大系统发育,我们发现了龙舌兰科-血球科和汉瓜科-Commelinaceae,如先前报道。Philydraceae的放置仍然存在争议。两个最大家庭的关系,Commelinaceae和血球科,已解决。基于后面的结果,我们确认了血球科的亚家族分类,并提出了一种新的Commelinaceae分类,其中包括Commelina中Tapheocarpa的同义词。
    Resolving relationships within order Commelinales has posed quite a challenge, as reflected in its unstable infra-familial classification. Thus, we investigated (1) relationships across families and genera of Commelinales; (2) phylogenetic placement of never-before sequenced genera; (3) how well off-target plastid data integrate with other plastid-based data sets; and (4) how the novel inferences coincide with the infra-familial classification.
    We generated two large data sets (nuclear and plastome) by means of target sequence capture using the Angiosperms353 probe set, with additional sequences mined from publicly available transcriptomes and full plastomes. A third extended-plastid data set was considered, including all species with sequences in public repositories. Species trees were inferred under a multispecies coalescent framework from individual gene trees and also using maximum likelihood analyses from concatenated and partitioned data.
    The nuclear, plastome, and extended-plastid data sets include 52, 53, and 58 genera, respectively, and up to 290 species of Commelinales, representing the most comprehensive molecular sampling for the order to date, which includes seven never-before sequenced genera.
    We inferred robust phylogenies supporting the monophyly of Commelinales and its five constituent families, and we recovered the clades Pontederiaceae-Haemodoraceae and Hanguanaceae-Commelinaceae, as previously reported. The placement of Philydraceae remains contentious. Relationships within the two largest families, Commelinaceae and Haemodoraceae, are resolved. Based on the latter results, we confirm the subfamilial classification of Haemodoraceae and propose a new classification for Commelinaceae, which includes the synonymization of Tapheocarpa in Commelina.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Comprising five families that vastly differ in species richness-ranging from Gelsemiaceae with 13 species to the Rubiaceae with 13,775 species-members of the Gentianales are often among the most species-rich and abundant plants in tropical forests. Despite considerable phylogenetic work within particular families and genera, several alternative topologies for family-level relationships within Gentianales have been presented in previous studies.
    Here we present a phylogenomic analysis based on nuclear genes targeted by the Angiosperms353 probe set for approximately 150 species, representing all families and approximately 85% of the formally recognized tribes. We were able to retrieve partial plastomes from off-target reads for most taxa and infer phylogenetic trees for comparison with the nuclear-derived trees.
    We recovered high support for over 80% of all nodes. The plastid and nuclear data are largely in agreement, except for some weakly to moderately supported relationships. We discuss the implications of our results for the order\'s classification, highlighting points of increased support for previously uncertain relationships. Rubiaceae is sister to a clade comprising (Gentianaceae + Gelsemiaceae) + (Apocynaceae + Loganiaceae).
    The higher-level phylogenetic relationships within Gentianales are confidently resolved. In contrast to recent studies, our results support the division of Rubiaceae into two subfamilies: Cinchonoideae and Rubioideae. We do not formally recognize Coptosapelteae and Luculieae within any particular subfamily but treat them as incertae sedis. Our framework paves the way for further work on the phylogenetics, biogeography, morphological evolution, and macroecology of this important group of flowering plants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    生命之树是导航地球上生命的进化和属性的基本生物路线图,但在很大程度上仍然未知。甚至被子植物(开花植物)也充满了数据空白,尽管它们在维持陆地生命方面发挥了关键作用。今天,高通量测序有望大大加深我们对进化关系的理解.这里,我们描述了一个全面的系统基因组平台,用于探索被子植物生命树,包括一组开放工具和数据,这些工具和数据基于由通用Angiosperms353序列捕获探针靶向的353个核基因。本文的主要目标是(I)记录我们的方法,(Ii)描述我们的第一次数据发布,和(iii)提出了一个新颖的开放数据门户,KewTreeofLifeExplorer(https://treeoflife.kew.org)。我们的目标是为开花植物的所有属生成新的靶序列捕获数据,利用自然史收藏品,如植物标本室标本,并用挖掘的公共数据来增强它。我们的第一次数据发布,在这里描述,是迄今为止被子植物最广泛的核系统发育数据集,包含通过DNA条形码和系统发育测试验证的3099个样本,代表所有64个订单,404个家庭(96$\\%$)和2333个属(17$\\%$)。从数据中推断出“第一关”被子植物生命树,总共有824,878个序列,489,086,049个碱基对,和532,260列对齐柱,在KewTreeofLifeExplorer中进行交互式演示。这种树是使用严格的方法生成的,但在我们的经营规模上是可行的。尽管在分类单元和基因采样方面存在局限性,基因恢复,序列进化和旁系物的模型,树强烈支持现有的分类法,同时挑战订单之间的许多假设关系,并首次放置许多属。经过验证的数据集,物种树和所有中间体均可通过KewLifeExplorer树公开访问,并将在获得更多数据时进行更新。通过标准化核标记的系统测序,为所有开花植物物种的完整生命树的这一重要里程碑为被子植物系统发育组学的高度整合的未来打开了大门。我们的方法有可能成为对地球上所有生命的基因组进行测序的不断发展的运动与世界自然历史收藏的巨大系统基因组潜力之间急需的桥梁。[被子植物;被子植物;基因组学;草药组学;博物馆组学;核系统基因组学;开放获取;靶序列捕获;生命树。].
    The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this article are to (i) document our methods, (ii) describe our first data release, and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic data set for angiosperms to date, comprising 3099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96$\\%$) and 2333 genera (17$\\%$). A \"first pass\" angiosperm tree of life was inferred from the data, which totaled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated data set, species tree and all intermediates are openly accessible via the Kew Tree of Life Explorer and will be updated as further data become available. This major milestone toward a complete tree of life for all flowering plant species opens doors to a highly integrated future for angiosperm phylogenomics through the systematic sequencing of standardized nuclear markers. Our approach has the potential to serve as a much-needed bridge between the growing movement to sequence the genomes of all life on Earth and the vast phylogenomic potential of the world\'s natural history collections. [Angiosperms; Angiosperms353; genomics; herbariomics; museomics; nuclear phylogenomics; open access; target sequence capture; tree of life.].
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    Despite the global importance of tropical ecosystems, few studies have identified how natural selection has shaped their megadiversity. Here, we test for the role of adaptation in the evolutionary success of the widespread, highly abundant Neotropical palm Mauritia flexuosa. We used a genome scan framework, sampling 16,262 single-nucleotide polymorphisms (SNPs) with target sequence capture in 264 individuals from 22 populations in rainforest and savanna ecosystems. We identified outlier loci as well as signal of adaptation using Bayesian correlations of allele frequency with environmental variables and detected both selective sweeps and genetic hitchhiking events. Functional annotation of SNPs with selection footprints identified loci affecting genes related to adaptation to environmental stress, plant development, and primary metabolic processes. The strong differences in climatic and soil variables between ecosystems matched the high differentiation and low admixture in population Bayesian clustering. Further, we found only small differences in allele frequency distribution in loci putatively under selection among widespread populations from different ecosystems, with fixation of a single allele in most populations. Taken together, our results indicate that adaptive selective sweeps related to environmental stress shaped the spatial pattern of genetic diversity in M. flexuosa, leading to high similarity in allele frequency among populations from different ecosystems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    目标:直到最近,大多数蕨类植物的系统发育研究都是基于叶绿体基因。基于这些数据的进化推断可能是不完整的,因为字符来自单个链接组,并且是单亲继承的。这些限制在杂交研究中尤其严重,这在蕨类植物中很普遍;蕨类植物的杂种很常见,蕨类植物能够在高度分化的谱系中杂交,在一个记录在案的案例中,分歧长达6000万年。然而,尚不清楚这种杂交对蕨类植物的进化有什么影响,部分原因是缺乏可用的双亲遗传(核编码)标记。
    方法:我们设计了寡核苷酸诱饵来捕获25个靶向,来自24个物种样本的低拷贝核标记,跨越现存的蕨类植物多样性。
    结果:大多数基因座已成功测序。尽管诱饵是根据外显子(转录本)数据设计的,我们成功地捕获了内含子序列,这些序列应该有助于更集中的系统发育研究.我们提供了新的靶序列捕获数据的系统发育分析,并将其整合到以前的基于转录本的数据集中。
    结论:我们将我们的诱饵序列作为进一步研究蕨类植物系统发育的资源提供给群落。
    OBJECTIVE: Until recently, most phylogenetic studies of ferns were based on chloroplast genes. Evolutionary inferences based on these data can be incomplete because the characters are from a single linkage group and are uniparentally inherited. These limitations are particularly acute in studies of hybridization, which is prevalent in ferns; fern hybrids are common and ferns are able to hybridize across highly diverged lineages, up to 60 million years since divergence in one documented case. However, it not yet clear what effect such hybridization has on fern evolution, in part due to a paucity of available biparentally inherited (nuclear-encoded) markers.
    METHODS: We designed oligonucleotide baits to capture 25 targeted, low-copy nuclear markers from a sample of 24 species spanning extant fern diversity.
    RESULTS: Most loci were successfully sequenced from most accessions. Although the baits were designed from exon (transcript) data, we successfully captured intron sequences that should be useful for more focused phylogenetic studies. We present phylogenetic analyses of the new target sequence capture data and integrate these into a previous transcript-based data set.
    CONCLUSIONS: We make our bait sequences available to the community as a resource for further studies of fern phylogeny.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号