archaeogenetics

古细菌遗传学
  • 文章类型: Journal Article
    我们对人类进化史的认识已被古基因组学大大推进。自2020年以来,古代DNA的研究越来越集中在重建最近的过去。然而,在人口统计复杂性增加和遗传分化减少的情况下,古基因组学方法在解决历史和考古重要性问题方面的准确性仍然是一个悬而未决的问题。我们评估了两种常用方法的性能和行为,qpAdm和f3统计量,关于人口统计模型和数据条件多样性下的混合推断。我们进行了两种互补的模拟方法-首先在四个简单的人口统计模型下探索广泛的人口统计参数空间,这些模型具有不同的复杂性和配置,使用来自两个染色体的分支长度数据-其次,我们分析了一个由59个种群组成的欧亚历史模型,使用全基因组数据,这些数据是用古老的DNA条件(如SNP确定)修改的,数据缺失,和伪单倍体化。我们观察到人口分化是驱动qpAdm表现的主要因素。值得注意的是,虽然复杂的基因流历史会影响哪些模型被归类为合理的,它们不会降低整体性能。在反映历史时期的条件下,qpAdm最频繁地将真实模型识别为在一小组密切相关的群体中合理的。为了增加解决精细比例假设的效用,我们提供了一种启发式方法,用于进一步区分包含qpAdm模型P值和f3统计量的候选模型。最后,我们证明了使用全基因组分支长度f2统计量的qpAdm的性能显着增加,强调了改善人口统计学推断的潜力,这可以通过未来f统计估计的进步来实现。
    Our knowledge of human evolutionary history has been greatly advanced by paleogenomics. Since the 2020s, the study of ancient DNA has increasingly focused on reconstructing the recent past. However, the accuracy of paleogenomic methods in resolving questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation remains an open question. We evaluated the performance and behavior of two commonly used methods, qpAdm and the f3-statistic, on admixture inference under a diversity of demographic models and data conditions. We performed two complementary simulation approaches-firstly exploring a wide demographic parameter space under four simple demographic models of varying complexities and configurations using branch-length data from two chromosomes-and secondly, we analyzed a model of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudohaploidization. We observe that population differentiation is the primary factor driving qpAdm performance. Notably, while complex gene flow histories influence which models are classified as plausible, they do not reduce overall performance. Under conditions reflective of the historical period, qpAdm most frequently identifies the true model as plausible among a small candidate set of closely related populations. To increase the utility for resolving fine-scaled hypotheses, we provide a heuristic for further distinguishing between candidate models that incorporates qpAdm model P-values and f3-statistics. Finally, we demonstrate a significant performance increase for qpAdm using whole-genome branch-length f2-statistics, highlighting the potential for improved demographic inference that could be achieved with future advancements in f-statistic estimations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    罗马帝国的兴衰是一个社会政治过程,对人类历史产生了巨大影响。多瑙河中部是人口和文化运动的重要边界和十字路口。这里,我们提供了136个巴尔干个体的全基因组数据,这些个体可追溯到公元一千年。尽管广泛的军事化和文化影响,我们发现斜体血统的人几乎没有祖先的贡献。然而,我们追溯了帝国时期安纳托利亚血统的大规模涌入。在公元250年至550年之间,我们发现来自中欧/北欧和草原的移民,确认“野蛮人”的迁徙是由种族多样化的联盟推动的。罗马人控制结束后,我们检测到基因上与现代东欧讲斯拉夫语的人群相似的个体的大规模到来,他们贡献了巴尔干人祖先的30%-60%,代表移民时期欧洲任何地方最大的永久性人口变化之一。
    The rise and fall of the Roman Empire was a socio-political process with enormous ramifications for human history. The Middle Danube was a crucial frontier and a crossroads for population and cultural movement. Here, we present genome-wide data from 136 Balkan individuals dated to the 1st millennium CE. Despite extensive militarization and cultural influence, we find little ancestry contribution from peoples of Italic descent. However, we trace a large-scale influx of people of Anatolian ancestry during the Imperial period. Between ∼250 and 550 CE, we detect migrants with ancestry from Central/Northern Europe and the Steppe, confirming that \"barbarian\" migrations were propelled by ethnically diverse confederations. Following the end of Roman control, we detect the large-scale arrival of individuals who were genetically similar to modern Eastern European Slavic-speaking populations, who contributed 30%-60% of the ancestry of Balkan people, representing one of the largest permanent demographic changes anywhere in Europe during the Migration Period.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Preprint
    古基因组学扩展了我们对人类进化史的认识。自2020年以来,古代DNA的研究增加了对重建最近过去的关注。然而,在人口统计学复杂性增加和遗传分化减少的情况下,古基因组学方法在回答历史和考古重要性问题方面的准确性仍然是一个悬而未决的问题。我们使用了两种模拟方法来评估常用方法的局限性和行为,qpAdm和f3-统计量,关于混合推断。第一个是基于从不同复杂性和配置的四个简单人口统计学模型模拟的分支长度数据。第二个,使用全基因组数据对59个种群组成的欧亚历史进行分析,这些数据是用古老的DNA条件(如SNP确定)进行修改的,数据缺失,和伪单倍体化。我们表明,在类似历史人口的条件下,qpAdm可以识别与它们密切相关的真实来源和群体的小候选集。然而,在典型的古代DNA条件下,qpAdm无法进一步区分它们,限制了其解决精细假设的效用。值得注意的是,我们发现,复杂的基因流历史通常会导致qpAdm性能的改善,并且在混合物权重的估计中没有偏差。我们为混合推断提供了一种启发式方法,该方法结合了混合权重估计和qpAdm模型的P值,和f3-统计信息,以增强区分多个似是而非的候选人的能力。最后,我们通过全基因组分支长度F2统计来强调qpAdm的未来潜力,证明了人口统计学推断的改进,这可以通过f统计量估计的改进来实现。
    Paleogenomics has expanded our knowledge of human evolutionary history. Since the 2020s, the study of ancient DNA has increased its focus on reconstructing the recent past. However, the accuracy of paleogenomic methods in answering questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation within the historical period remains an open question. We used two simulation approaches to evaluate the limitations and behavior of commonly used methods, qpAdm and the f3-statistic, on admixture inference. The first is based on branch-length data simulated from four simple demographic models of varying complexities and configurations. The second, an analysis of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudo-haploidization. We show that under conditions resembling historical populations, qpAdm can identify a small candidate set of true sources and populations closely related to them. However, in typical ancient DNA conditions, qpAdm is unable to further distinguish between them, limiting its utility for resolving fine-scaled hypotheses. Notably, we find that complex gene-flow histories generally lead to improvements in the performance of qpAdm and observe no bias in the estimation of admixture weights. We offer a heuristic for admixture inference that incorporates admixture weight estimate and P-values of qpAdm models, and f3-statistics to enhance the power to distinguish between multiple plausible candidates. Finally, we highlight the future potential of qpAdm through whole-genome branch-length f2-statistics, demonstrating the improved demographic inference that could be achieved with advancements in f-statistic estimations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    现代和古代基因组不一定来自同质种群,因为它们可能是从不同的地方和不同的时间收集的。这种异质采样可能是人口统计推断的问题,如果没有适当考虑,会导致有偏差的人口统计参数和不正确的模型选择。当明确说明时,它可能导致非常复杂的模型和难以分析的高数据维度。在本文中,我们正式研究了这种时空抽样异质性对人口统计推断的影响,我们介绍了一种规避这个问题的方法。为了在不增加站点频谱(SFS)维数的情况下处理结构化样本,我们在现有程序fastsimcoal2中引入了一种新的结构化方法。我们使用模拟和现代人类基因组数据评估了这种方法更新的效率和相关性。我们特别关注时空异质性,以证明这种新的基于SFS的方法的兴趣,这在处理分散和古老的DNA样本时特别有用,如在保护遗传学或古细菌遗传学中。
    Modern and ancient genomes are not necessarily drawn from homogeneous populations, as they may have been collected from different places and at different times. This heterogeneous sampling can be an issue for demographic inferences and results in biased demographic parameters and incorrect model choice if not properly considered. When explicitly accounted for, it can result in very complex models and high data dimensionality that are difficult to analyse. In this paper, we formally study the impact of such spatial and temporal sampling heterogeneity on demographic inference, and we introduce a way to circumvent this problem. To deal with structured samples without increasing the dimensionality of the site frequency spectrum (SFS), we introduce a new structured approach to the existing program fastsimcoal2. We assess the efficiency and relevance of this methodological update with simulated and modern human genomic data. We particularly focus on spatial and temporal heterogeneities to evidence the interest of this new SFS-based approach, which can be especially useful when handling scattered and ancient DNA samples, as in conservation genetics or archaeogenetics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这篇评论的重点是萨赫勒/萨凡纳腰带,非洲的一个大地区,有两种替代生存系统(畜牧业和农业),如今,interactive.这是一个长期存在的问题,在牛开始传播到非洲之后,牧民是否在这里与其他人口隔离开来(〜8千年前,kya)或,更确切地说,开始与其他人群融合,比如农牧民,驯化后的高粱和珍珠小米(~5kya)和随后的农业传播。如果我们看乳糖酶的持久性,与田园生活方式密切相关的特征,我们看到,它的变体在当前的牧民区分他们从他们的农民邻居。大多数其他(大多数是中性的)遗传多态性没有,然而,表明这些群体之间的明显区别;它们表明了共同的起源和/或广泛的基因流。两种生存系统之间的遗传亲和力和生态共生可以帮助我们更好地了解该非洲地区的人口历史。在这次审查中,我们表明,在当地人口中正确收集的现代萨赫勒/萨凡纳带种群的基因组数据集可以补充该地区仍然不足的考古研究,特别是在处理具有易腐物质文化的流动人口的史前史时,因此考古知名度不稳定。
    This review focuses on the Sahel/Savannah belt, a large region of Africa where two alternative subsistence systems (pastoralism and agriculture), nowadays, interact. It is a long-standing question whether the pastoralists became isolated here from other populations after cattle began to spread into Africa (~8 thousand years ago, kya) or, rather, began to merge with other populations, such as agropastoralists, after the domestication of sorghum and pearl millet (~5 kya) and with the subsequent spread of agriculture. If we look at lactase persistence, a trait closely associated with pastoral lifestyle, we see that its variants in current pastoralists distinguish them from their farmer neighbours. Most other (mostly neutral) genetic polymorphisms do not, however, indicate such clear differentiation between these groups; they suggest a common origin and/or an extensive gene flow. Genetic affinity and ecological symbiosis between the two subsistence systems can help us better understand the population history of this African region. In this review, we show that genomic datasets of modern Sahel/Savannah belt populations properly collected in local populations can complement the still insufficient archaeological research of this region, especially when dealing with the prehistory of mobile populations with perishable material culture and therefore precarious archaeological visibility.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    古代DNA(aDNA)研究始于1984年,此后极大地扩展了我们对进化和迁移的理解。今天,DNA分析是用来解决有关人类起源的各种难题,迁移模式,以及传染病的传播。从识别人类家族中的新分支到研究已灭绝的动植物的基因组,这些令人难以置信的发现最近让世界大吃一惊。然而,仔细观察这些公布的结果,就会发现全球北方和全球南方存在明显的鸿沟。因此,通过这项研究,我们的目标是强调鼓励更好的合作机会和技术转让,以支持全球南方的研究人员。Further,本研究还侧重于通过报道世界各地发表的相关文献并讨论该领域的进步和挑战,扩大aDNA领域正在进行的对话的范围。
    Ancient DNA (aDNA) research first began in 1984 and ever since has greatly expanded our understanding of evolution and migration. Today, aDNA analysis is used to solve various puzzles about the origin of mankind, migration patterns, and the spread of infectious diseases. The incredible findings ranging from identifying the new branches within the human family to studying the genomes of extinct flora and fauna have caught the world by surprise in recent times. However, a closer look at these published results points out a clear Global North and Global South divide. Therefore, through this research, we aim to emphasize encouraging better collaborative opportunities and technology transfer to support researchers in the Global South. Further, the present research also focuses on expanding the scope of the ongoing conversation in the field of aDNA by reporting relevant literature published around the world and discussing the advancements and challenges in the field.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    检索高质量的内源性古DNA(aDNA)提出了几个挑战,包括低分子拷贝数,高碎片率,读取终端处的损坏,和潜在的外源污染物DNA的存在。所有这些因素使来自高通量测序平台的读取中的共有aDNA序列的可靠重建复杂化。这里,我们报告了对旨在克服这些问题并构建高质量古代有丝分裂基因组的两种替代工具(ANGSD和schmutzi)进行全面评估的结果.原始基因组数据(BAM/FASTQ)来自17个先前发表的整个古代人类基因组,范围从公元前14年到7年,并使用不同质量的过滤器重建线粒体共有序列。与他们的精度测量和比较。此外,不同序列参数的影响(读取次数,测序碱基,平均覆盖率,以及脱氨和污染率)作为衍生序列质量的预测因子进行了评估。所有古代样本都成功重建了完整的有丝分裂基因组,对于他们中的大多数人来说,过滤大大提高了mtDNA共识调用和单倍群预测。总的来说,schmutzi管道,它估计并考虑到外源污染,在中等至高覆盖率样品(>1,000,000个读数)中,似乎具有快得多且用户友好的替代方法(ANGSD)的优势。ANGSD,然而,通过它的读取终端微调过滤器,在从低质量样品中调用共有序列方面表现出更好的能力。在检查的所有总体样本质量的预测因子中,对于序列读数和碱基的可用数量,发现最强的相关性.在这个过程中,我们报告了来自南安纳托利亚/北黎凡特的早期石器病个体的先前未分配的单倍群(U3b)。
    Retrieving high-quality endogenous ancient DNA (aDNA) poses several challenges, including low molecular copy number, high rates of fragmentation, damage at read termini, and potential presence of exogenous contaminant DNA. All these factors complicate a reliable reconstruction of consensus aDNA sequences in reads from high-throughput sequencing platforms. Here, we report findings from a thorough evaluation of two alternative tools (ANGSD and schmutzi) aimed at overcoming these issues and constructing high-quality ancient mitogenomes. Raw genomic data (BAM/FASTQ) from a total of 17 previously published whole ancient human genomes ranging from the 14th to the 7th millennium BCE were retrieved and mitochondrial consensus sequences were reconstructed using different quality filters, with their accuracy measured and compared. Moreover, the influence of different sequence parameters (number of reads, sequenced bases, mean coverage, and rate of deamination and contamination) as predictors of derived sequence quality was evaluated. Complete mitogenomes were successfully reconstructed for all ancient samples, and for the majority of them, filtering substantially improved mtDNA consensus calling and haplogroup prediction. Overall, the schmutzi pipeline, which estimates and takes into consideration exogenous contamination, appeared to have the edge over the much faster and user-friendly alternative method (ANGSD) in moderate to high coverage samples (>1,000,000 reads). ANGSD, however, through its read termini trimming filter, showed better capabilities in calling the consensus sequence from low-quality samples. Among all the predictors of overall sample quality examined, the strongest correlation was found for the available number of sequence reads and bases. In the process, we report a previously unassigned haplogroup (U3b) for an Early Chalcolithic individual from Southern Anatolia/Northern Levant.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    我们介绍了一种新的种群遗传方法,适用于对种群的起源和关系进行建模,使用新的计算方法分析汞频率分布。Hgs被选入群体亚群中显示相关频率的群体,基于在古代分离中建立相关性的假设,迁移和混合过程。人口是用这个通用汞数据库定义的,然后使用无监督的人工智能,中心矢量(CV)是根据多维点系统中Hg分布矢量的局部冷凝确定的。根据群体与CV的接近度进行聚类。我们表明,CV可以被视为古代种群的近似值,而真实种群可以使用基于权重梯度搜索的新线性组合算法建模为CV的加权线性组合。该方法的有效性是通过将喀尔巴厘盆地的铜时代人口与中世纪人口和现代匈牙利人进行比较来证明的。我们的分析揭示了自中世纪以来显著的人口连续性,以及自铜器时代以来基材成分的存在。
    We introduce a novel population genetic approach suitable to model the origin and relationships of populations, using new computation methods analyzing Hg frequency distributions. Hgs were selected into groups which show correlated frequencies in subsets of populations, based on the assumption that correlations were established in ancient separation, migration and admixture processes. Populations are defined with this universal Hg database, then using unsupervised artificial intelligence, central vectors (CVs) are determined from local condensations of the Hg-distribution vectors in the multidimensional point system. Populations are clustered according to their proximity to CVs. We show that CVs can be regarded as approximations of ancient populations and real populations can be modeled as weighted linear combinations of the CVs using a new linear combination algorithm based on a gradient search for the weights. The efficacy of the method is demonstrated by comparing Copper Age populations of the Carpathian Basin to Middle Age ones and modern Hungarians. Our analysis reveals significant population continuity since the Middle Ages, and the presence of a substrate component since the Copper Age.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Historical Article
    这里,我们报告了从新石器时代晚期到青铜时代晚期的110个古代近东个体的全基因组数据分析,一个以近东地区间激烈互动为特征的时期。我们发现,北/中安纳托利亚和南高加索地区的公元前六千年人口在新石器时代形成的遗传序列上具有混合血统,该遗传序列在新石器时代形成于今天的南高加索/扎格罗斯地区之间。在晚期石器时代和/或早期青铜时代,超过一半的北黎凡特基因库被替换,在安纳托利亚和南高加索的其他地区,我们记录了只有瞬时基因流的遗传连续性。此外,我们揭示了青铜时代晚期北黎凡特地区一个遗传上独特的个体。总的来说,我们的研究揭示了人口动态随时间变化的多个尺度,从新石器时代的广泛混合到青铜时代晚期的全球化社会中的远距离流动。视频摘要.
    Here, we report genome-wide data analyses from 110 ancient Near Eastern individuals spanning the Late Neolithic to Late Bronze Age, a period characterized by intense interregional interactions for the Near East. We find that 6th millennium BCE populations of North/Central Anatolia and the Southern Caucasus shared mixed ancestry on a genetic cline that formed during the Neolithic between Western Anatolia and regions in today\'s Southern Caucasus/Zagros. During the Late Chalcolithic and/or the Early Bronze Age, more than half of the Northern Levantine gene pool was replaced, while in the rest of Anatolia and the Southern Caucasus, we document genetic continuity with only transient gene flow. Additionally, we reveal a genetically distinct individual within the Late Bronze Age Northern Levant. Overall, our study uncovers multiple scales of population dynamics through time, from extensive admixture during the Neolithic period to long-distance mobility within the globalized societies of the Late Bronze Age. VIDEO ABSTRACT.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Comparative Study
    High-throughput DNA sequencing enables large-scale metagenomic analyses of complex biological systems. Such analyses are not restricted to present-day samples and can also be applied to molecular data from archaeological remains. Investigations of ancient microbes can provide valuable information on past bacterial commensals and pathogens, but their molecular detection remains a challenge. Here, we present HOPS (Heuristic Operations for Pathogen Screening), an automated bacterial screening pipeline for ancient DNA sequences that provides detailed information on species identification and authenticity. HOPS is a versatile tool for high-throughput screening of DNA from archaeological material to identify candidates for genome-level analyses.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号