genome-wide data

全基因组数据
  • 文章类型: Journal Article
    南岛语族的起源和传播,世界上最大和最普遍的,长期以来一直引起语言学家的注意,考古学家,和遗传学家。尽管人们越来越一致认为台湾是南岛语言传播的源头,对定居和离开台湾的早期南岛人的移民模式知之甚少,即“进入台湾”和“离开台湾”事件。特别是,台湾境内的遗传多样性和结构,以及这与台湾境内/境外事件的关系,主要是因为大多数基因组研究主要利用了来自台湾16个公认的高地南岛人群体中的两个的数据。在这项研究中,我们生成了迄今为止最大的台湾南岛人基因组数据集,包括六个高地团体和一个来自全岛的低地团体以及两个台湾汉族团体。我们在台湾鉴定了精细的基因组结构,推断南岛人祖先的祖先轮廓,并发现台湾南部南岛人与台湾以外的南岛人表现出过度的遗传亲和力。因此,我们的发现为台湾境内和境外的扩散提供了新的思路。
    The origin and dispersal of the Austronesian language family, one of the largest and most widespread in the world, have long attracted the attention of linguists, archaeologists, and geneticists. Even though there is a growing consensus that Taiwan is the source of the spread of Austronesian languages, little is known about the migration patterns of the early Austronesians who settled in and left Taiwan, i.e. the \"Into-Taiwan\" and \"out-of-Taiwan\" events. In particular, the genetic diversity and structure within Taiwan and how this relates to the into-/out-of-Taiwan events are largely unexplored, primarily because most genomic studies have largely utilized data from just two of the 16 recognized Highland Austronesian groups in Taiwan. In this study, we generated the largest genome-wide data set of Taiwanese Austronesians to date, including six Highland groups and one Lowland group from across the island and two Taiwanese Han groups. We identified fine-scale genomic structure in Taiwan, inferred the ancestry profile of the ancestors of Austronesians, and found that the southern Taiwanese Austronesians show excess genetic affinities with the Austronesians outside of Taiwan. Our findings thus shed new light on the Into- and Out-of-Taiwan dispersals.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    古代解剖学现代人类(AMHs)遇到了其他古老的人类物种,尤其是尼安德特人和丹尼索瓦人,当他们离开非洲并在大约6万年前传播到欧洲和亚洲时。他们与他们杂交,现代人类基因组保留了从这些杂交事件中遗传的DNA。最近对高质量(高覆盖率)的古代人类基因组进行了测序,可以直接估计个体的杂合性。这表明这些古人类群体的遗传多样性非常低,表明人口规模低。在这项研究中,我们分析了十种古代人类全基因组数据,包括四个高覆盖率的测序。我们筛选了这些古老的全基因组数据,寻找与单基因疾病相关的致病突变,并在个体受试者中建立了异常的致病性突变聚集,包括〜120,000岁的尼安德特人中与苯丙酮尿症相关的PAH基因致病性变异的四例纯合子病例。这种致病性突变的聚集在当代人群中极为罕见,它们在古代人类中的存在可以用不那么重要的临床表现加上小社区来解释,导致更高的近亲繁殖水平。我们的结果表明,与罕见疾病相关的致病变异可能是其他古人类物种渗入的结果,和古老的混合物因此可能会影响现代人的疾病风险。
    Ancient anatomically modern humans (AMHs) encountered other archaic human species, most notably Neanderthals and Denisovans, when they left Africa and spread across Europe and Asia ~60,000 years ago. They interbred with them, and modern human genomes retain DNA inherited from these interbreeding events. High quality (high coverage) ancient human genomes have recently been sequenced allowing for a direct estimation of individual heterozygosity, which has shown that genetic diversity in these archaic human groups was very low, indicating low population sizes. In this study, we analyze ten ancient human genome-wide data, including four sequenced with high-coverage. We screened these ancient genome-wide data for pathogenic mutations associated with monogenic diseases, and established unusual aggregation of pathogenic mutations in individual subjects, including quadruple homozygous cases of pathogenic variants in the PAH gene associated with the condition phenylketonuria in a ~120,000 years old Neanderthal. Such aggregation of pathogenic mutations is extremely rare in contemporary populations, and their existence in ancient humans could be explained by less significant clinical manifestations coupled with small community sizes, leading to higher inbreeding levels. Our results suggest that pathogenic variants associated with rare diseases might be the result of introgression from other archaic human species, and archaic admixture thus could have influenced disease risk in modern humans.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蒙古族人口超过600万,是中国讲蒙古语的人口最多的国家。然而,由于样本数量有限且单核苷酸多态性(SNP)覆盖率较低,因此蒙古人的遗传结构和混合史仍不清楚。在这项研究中,我们对来自辽宁省阜新市的38个蒙古族个体的700,000多个SNP的全基因组数据进行基因分型,以基于典型和先进的群体遗传分析方法[主成分分析(PCA),掺合料,FST,f3-统计,f4-统计,qpAdm/qpWave,qpGraph,ALDER,和TreeMix]。我们发现阜新蒙古族与汉族有着密切的遗传关系,北部蒙古人,其他讲蒙古语的人,和东亚地区的通古西人。此外,我们发现黄河流域和西辽河流域的新石器时代的小米农民和蒙古高原和阿穆尔河流域的新石器时代的狩猎采集者是主要的祖先来源,在阜新蒙古人的基因库中,还有其他与欧亚草原牧民和新石器时代伊朗农民有关的基因流。这些结果揭示了动态的人口历史,复杂的人口混合物,阜新蒙古族遗传多样性的多种来源。
    The Mongolian population exceeds six million and is the largest population among the Mongolic speakers in China. However, the genetic structure and admixture history of the Mongolians are still unclear due to the limited number of samples and lower coverage of single-nucleotide polymorphism (SNP). In this study, we genotyped genome-wide data of over 700,000 SNPs in 38 Mongolian individuals from Fuxin in Liaoning Province to explore the genetic structure and population history based on typical and advanced population genetic analysis methods [principal component analysis (PCA), admixture, FST, f 3 -statistics, f 4 -statistics, qpAdm/qpWave, qpGraph, ALDER, and TreeMix]. We found that Fuxin Mongolians had a close genetic relationship with Han people, northern Mongolians, other Mongolic speakers, and Tungusic speakers in East Asia. Also, we found that Neolithic millet farmers in the Yellow River Basin and West Liao River Basin and Neolithic hunter-gatherers in the Mongolian Plateau and Amur River Basin were the dominant ancestral sources, and there were additional gene flows related to Eurasian Steppe pastoralists and Neolithic Iranian farmers in the gene pool of Fuxin Mongolians. These results shed light on dynamic demographic history, complex population admixture, and multiple sources of genetic diversity in Fuxin Mongolians.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在物种形成连续体中确定物种是一项复杂的任务,因为物种起源的过程通常不是瞬时的。全基因组数据的使用为解决复杂的物种划界案件提供了前所未有的解决方案,经常解开神秘的多样性。然而,因为已知基于多物种合并模型的全基因组方法会混淆种群结构与物种边界,通常导致分类学上的过度分裂,越来越明显的是,物种划界研究必须考虑多种证据。在这项研究中,我们使用了系统基因组,群体基因组,和基于合并的物种划界方法,并根据形态和生态信息检查了这些信息,调查Chirostoma\“humboltianum组”(Atherinidae家族)的物种数量和边界。humboltianum组是一个在分类学上有争议的物种复合体,先前的形态学和线粒体研究产生了相互矛盾的物种划界结果。我们生成了77个个体的ddRADseq数据,代表该组中的9个名义物种,跨越它们在墨西哥中部高原的分布范围。
    我们的结果与形态物种和生态定界假设相冲突,确定在三个地理上有凝聚力的进化枝中组织的四个独立进化的谱系:(i)查帕拉湖中的chapalae和sphyraena组,(ii)帕兹夸罗湖和齐拉胡恩湖的estor小组,和(iii)Zacapu湖和Lerma河系的humboltianumsensustricto组。
    总的来说,我们的研究提供了一个非典型的例子,其中全基因组分析描述的物种比以前基于形态学认识的物种少.它还强调了Chapala-Lerma水文系统的地质历史对推动humboltianum组的同种形态形成的影响。
    Delimiting species across a speciation continuum is a complex task, as the process of species origin is not generally instantaneous. The use of genome-wide data provides unprecedented resolution to address convoluted species delimitation cases, often unraveling cryptic diversity. However, because genome-wide approaches based on the multispecies coalescent model are known to confound population structure with species boundaries, often resulting in taxonomic over-splitting, it has become increasingly evident that species delimitation research must consider multiple lines of evidence. In this study, we used phylogenomic, population genomic, and coalescent-based species delimitation approaches, and examined those in light of morphological and ecological information, to investigate species numbers and boundaries comprising the Chirostoma \"humboltianum group\" (family Atherinidae). The humboltianum group is a taxonomically controversial species complex where previous morphological and mitochondrial studies produced conflicting species delimitation outcomes. We generated ddRADseq data for 77 individuals representing the nine nominal species in the group, spanning their distribution range in the central Mexican plateau.
    Our results conflict with the morphospecies and ecological delimitation hypotheses, identifying four independently evolving lineages organized in three geographically cohesive clades: (i) chapalae and sphyraena groups in Lake Chapala, (ii) estor group in Lakes Pátzcuaro and Zirahuén, and (iii) humboltianum sensu stricto group in Lake Zacapu and Lerma river system.
    Overall, our study provides an atypical example where genome-wide analyses delineate fewer species than previously recognized on the basis of morphology. It also highlights the influence of the geological history of the Chapala-Lerma hydrological system in driving allopatric speciation in the humboltianum group.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    通过比较分析,分析了来自48个精心选择的Transylvania居住的Szekler和非Szekler匈牙利人的人群样本的全基因组基因型数据。我们的分析涉及生活在匈牙利的当代匈牙利人,其他欧洲人,和欧亚样本共计数530个人。塞克勒样本的来源是科隆镇,Transylvania.分析的非Szekler匈牙利样本是从村庄收集的,其历史可以追溯到阿帕德王朝时代。通过主成分分析和祖先分析得出的人口结构也揭示了所分析的Szeklers和非SzeklerTransylvania匈牙利人的群体内部相似性。这些群体也显示出彼此相似的遗传模式。使用按血统分类的身份发现工具进行的单倍型分析显示,在所调查的人群中,按血统分类的平均成对身份共享是相似的。但是科隆·塞克勒样本与匈牙利的匈牙利人的平均分享率高于非塞克勒·Transylvania匈牙利人。平均分享结果显示,与其他欧洲人相比,这两个群体都是孤立的,并指出,被调查的阿帕德时代村庄的非塞克勒·Transylvania匈牙利居民比被调查的科隆的塞克勒更为孤立。这也通过我们的自合性分析得到证实。按血统分类的身份分析和4人口测试也证实,这些讲匈牙利语的Transylvanvanian族与居住在匈牙利的匈牙利人密切相关。
    Genome-wide genotype data from 48 carefully selected population samples of Transylvania-living Szeklers and non-Szekler Hungarians were analyzed by comparative analysis. Our analyses involved contemporary Hungarians living in Hungary, other Europeans, and Eurasian samples counting 530 individuals altogether. The source of the Szekler samples was the commune of Korond, Transylvania. The analyzed non-Szekler Hungarian samples were collected from villages with a history dating back to the era of the Árpád Dynasty. Population structure by principal component analysis and ancestry analysis also revealed a great within-group similarity of the analyzed Szeklers and non-Szekler Transylvanian Hungarians. These groups also showed similar genetic patterns with each other. Haplotype analyses using identity-by-descent segment discovering tools showed that average pairwise identity-by-descent sharing is similar in the investigated populations, but the Korond Szekler samples had higher average sharing with the Hungarians from Hungary than non-Szekler Transylvanian Hungarians. Average sharing results showed that both groups are isolated compared to other Europeans, and pointed out that the non-Szekler Transylvanian Hungarian inhabitants of the investigated Árpád Age villages are more isolated than investigated Szeklers from Korond. This was confirmed by our autozygosity analysis as well. Identity-by-descent segment analyses and 4-population tests also confirmed that these Hungarian-speaking Transylvanian ethnic groups are strongly related to Hungarians living in Hungary.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    巴斯克人历史上一直生活在比利牛斯山脉西部,在佛朗哥-坎塔布里亚地区,横跨目前的西班牙和法国领土。在过去的几十年里,由于其独特的文化和生物学特征,它们一直是深入研究的焦点,争议很大,把它们作为异质的,孤立的,独特的人口。他们的非印欧语言,Euskara,被认为是塑造巴斯克人遗传景观的主要因素。然而,由于先前研究的局限性,关于他们的历史和假设的奇异性仍然存在激烈的争论。这里,我们在微观地理水平上分析了巴斯克和周围不讲Euskara的群体的全基因组数据。在1,970个现代和古代样本中,共分析了629,000个全基因组变异,包括来自巴斯克地区18个采样地点的190个新个体。第一次,从全基因组数据进行了局部和大规模分析,覆盖了整个佛朗哥-坎塔布里亚地区,结合等位基因频率和基于单倍型的方法。我们的结果表明,巴斯克人与周围人群有明显的区别,不讲Euskara的法语-Cantabrians处于中间位置。此外,观察到巴斯克人的遗传异质性与地理有显著的相关性。最后,与其他伊比利亚和周围人群相比,检测到的巴斯克人分化不能归因于外部起源。相反,我们表明,这种分化是铁器时代以来遗传连续性的结果,其特征是孤立的时期和最近缺乏基因流动,这些基因流动可能被语言障碍所加强。
    Basques have historically lived along the Western Pyrenees, in the Franco-Cantabrian region, straddling the current Spanish and French territories. Over the last decades, they have been the focus of intense research due to their singular cultural and biological traits that, with high controversy, placed them as a heterogeneous, isolated, and unique population. Their non-Indo-European language, Euskara, is thought to be a major factor shaping the genetic landscape of the Basques. Yet there is still a lively debate about their history and assumed singularity due to the limitations of previous studies. Here, we analyze genome-wide data of Basque and surrounding groups that do not speak Euskara at a micro-geographical level. A total of ∼629,000 genome-wide variants were analyzed in 1,970 modern and ancient samples, including 190 new individuals from 18 sampling locations in the Basque area. For the first time, local- and wide-scale analyses from genome-wide data have been performed covering the whole Franco-Cantabrian region, combining allele frequency and haplotype-based methods. Our results show a clear differentiation of Basques from the surrounding populations, with the non-Euskara-speaking Franco-Cantabrians located in an intermediate position. Moreover, a sharp genetic heterogeneity within Basques is observed with significant correlation with geography. Finally, the detected Basque differentiation cannot be attributed to an external origin compared to other Iberian and surrounding populations. Instead, we show that such differentiation results from genetic continuity since the Iron Age, characterized by periods of isolation and lack of recent gene flow that might have been reinforced by the language barrier.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Trans-Eurasian cultural and genetic exchanges have significantly influenced the demographic dynamics of Eurasian populations. The Hexi Corridor, located along the southeastern edge of the Eurasian steppe, served as an important passage of the ancient Silk Road in Northwest China and intensified the transcontinental exchange and interaction between populations on the Central Plain and in Western Eurasia. Historical and archeological records indicate that the Western Eurasian cultural elements were largely brought into North China via this geographical corridor, but there is debate on the extent to which the spread of barley/wheat agriculture into North China and subsequent Bronze Age cultural and technological mixture/shifts were achieved by the movement of people or dissemination of ideas. Here, we presented higher-resolution genome-wide autosomal and uniparental Y/mtDNA SNP or STR data for 599 northwestern Han Chinese individuals and conducted 2 different comprehensive genetic studies among Neolithic-to-present-day Eurasians. Genetic studies based on lower-resolution STR markers via PCA, STRUCTURE, and phylogenetic trees showed that northwestern Han Chinese individuals had increased genetic homogeneity relative to northern Mongolic/Turkic/Tungusic speakers and Tibeto-Burman groups. The genomic signature constructed based on modern/ancient DNA further illustrated that the primary ancestry of the northwestern Han was derived from northern millet farmer ancestors, which was consistent with the hypothesis of Han origin in North China and more recent northwestward population expansion. This was subsequently confirmed via excess shared derived alleles in f3/f4 statistical analyses and by more northern East Asian-related ancestry in the qpAdm/qpGraph models. Interestingly, we identified one western Eurasian admixture signature that was present in northwestern Han but absent from southern Han, with an admixture time dated to approximately 1000 CE (Tang and Song dynasties). Generally, we provided supporting evidence that historic Trans-Eurasian communication was primarily maintained through population movement, not simply cultural diffusion. The observed population dynamics in northwestern Han Chinese not only support the North China origin hypothesis but also reflect the multiple sources of the genetic diversity observed in this population.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    公牛生育能力被认为是不可或缺的特征,就农业经济学而言,因为它是牛提供小牛作物的成功概念,以及随之而来的泌乳。这确保了奶牛场的可持续性。传统上,公牛的繁殖力没有受到农场管理者的重视,饲养动物只根据表型预测因子进行评估,即,公牛的父亲受孕率和精液参数。随着动物育种分子时代的到来,尝试通过鉴定与性状相关的遗传标记来揭示公牛生育力的遗传复杂性。标记辅助选择(MAS)是一种方法,旨在利用标记的遗传信息并选择重要性状的改良种群。传统上,MAS是使用候选基因方法进行的,用于鉴定与已知具有与性状相关的生理功能的基因相关的标记,但是这种方法具有某些缺点,例如严格的显著性测试标准。现在,随着全基因组数据的可用性,与公牛生育力相关的识别标记和解释方差的数量有所增加。所以,这提供了一个独特的机会,通过基于大量全基因组标记的信息进行选择来重新审视MAS,因此,提高选择的准确性。
    Bull fertility is considered an indispensable trait, as far as farm economics is concerned since it is the successful conception in a cow that provides calf crop, along with the ensuing lactation. This ensures sustainability of a dairy farm. Traditionally, bull fertility did not receive much attention by the farm managers and breeding animals were solely evaluated based on phenotypic predictors, namely, sire conception rate and seminal parameters in bull. With the advent of the molecular era in animal breeding, attempts were made to unravel the genetic complexity of bull fertility by the identification of genetic markers related to the trait. Marker-Assisted Selection (MAS) is a methodology that aims at utilizing the genetic information at markers and selecting improved populations for important traits. Traditionally, MAS was pursued using a candidate gene approach for identifying markers related to genes that are already known to have a physiological function related to the trait but this approach had certain shortcomings like stringent criteria for significance testing. Now, with the availability of genome-wide data, the number of markers identified and variance explained in relation to bull fertility has gone up. So, this presents a unique opportunity to revisit MAS by selection based on the information of a large number of genome-wide markers and thus, improving the accuracy of selection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目前的技术允许以单点分辨率快速评估人类基因组中成千上万个位点的DNA序列和甲基化水平。成千上万的人同时这导致了复杂性状的全表观基因组关联研究(EWAS)的增加,特别是那些以前的全基因组关联研究(GWAS)解释不清的人。然而,基因组和表观基因组交织在一起,例如,已知DNA甲基化通过影响基因表达,例如,基因组印记。因此,需要超越单组学数据分析,并开发允许来自EWAS和GWAS的信息的有意义组合的交互模型。
    我们提出了两种新的遗传关联分析方法,将后代DNA甲基化水平视为环境暴露。我们的方法搜索SNP等位基因和DNA甲基化(G×Me)之间的统计相互作用,以及亲本起源效应和DNA甲基化(PoO×Me)之间的统计相互作用。使用case-parent三元组或dyad。我们使用附近基因组区域的甲基化水平来简化生物学解释。这些方法在亲子关系的数据集上进行了测试,与后代的EWAS数据。我们的结果表明,SNP周围的甲基化水平可以显着改变估计的相对风险。此外,我们展示了控制数据集如何识别假阳性。
    新方法,G×Me和PoO×Me,在遗传相对风险评估中整合DNA甲基化,从而对全基因组扫描进行更全面的生物学解释。此外,我们在区域内浓缩DNA甲基化水平的策略有助于克服使用基于稀疏芯片的测量的特定缺点.这些方法在免费提供的R包Haplin(https://cran。r-project.org/package=Haplin),实现多组数据集的快速扫描。
    Current technology allows rapid assessment of DNA sequences and methylation levels at a single-site resolution for hundreds of thousands of sites in the human genome, in thousands of individuals simultaneously. This has led to an increase in epigenome-wide association studies (EWAS) of complex traits, particularly those that are poorly explained by previous genome-wide association studies (GWAS). However, the genome and epigenome are intertwined, e.g., DNA methylation is known to affect gene expression through, for example, genomic imprinting. There is thus a need to go beyond single-omics data analyses and develop interaction models that allow a meaningful combination of information from EWAS and GWAS.
    We present two new methods for genetic association analyses that treat offspring DNA methylation levels as environmental exposure. Our approach searches for statistical interactions between SNP alleles and DNA methylation (G ×Me) and between parent-of-origin effects and DNA methylation (PoO ×Me), using case-parent triads or dyads. We use summarized methylation levels over nearby genomic region to ease biological interpretation. The methods were tested on a dataset of parent-offspring dyads, with EWAS data on the offspring. Our results showed that methylation levels around a SNP can significantly alter the estimated relative risk. Moreover, we show how a control dataset can identify false positives.
    The new methods, G ×Me and PoO ×Me, integrate DNA methylation in the assessment of genetic relative risks and thus enable a more comprehensive biological interpretation of genome-wide scans. Moreover, our strategy of condensing DNA methylation levels within regions helps overcome specific disadvantages of using sparse chip-based measurements. The methods are implemented in the freely available R package Haplin ( https://cran.r-project.org/package=Haplin ), enabling fast scans of multi-omics datasets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Historical Article
    这里,我们报告了从新石器时代晚期到青铜时代晚期的110个古代近东个体的全基因组数据分析,一个以近东地区间激烈互动为特征的时期。我们发现,北/中安纳托利亚和南高加索地区的公元前六千年人口在新石器时代形成的遗传序列上具有混合血统,该遗传序列在新石器时代形成于今天的南高加索/扎格罗斯地区之间。在晚期石器时代和/或早期青铜时代,超过一半的北黎凡特基因库被替换,在安纳托利亚和南高加索的其他地区,我们记录了只有瞬时基因流的遗传连续性。此外,我们揭示了青铜时代晚期北黎凡特地区一个遗传上独特的个体。总的来说,我们的研究揭示了人口动态随时间变化的多个尺度,从新石器时代的广泛混合到青铜时代晚期的全球化社会中的远距离流动。视频摘要.
    Here, we report genome-wide data analyses from 110 ancient Near Eastern individuals spanning the Late Neolithic to Late Bronze Age, a period characterized by intense interregional interactions for the Near East. We find that 6th millennium BCE populations of North/Central Anatolia and the Southern Caucasus shared mixed ancestry on a genetic cline that formed during the Neolithic between Western Anatolia and regions in today\'s Southern Caucasus/Zagros. During the Late Chalcolithic and/or the Early Bronze Age, more than half of the Northern Levantine gene pool was replaced, while in the rest of Anatolia and the Southern Caucasus, we document genetic continuity with only transient gene flow. Additionally, we reveal a genetically distinct individual within the Late Bronze Age Northern Levant. Overall, our study uncovers multiple scales of population dynamics through time, from extensive admixture during the Neolithic period to long-distance mobility within the globalized societies of the Late Bronze Age. VIDEO ABSTRACT.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号