Chromosomes, Human

染色体,Human
  • 文章类型: Journal Article
    背景:亲属比不相关的个体共享更多的基因组区域,更亲密的亲戚分享更多的地区。这个概念,与高通量单核苷酸多态性(SNP)基因分型技术的可用性增加配对,已经使测量个体之间共享的染色体区域以评估他们彼此之间的关系水平变得可行。然而,这些技术在应用度量或指数方面仍处于概念阶段,而不是实际阶段。最近,我们利用来自韩国家庭样本的大规模SNP数据开发了一种名为"基于遗传距离的染色体共享指数(GD-ICS)"的指数,并证明了其在亲属关系确定中的实际应用潜力.在目前的研究中,我们提供了来自各种实际案例的验证结果,证明了该方法在解决复杂的家族关系中的实用性,在这些家族关系中,从传统的短串联重复序列(STR)或谱系标记获得的信息尚无定论。
    方法:我们通过涉及13个亲属关系病例的韩国个体的微阵列分析获得了大规模SNP数据,并使用我们先前研究中描述的方法计算了GD-ICS值。基于为韩国家庭构建的GD-ICS参考,每个有争议的亲属关系使用传统STR和谱系标记的组合进行评估和验证.
    结果:这些案例包括使用传统方法发现不确定的那些A),B)难以应用传统的测试方法,和C)使用GD-ICS方法更确定地解决了这些问题。该方法克服了传统STR在亲属关系测试中面临的局限性,特别是在有STR突变事件的亲子关系案件中,以及在确认感兴趣的个人无法进行测试的远亲关系中。它还被证明在没有特定假设的情况下识别各种关系以及在确认个体之间缺乏遗传相关性方面是有效的。
    结论:这种方法已被证明在不同的复杂和实际情况下识别家庭关系是有效的。它不仅在传统测试方法无法提供结论性结果时有用,但它也提高了挑战性亲属案件的解决能力,这表明它适用于各种类型的实际案例。
    BACKGROUND: Relatives share more genomic regions than unrelated individuals, with closer relatives sharing more regions. This concept, paired with the increased availability of high-throughput single nucleotide polymorphism (SNP) genotyping technologies, has made it feasible to measure the shared chromosomal regions between individuals to assess their level of relation to each other. However, such techniques have remained in the conceptual rather than practical stages in terms of applying measures or indices. Recently, we developed an index called \"genetic distance-based index of chromosomal sharing (GD-ICS)\" utilizing large-scale SNP data from Korean family samples and demonstrated its potential for practical applications in kinship determination. In the current study, we present validation results from various real cases demonstrating the utility of this method in resolving complex familial relationships where information obtained from traditional short tandem repeats (STRs) or lineage markers is inconclusive.
    METHODS: We obtained large-scale SNP data through microarray analysis from Korean individuals involving 13 kinship cases and calculated GD-ICS values using the method described in our previous study. Based on the GD-ICS reference constructed for Korean families, each disputed kinship was evaluated and validated using a combination of traditional STRs and lineage markers.
    RESULTS: The cases comprised those A) that were found to be inconclusive using the traditional approach, B) for which it was difficult to apply traditional testing methods, and C) that were more conclusively resolved using the GD-ICS method. This method has overcome the limitations faced by traditional STRs in kinship testing, particularly in a paternity case with STR mutational events and in confirming distant kinship where the individual of interest is unavailable for testing. It has also been demonstrated to be effective in identifying various relationships without specific presumptions and in confirming a lack of genetic relatedness between individuals.
    CONCLUSIONS: This method has been proven effective in identifying familial relationships across diverse complex and practical scenarios. It is not only useful when traditional testing methods fail to provide conclusive results, but it also enhances the resolution of challenging kinship cases, which suggests its applicability in various types of practical casework.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    常染色体三体性的镶嵌在临床实践中并不常见。然而,尽管在产前和产后诊断中都很少见,有大量特征性和公开的病例。令人惊讶的是,与常规三体相反,没有尝试对马赛克载体的人口统计学进行系统分析。这是旨在解决这一差距的第一项研究。为此,我们已经筛选了八百多本关于马赛克三体的出版物,审查数据,包括性别和马赛克携带者的临床状况,产妇年龄和生育史。总的来说,596种出版物符合分析条件,包含948个产前诊断的数据,包括真正的胎儿镶嵌(TFM)和局限的胎盘镶嵌(CPM),以及318例产后检测到的马赛克(PNM)。出生体重适当的正常妊娠结局与宫内生长受限的孕妇年龄无差异。出乎意料的是,与异常结局(异常胎儿或新生儿)和胎儿损失相比,在正常结局中发现的高龄产妇(AMA)比例更高,73%vs.56%和50%,相应地,p=0.0015和p=0.0011。另一个有趣的发现是,与具有双亲二体(BPD)的携带者相比,染色体7、14、15和16的伴随单亲二体(UPD)的马赛克携带者中AMA比例更高(72%vs.58%,92%vs.55%,87%vs.78%,和65%vs.24%,相应地);总体数字为78%,而不是48%,p=0.0026。对生殖史的分析显示,与TFM和CPM队列的母亲(正常结局的比例很大)相比,PNM队列中报告先前胎儿丢失的母亲(几乎所有患者均为临床异常)的报告率非常差,但几乎高出两倍。30%vs.16%,p=0.0072。先前妊娠染色体异常的发生在产前队列中占13分之一,在出生后队列中占16分之一,与已发表的非马赛克三体研究相比,高出五倍。我们认为在这项研究中获得的数据是初步的,尽管文献综述的数量很大,因为详细数据的报告大多很差。因此,研究的队列并不代表“大数据”。然而,获得的信息对于临床遗传咨询和建模进一步研究都很有用。
    Mosaicism for autosomal trisomy is uncommon in clinical practice. However, despite its rarity among both prenatally and postnatally diagnoses, there are a large number of characterized and published cases. Surprisingly, in contrast to regular trisomies, no attempts at systematic analyses of mosaic carriers\' demographics were undertaken. This is the first study aimed to address this gap. For that, we have screened more than eight hundred publications on mosaic trisomies, reviewing data including gender and clinical status of mosaic carriers, maternal age and reproductive history. In total, 596 publications were eligible for analysis, containing data on 948 prenatal diagnoses, including true fetal mosaicism (TFM) and confined placental mosaicism (CPM), and on 318 cases of postnatally detected mosaicism (PNM). No difference was found in maternal age between normal pregnancy outcomes with appropriate birth weight and those with intrauterine growth restriction. Unexpectedly, a higher proportion of advanced maternal ages (AMA) was found in normal outcomes compared to abnormal ones (abnormal fetus or newborn) and fetal losses, 73% vs. 56% and 50%, p = 0.0015 and p = 0.0011, correspondingly. Another intriguing finding was a higher AMA proportion in mosaic carriers with concomitant uniparental disomy (UPD) for chromosomes 7, 14, 15, and 16 compared to carriers with biparental disomy (BPD) (72% vs. 58%, 92% vs. 55%, 87% vs. 78%, and 65% vs. 24%, correspondingly); overall figures were 78% vs. 48%, p = 0.0026. Analysis of reproductive histories showed a very poor reporting but almost two-fold higher rate of mothers reporting a previous fetal loss from PNM cohort (in which almost all patients were clinically abnormal) compared to mothers from the TFM and CPM cohorts (with a large proportion of normal outcomes), 30% vs. 16%, p = 0.0072. The occurrence of a previous pregnancy with a chromosome abnormality was 1 in 13 in the prenatal cohort and 1 in 16 in the postnatal cohort, which are five-fold higher compared to published studies on non-mosaic trisomies. We consider the data obtained in this study to be preliminary despite the magnitude of the literature reviewed since reporting of detailed data was mostly poor, and therefore, the studied cohorts do not represent \"big data\". Nevertheless, the information obtained is useful both for clinical genetic counseling and for modeling further studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    TERRA在调节人细胞端粒酶中的功能仍存在争议。虽然TERRA与端粒酶相互作用,它如何调节端粒酶功能仍然未知。这里,我们显示TERRA在细胞周期的不同阶段与端粒酶RNA亚基hTR在核质和端粒共定位。我们报道TERRA转录本在端粒延长过程中远离染色体末端,导致端粒TERRA-hTR分子数量减少,并随之增加端粒上的“无TERRA”端粒酶分子。使用活细胞成像和超分辨率显微镜,我们表明,在转录时,TERRA从其起源的端粒重新定位到长染色体末端。此外,反义寡核苷酸消除TERRA促进hTR定位到端粒,导致hTR分子在端粒的停留时间增加和半衰期延长。总的来说,我们的发现表明端粒TERRA转录物通过端粒酶反式作用抑制端粒延伸,损害端粒酶进入端粒不同于它们的染色体末端起源。
    The function of TERRA in the regulation of telomerase in human cells is still debated. While TERRA interacts with telomerase, how it regulates telomerase function remains unknown. Here, we show that TERRA colocalizes with the telomerase RNA subunit hTR in the nucleoplasm and at telomeres during different phases of the cell cycle. We report that TERRA transcripts relocate away from chromosome ends during telomere lengthening, leading to a reduced number of telomeric TERRA-hTR molecules and consequent increase in \"TERRA-free\" telomerase molecules at telomeres. Using live-cell imaging and super-resolution microscopy, we show that upon transcription, TERRA relocates from its telomere of origin to long chromosome ends. Furthermore, TERRA depletion by antisense oligonucleotides promoted hTR localization to telomeres, leading to increased residence time and extended half-life of hTR molecules at telomeres. Overall, our findings indicate that telomeric TERRA transcripts inhibit telomere elongation by telomerase acting in trans, impairing telomerase access to telomeres that are different from their chromosome end of origin.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    多形性胶质母细胞瘤(GBM)包括以表型和转录异质性为特征的脑恶性肿瘤,认为这些肿瘤具有侵袭性,对治疗有抵抗力,不可避免地反复出现。然而,关于GBM基因组的空间组织是如何构成这种异质性及其影响的,人们知之甚少。这里,我们编制了一个由28个患者来源的胶质母细胞瘤干细胞样细胞系(GSCs)组成的队列,已知这些细胞系能够反映其肿瘤起源的特性;其中6个是来自同一患者的原发复发肿瘤对.我们生成并分析来自所有GSC的5kbp分辨率染色体构象捕获(Hi-C)数据,以系统地绘制数千个独立和复杂的结构变体(SV)以及由此产生的大量新样本。通过结合Hi-C,组蛋白修饰,和染色质折叠模拟的基因表达数据,我们解释了无处不在,不均匀,和特殊发生的新洛洛普通过形成新的增强子-启动子接触来维持肿瘤特异性转录程序。我们还展示了即使是中度复发的新样本也可能与患者特定的漏洞相关。一起,我们的数据为剖析GBM生物学和异质性提供了资源,以及告知治疗方法。
    Glioblastoma multiforme (GBM) encompasses brain malignancies marked by phenotypic and transcriptional heterogeneity thought to render these tumors aggressive, resistant to therapy, and inevitably recurrent. However, little is known about how the spatial organization of GBM genomes underlies this heterogeneity and its effects. Here, we compile a cohort of 28 patient-derived glioblastoma stem cell-like lines (GSCs) known to reflect the properties of their tumor-of-origin; six of these were primary-relapse tumor pairs from the same patient. We generate and analyze 5 kbp-resolution chromosome conformation capture (Hi-C) data from all GSCs to systematically map thousands of standalone and complex structural variants (SVs) and the multitude of neoloops arising as a result. By combining Hi-C, histone modification, and gene expression data with chromatin folding simulations, we explain how the pervasive, uneven, and idiosyncratic occurrence of neoloops sustains tumor-specific transcriptional programs via the formation of new enhancer-promoter contacts. We also show how even moderately recurrent neoloops can relate to patient-specific vulnerabilities. Together, our data provide a resource for dissecting GBM biology and heterogeneity, as well as for informing therapeutic approaches.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    根据实验确定的平均原点间距离〜100kb,在每个细胞周期中,DNA复制从人类染色体上的约50,000个起点开始。认为起源是通过结合因子如起源识别复合物(ORC)或CTCF或其他特征如G-四链体来指定的。我们已经对113个基因组范围的人类起源谱(来自五种不同的技术)和五个ORC结合谱进行了综合分析,以严格评估这些特征是否指定了最可重复的起源。在所有数据集确定的约750万个工会起源中,在至少20个独立的SNS-seq数据集中,只有0.27%(20,250个共享来源)可重复获得,并包含在其他三种技术中的每一种识别的起始区域中,表明原产地使用和识别的广泛差异。此外,21%的共同起源与转录启动子重叠,摆出一个难题。尽管共享起源与组成型CTCF结合位点的联合起源重叠更多,G-四链体位点,激活组蛋白标记,这些重叠与已知的转录起始位点相当或更小,所以这些特征可以丰富起源,因为起源与表观遗传开放的重叠,启动子样序列。在20,250个共享起源中,只有6.4%来自人类癌细胞中〜13,000个可复制的ORC结合位点中的任何一个,在1kb内。只有4.5%在1kb的〜11,000个结合MCM2-7结合位点内,与酵母中两个比较中接近100%的重叠相反,酿酒酵母。因此,在人类癌细胞系中,复制起点似乎是由高度可变的随机事件指定的,依赖于启动子周围的高表观遗传可达性,在最可重复的起源和目前已知的ORC或MCM结合位点之间没有广泛的重叠。
    Based on experimentally determined average inter-origin distances of ~100 kb, DNA replication initiates from ~50,000 origins on human chromosomes in each cell cycle. The origins are believed to be specified by binding of factors like the origin recognition complex (ORC) or CTCF or other features like G-quadruplexes. We have performed an integrative analysis of 113 genome-wide human origin profiles (from five different techniques) and five ORC-binding profiles to critically evaluate whether the most reproducible origins are specified by these features. Out of ~7.5 million union origins identified by all datasets, only 0.27% (20,250 shared origins) were reproducibly obtained in at least 20 independent SNS-seq datasets and contained in initiation zones identified by each of three other techniques, suggesting extensive variability in origin usage and identification. Also, 21% of the shared origins overlap with transcriptional promoters, posing a conundrum. Although the shared origins overlap more than union origins with constitutive CTCF-binding sites, G-quadruplex sites, and activating histone marks, these overlaps are comparable or less than that of known transcription start sites, so that these features could be enriched in origins because of the overlap of origins with epigenetically open, promoter-like sequences. Only 6.4% of the 20,250 shared origins were within 1 kb from any of the ~13,000 reproducible ORC-binding sites in human cancer cells, and only 4.5% were within 1 kb of the ~11,000 union MCM2-7-binding sites in contrast to the nearly 100% overlap in the two comparisons in the yeast, Saccharomyces cerevisiae. Thus, in human cancer cell lines, replication origins appear to be specified by highly variable stochastic events dependent on the high epigenetic accessibility around promoters, without extensive overlap between the most reproducible origins and currently known ORC- or MCM-binding sites.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    由于各种实验技术的进步,许多物种中的间期染色体的组织开始出现。然而,对动力学知之甚少,特别是在染色质的功能状态。一些实验表明,人类间期染色体中单个基因座的运动性在转录过程中降低,而在抑制转录时增加。这是一个违反直觉的发现,因为人们认为大约十个皮牛顿的主动机械力(F),由RNA聚合酶II(RNAPII)产生,可能会传递到染色质的基因丰富区域,会让它变得更加开放,从而提高流动性。我们为相间染色体开发了最小活性共聚物模型,以研究F如何影响染色质的动力学特性。基因丰富区域中的基因座的运动在F的中间范围内受到抑制,并在较小的F值下得到增强,在实验中也观察到了这一点。在中间F中,连续基因座之间的键长增加,与非结合基因座之间的有吸引力的相互作用最小时的距离相称。这导致了短暂的无序到有序的转变,导致转录过程中的移动性降低。引人注目的是,基因座动力学中的F依赖性变化保留了染色体在[公式:见正文]处的组织。基因座的瞬时排序,在具有随机表观遗传特征的聚合物中没有发现,在基因丰富的区域可能是一种合理的机制,用于成核涉及转录因子的动态网络,RNAPII,和染色质。
    The organization of interphase chromosomes in a number of species is starting to emerge thanks to advances in a variety of experimental techniques. However, much less is known about the dynamics, especially in the functional states of chromatin. Some experiments have shown that the motility of individual loci in human interphase chromosome decreases during transcription and increases upon inhibiting transcription. This is a counterintuitive finding because it is thought that the active mechanical force (F) on the order of ten piconewtons, generated by RNA polymerase II (RNAPII) that is presumably transmitted to the gene-rich region of the chromatin, would render it more open, thus enhancing the mobility. We developed a minimal active copolymer model for interphase chromosomes to investigate how F affects the dynamical properties of chromatin. The movements of the loci in the gene-rich region are suppressed in an intermediate range of F and are enhanced at small F values, which has also been observed in experiments. In the intermediate F, the bond length between consecutive loci increases, becoming commensurate with the distance at the minimum of the attractive interaction between nonbonded loci. This results in a transient disorder-to-order transition, leading to a decreased mobility during transcription. Strikingly, the F-dependent change in the locus dynamics preserves the organization of the chromosome at [Formula: see text]. Transient ordering of the loci, which is not found in the polymers with random epigenetic profiles, in the gene-rich region might be a plausible mechanism for nucleating a dynamic network involving transcription factors, RNAPII, and chromatin.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    最近发现了k-mer词(2≤k≤11)在基因组分布和进化保守性方面的特定特征。其中包括,在高丰度,具有串联重复结构的单词(重复单元长度为1bp至3bp)。此外,似乎有一类极短的串联重复(≤12bp),到目前为止被忽视了,它们是非随机分布的,因此,可能在基因组的功能中起着至关重要的作用。在下面的文章中,将这些我们称之为超短串联重复序列(SSTR)的基序的位置分布与其他功能元件进行了比较,比如基因和反转录转座子.我们发现了局部SSTR密度和G+C含量之间的长度和序列相关关系,以及SSTR和基因的密度,以及与反转录转座子密度的相关性。除了许多一般有趣的关系,我们发现SINEAlu对局部SSTR密度有很大影响。此外,观察到的SSTR模式与假基因和外显子的连接可能暗示了SSTR在基因表达中的特殊作用。总之,我们的研究结果支持SSTRs在基因组中的特殊作用和功能相关性的观点.
    The specific characteristics of k-mer words (2 ≤ k ≤ 11) regarding genomic distribution and evolutionary conservation were recently found. Among them are, in high abundance, words with a tandem repeat structure (repeat unit length of 1 bp to 3 bp). Furthermore, there seems to be a class of extremely short tandem repeats (≤12 bp), so far overlooked, that are non-random-distributed and, therefore, may play a crucial role in the functioning of the genome. In the following article, the positional distributions of these motifs we call super-short tandem repeats (SSTRs) were compared to other functional elements, like genes and retrotransposons. We found length- and sequence-dependent correlations between the local SSTR density and G+C content, and also between the density of SSTRs and genes, as well as correlations with retrotransposon density. In addition to many general interesting relations, we found that SINE Alu has a strong influence on the local SSTR density. Moreover, the observed connection of SSTR patterns to pseudogenes and -exons might imply a special role of SSTRs in gene expression. In summary, our findings support the idea of a special role and the functional relevance of SSTRs in the genome.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在五个人类顶基染色体的核仁组织区(NOR)中排列的数百个rRNA基因拷贝中,约50%保持转录失活。NOR相关序列和表观遗传修饰有助于rRNA的差异表达。然而,控制哺乳动物中每个NOR中活性和非活性rRNA基因剂量的机制尚待确定。我们发现了一个ncRNAs家族,SNULs(单个NUcleolus定位RNA),在单个NORs上形成受约束的核仁下区域并影响rRNA表达。SNULs的单个成员单等位基因与特定的含NOR染色体相关。SNULs与pre-rRNA具有序列相似性,并与pre-rRNA定位在核仁区室中。最后,SNUL通过影响pre-rRNA分选到DFC区室和pre-rRNA加工来控制rRNA表达。我们的研究发现了一类新的ncRNAs通过在单个NORs上形成限制的核仁区域来影响rRNA表达。
    Out of the several hundred copies of rRNA genes arranged in the nucleolar organizing regions (NOR) of the five human acrocentric chromosomes, ~50% remain transcriptionally inactive. NOR-associated sequences and epigenetic modifications contribute to the differential expression of rRNAs. However, the mechanism(s) controlling the dosage of active versus inactive rRNA genes within each NOR in mammals is yet to be determined. We have discovered a family of ncRNAs, SNULs (Single NUcleolus Localized RNA), which form constrained sub-nucleolar territories on individual NORs and influence rRNA expression. Individual members of the SNULs monoallelically associate with specific NOR-containing chromosomes. SNULs share sequence similarity to pre-rRNA and localize in the sub-nucleolar compartment with pre-rRNA. Finally, SNULs control rRNA expression by influencing pre-rRNA sorting to the DFC compartment and pre-rRNA processing. Our study discovered a novel class of ncRNAs influencing rRNA expression by forming constrained nucleolar territories on individual NORs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    由于长读取测序的高错误率,解析富含分段重复(SD)的复杂基因组区域是具有挑战性的。这里,我们描述了一种新型基因组组装者PhaseDancer的靶向方法,该方法迭代地扩展了富含SD的感兴趣区域。我们使用人类BAC克隆的黄金标准集和具有预定义进化场景的硅片生成的SD来验证其鲁棒性和效率。PhaseDancer能够延伸与人类2号染色体(HSA2)融合位点直系同源的大猿染色体的不完整复杂的富含SD的亚端粒区域,揭示了HSA2形成的模型,并揭示了人类和大猿基因组的进化。
    Resolving complex genomic regions rich in segmental duplications (SDs) is challenging due to the high error rate of long-read sequencing. Here, we describe a targeted approach with a novel genome assembler PhaseDancer that extends SD-rich regions of interest iteratively. We validate its robustness and efficiency using a golden-standard set of human BAC clones and in silico-generated SDs with predefined evolutionary scenarios. PhaseDancer enables extension of the incomplete complex SD-rich subtelomeric regions of Great Ape chromosomes orthologous to the human chromosome 2 (HSA2) fusion site, informing a model of HSA2 formation and unravelling the evolution of human and Great Ape genomes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    众所周知,〜1.6kb神经母细胞瘤断点家族(NBPF)重复序列是人类特异性的,有助于认知能力。高阶重复3merHOR(Olduvai三胞胎)的频率增加。从黑猩猩到现代人,有一个不连续的跳跃,从0到50tandemly组织的3merHOR。在这里,我们研究了Pääbo等人的尼安德特人基因组组装中NBPF3merHOR的结构。,将其与人类HG38获得的结果进行比较。p14染色体1。我们的发现揭示了尼安德特人中相应的NBPF3merHOR阵列,与人类相比,单体结构和HOR拷贝数略有不同。此外,我们计算了Miga等人的完整端粒到端粒人类基因组组装(T2T-CHM13)的NBPF3merHOR模式。,鉴定具有5个和9个NBPF3merHOR拷贝的两个新的NBPF3merHOR重复序列。我们假设这些阵列对应于新的NBPF基因(这里称为NBPFA1和NBPFA2)。使用T2T-CHM13作为参考进一步提高尼安德特人基因组的质量将对确定尼安德特人基因组中这种遥远的新NBPF基因的存在以及增强我们对人类进化的理解非常感兴趣。
    It is known that the ~ 1.6 kb Neuroblastoma BreakPoint Family (NBPF) repeats are human specific and contributing to cognitive capabilities, with increasing frequency in higher order repeat 3mer HORs (Olduvai triplets). From chimpanzee to modern human there is a discontinuous jump from 0 to ~ 50 tandemly organized 3mer HORs. Here we investigate the structure of NBPF 3mer HORs in the Neanderthal genome assembly of Pääbo et al., comparing it to the results obtained for human hg38.p14 chromosome 1. Our findings reveal corresponding NBPF 3mer HOR arrays in Neanderthals with slightly different monomer structures and numbers of HOR copies compared to humans. Additionally, we compute the NBPF 3mer HOR pattern for the complete telomere-to-telomere human genome assembly (T2T-CHM13) by Miga et al., identifying two novel tandem arrays of NBPF 3mer HOR repeats with 5 and 9 NBPF 3mer HOR copies. We hypothesize that these arrays correspond to novel NBPF genes (here referred to as NBPFA1 and NBPFA2). Further improving the quality of the Neanderthal genome using T2T-CHM13 as a reference would be of great interest in determining the presence of such distant novel NBPF genes in the Neanderthal genome and enhancing our understanding of human evolution.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号