Genome, Human

基因组,Human
  • 文章类型: Journal Article
    BACKGROUND: Cell free DNA (cfDNA)-based assays hold great potential in detecting early cancer signals yet determining the tissue-of-origin (TOO) for cancer signals remains a challenging task. Here, we investigated the contribution of a methylation atlas to TOO detection in low depth cfDNA samples.
    METHODS: We constructed a tumor-specific methylation atlas (TSMA) using whole-genome bisulfite sequencing (WGBS) data from five types of tumor tissues (breast, colorectal, gastric, liver and lung cancer) and paired white blood cells (WBC). TSMA was used with a non-negative least square matrix factorization (NNLS) deconvolution algorithm to identify the abundance of tumor tissue types in a WGBS sample. We showed that TSMA worked well with tumor tissue but struggled with cfDNA samples due to the overwhelming amount of WBC-derived DNA. To construct a model for TOO, we adopted the multi-modal strategy and used as inputs the combination of deconvolution scores from TSMA with other features of cfDNA.
    RESULTS: Our final model comprised of a graph convolutional neural network using deconvolution scores and genome-wide methylation density features, which achieved an accuracy of 69% in a held-out validation dataset of 239 low-depth cfDNA samples.
    CONCLUSIONS: In conclusion, we have demonstrated that our TSMA in combination with other cfDNA features can improve TOO detection in low-depth cfDNA samples.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    DNA聚合酶κ(Polκ)是一种专门的聚合酶,具有多种细胞作用,例如跨损伤DNA合成,重复序列的复制,和核苷酸切除修复。我们开发了一种利用Polκ特异性底物捕获Polκ合成的DNA的方法,N2-(4-乙炔基苄基)-2'-脱氧鸟苷(EBndG)。将DNA剪切成200至500bp长度后,使用Cu(I)催化的炔-叠氮化物环加成反应将含EBndG的DNA共价结合到生物素上,并用链霉亲和素珠分离。然后将分离的DNA连接到衔接子,然后进行PCR扩增和下一代测序以生成全基因组修复图。我们将这种方法称为聚合酶κ测序。这里,我们提供了未受损细胞系中Polκ活性的人类基因组图谱。我们发现Polκ活性在富含GC的区域增强,常染色质区域,基因的启动子,在S期早期复制的DNA中。
    DNA polymerase κ (Polκ) is a specialized polymerase that has multiple cellular roles such as translesion DNA synthesis, replication of repetitive sequences, and nucleotide excision repair. We have developed a method for capturing DNA synthesized by Polκ utilizing a Polκ-specific substrate, N2-(4-ethynylbenzyl)-2\'-deoxyguanosine (EBndG). After shearing of the DNA into 200 to 500 bp lengths, the EBndG-containing DNA was covalently bound to biotin using the Cu(I)-catalyzed alkyne-azide cycloaddition reaction and isolated with streptavidin beads. Isolated DNA was then ligated to adaptors, followed by PCR amplification and next-generation sequencing to generate genome-wide repair maps. We have termed this method polymerase κ sequencing. Here, we present the human genome maps for Polκ activity in an undamaged cell line. We found that Polκ activity was enhanced in GC-rich regions, euchromatin regions, the promoter of genes, and in DNA that is replicated early in the S phase.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    循环细胞在S期通过定义的称为复制定时的时间程序复制其DNA。突变频率,表观遗传染色质状态,和转录活性对于在S期早期和晚期复制的基因组区域是不同的。这里,我们从ChIP-Seq分析中发现,DNA聚合酶(Pol)κ富集在HEK293T细胞的早期复制基因组区域。此外,通过用N2-庚炔基-2'-脱氧鸟苷喂养细胞,然后进行基于点击化学的富集和高通量测序,我们观察到在S期早期复制的基因组区域中Polκ活性升高。基于Polκ在与内源性诱导的N2修饰的dG病变相反的准确有效的核苷酸插入中已建立的功能,我们的工作表明,Polκ的主动参与可能有助于在人类基因组的早期复制区域中观察到的突变率降低,包括癌症基因组.一起,我们的工作扩展了Polκ的功能,并提供了人类基因组中复制时间依赖性突变累积的合理机制.
    Cycling cells replicate their DNA during the S phase through a defined temporal program known as replication timing. Mutation frequencies, epigenetic chromatin states, and transcriptional activities are different for genomic regions that are replicated early and late in the S phase. Here, we found from ChIP-Seq analysis that DNA polymerase (Pol) κ is enriched in early-replicating genomic regions in HEK293T cells. In addition, by feeding cells with N 2-heptynyl-2\'-deoxyguanosine followed by click chemistry-based enrichment and high-throughput sequencing, we observed elevated Pol κ activities in genomic regions that are replicated early in the S phase. On the basis of the established functions of Pol κ in accurate and efficient nucleotide insertion opposite endogenously induced N 2-modified dG lesions, our work suggests that active engagement of Pol κ may contribute to diminished mutation rates observed in early-replicating regions of the human genome, including cancer genomes. Together, our work expands the functions of Pol κ and offered a plausible mechanism underlying replication timing-dependent mutation accrual in the human genome.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    CRISPR疗法的临床成功取决于Cas蛋白的安全性和有效性。来自Francisellanovicida(FnCas9)的Cas9非常精确,对错配底物的亲和力可忽略不计,但其低细胞靶向效率限制了治疗用途。这里,我们合理地设计蛋白质以开发增强的FnCas9(enFnCas9)变体,并将其在人类基因组位点的可及性扩大约3.5倍。具有单一错配特异性的enFnCas9蛋白扩展了基于FnCas9的CRISPR诊断的目标范围以检测致病性DNA特征。它们在目标编辑效率方面优于化脓性链球菌Cas9(SpCas9)及其工程衍生物,敲入率,和脱靶特异性。enFnCas9可以与延伸的gRNA组合,用于在PAM约束的规范碱基编辑器不可访问的位点处进行稳健的碱基编辑。最后,我们证明了使用enFnCas9腺嘌呤碱基编辑器在Leber先天性黑蒙2(LCA2)患者特异性iPSC系中的RPE65突变校正,强调其治疗效用。
    The clinical success of CRISPR therapies hinges on the safety and efficacy of Cas proteins. The Cas9 from Francisella novicida (FnCas9) is highly precise, with a negligible affinity for mismatched substrates, but its low cellular targeting efficiency limits therapeutic use. Here, we rationally engineer the protein to develop enhanced FnCas9 (enFnCas9) variants and broaden their accessibility across human genomic sites by ~3.5-fold. The enFnCas9 proteins with single mismatch specificity expanded the target range of FnCas9-based CRISPR diagnostics to detect the pathogenic DNA signatures. They outperform Streptococcus pyogenes Cas9 (SpCas9) and its engineered derivatives in on-target editing efficiency, knock-in rates, and off-target specificity. enFnCas9 can be combined with extended gRNAs for robust base editing at sites which are inaccessible to PAM-constrained canonical base editors. Finally, we demonstrate an RPE65 mutation correction in a Leber congenital amaurosis 2 (LCA2) patient-specific iPSC line using enFnCas9 adenine base editor, highlighting its therapeutic utility.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基因调控对胎盘功能和胎儿发育至关重要。我们使用数字基因组足迹和转录组数据构建了人类胎盘的基因组尺度转录调控网络(TRN)。我们整合了来自胎盘样品的475个转录组和12个DNase超敏反应数据集,以全局和定量地绘制转录因子(TF)-靶基因相互作用。在独立的数据集中,TRN模型预测了73%靶基因的样品外R2大于0.25的靶基因表达。我们进行了四种TF的siRNA敲除,并且实现了我们的TRN中预测的基因靶标与敲除的表达差异之间的一致性,对于四种TF中的三种,准确度>0.7。我们的最终模型包含391TFs和7712个靶基因的113,158个相互作用,并且是公开可用的。我们确定了29个TFs,这些TFs作为先前与早产相关的基因的调节因子,在早产胎盘中,这些TFs中有8个减少。
    Gene regulation is essential to placental function and fetal development. We built a genome-scale transcriptional regulatory network (TRN) of the human placenta using digital genomic footprinting and transcriptomic data. We integrated 475 transcriptomes and 12 DNase hypersensitivity datasets from placental samples to globally and quantitatively map transcription factor (TF)-target gene interactions. In an independent dataset, the TRN model predicted target gene expression with an out-of-sample R2 greater than 0.25 for 73% of target genes. We performed siRNA knockdowns of four TFs and achieved concordance between the predicted gene targets in our TRN and differences in expression of knockdowns with an accuracy of >0.7 for three of the four TFs. Our final model contained 113,158 interactions across 391 TFs and 7712 target genes and is publicly available. We identified 29 TFs which were significantly enriched as regulators for genes previously associated with preterm birth, and eight of these TFs were decreased in preterm placentas.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    结论:纳米孔测序的改进需要有效的分类方法,包括预滤波和自适应采样算法,其丰富感兴趣的读取。基于信号的方法规避了碱基判定的计算瓶颈。但是过去的基于信号的分类方法不能有效地扩展到大,像pangenomes这样的重复引用,将其效用限制在部分参考或个体基因组。我们介绍Sigmoni:一个快速的,基于r指数的多类别分类方法,可扩展到数百Gbps的参考。Sigmoni将纳米孔信号量化为皮安范围的离散字母。它执行迅速,使用匹配统计量的近似匹配,根据皮安amp匹配统计和共线性统计的分布对读段进行分类,所有这些都在线性查询时间内,而不需要种子链扩展。Sigmoni比以前的方法快10-100倍,用于在宿主耗竭实验中进行自适应采样,精度更高。并且可以针对大型微生物或人类pangenomes查询读数。Sigmoni是第一个基于信号的工具,可以扩展到完整的人类基因组和pangenome,同时保持足够快的自适应采样应用。
    方法:Sigmoni在Python中实现,并且可以在https://github.com/vshiv18/sigmoni上开源。
    CONCLUSIONS: Improvements in nanopore sequencing necessitate efficient classification methods, including pre-filtering and adaptive sampling algorithms that enrich for reads of interest. Signal-based approaches circumvent the computational bottleneck of basecalling. But past methods for signal-based classification do not scale efficiently to large, repetitive references like pangenomes, limiting their utility to partial references or individual genomes. We introduce Sigmoni: a rapid, multiclass classification method based on the r-index that scales to references of hundreds of Gbps. Sigmoni quantizes nanopore signal into a discrete alphabet of picoamp ranges. It performs rapid, approximate matching using matching statistics, classifying reads based on distributions of picoamp matching statistics and co-linearity statistics, all in linear query time without the need for seed-chain-extend. Sigmoni is 10-100× faster than previous methods for adaptive sampling in host depletion experiments with improved accuracy, and can query reads against large microbial or human pangenomes. Sigmoni is the first signal-based tool to scale to a complete human genome and pangenome while remaining fast enough for adaptive sampling applications.
    METHODS: Sigmoni is implemented in Python, and is available open-source at https://github.com/vshiv18/sigmoni.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    内源性逆转录病毒(ERV)与长末端重复序列(LTR)逆转录转座子有关,包含整合到宿主基因组中并根据孟德尔定律遗传的外源逆转录病毒的基因序列。它们被认为对宿主基因组结构和功能的进化做出了巨大贡献。我们先前在人类基因组中表征了HERV-KHML-9。然而,黑猩猩基因组中这种元件的生物学功能,是人类最亲近的亲戚,在很大程度上仍然难以捉摸。因此,本研究旨在表征黑猩猩基因组中的HML-9,并将结果与人类基因组中的结果进行比较。首先,我们报告了黑猩猩基因组中HML-9的26种原病毒元素和38种单独LTR元素的分布和遗传结构表征。结果表明,这些元素的分布呈现非随机整合模式,只有六个元素保持了相对完整的结构。然后,我们分析了它们的系统发育,并揭示了鉴定的元素都与HML-9参考文献以及人类基因组中鉴定的元素聚集在一起。基于2-LTR方法估计HML-9积分时间,结果表明,HML-9元件在1400万至3600万年前被整合到黑猩猩基因组中,在18至49mya之间被整合到人类基因组中。此外,保守的图案,顺式监管区域,并基于生物信息学预测了黑猩猩基因组中富集的PBS序列特征。结果表明,在黑猩猩基因组中发现的显著富集的ERVLTR调节基因的通路与疾病发展密切相关,包括神经和神经发育精神疾病。总之,身份证明,表征,和这里介绍的HML-9基因组学不仅有助于我们对ERV在灵长类动物进化中的作用的理解,而且有助于我们对其生物功能意义的理解。
    Endogenous retroviruses (ERVs) are related to long terminal repeat (LTR) retrotransposons, comprising gene sequences of exogenous retroviruses integrated into the host genome and inherited according to Mendelian law. They are considered to have contributed greatly to the evolution of host genome structure and function. We previously characterized HERV-K HML-9 in the human genome. However, the biological function of this type of element in the genome of the chimpanzee, which is the closest living relative of humans, largely remains elusive. Therefore, the current study aims to characterize HML-9 in the chimpanzee genome and to compare the results with those in the human genome. Firstly, we report the distribution and genetic structural characterization of the 26 proviral elements and 38 solo LTR elements of HML-9 in the chimpanzee genome. The results showed that the distribution of these elements displayed a non-random integration pattern, and only six elements maintained a relatively complete structure. Then, we analyze their phylogeny and reveal that the identified elements all cluster together with HML-9 references and with those identified in the human genome. The HML-9 integration time was estimated based on the 2-LTR approach, and the results showed that HML-9 elements were integrated into the chimpanzee genome between 14 and 36 million years ago and into the human genome between 18 and 49 mya. In addition, conserved motifs, cis-regulatory regions, and enriched PBS sequence features in the chimpanzee genome were predicted based on bioinformatics. The results show that pathways significantly enriched for ERV LTR-regulated genes found in the chimpanzee genome are closely associated with disease development, including neurological and neurodevelopmental psychiatric disorders. In summary, the identification, characterization, and genomics of HML-9 presented here not only contribute to our understanding of the role of ERVs in primate evolution but also to our understanding of their biofunctional significance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    DNA错配修复(MMRd)的缺陷在癌症基因组中留下了微卫星不稳定性(MSI)的特征性足迹。我们使用来自癌症基因组图谱和国际癌症基因组联盟的数据对MSI相关癌症进行了全面分析,专注于indel突变签名。我们根据其indel谱将MSI高基因组分为两个亚型:缺失显性(MMRd-del)和插入显性(MMRd-ins)。与MMRd-del基因组相比,MMRd-ins基因组表现出不同的突变和转录组特征,包括T>C取代和相关突变特征的患病率较高。MMRd-ins和MMRd-del基因组中的短插入和缺失靶向不同的基因集,导致两个亚型之间不同的indel配置文件。此外,MMRd-ins基因组中的indel富含亚克隆改变,这些亚克隆改变提供了有关MMRd-ins和MMRd-del基因组之间独特进化关系的线索。值得注意的是,转录组分析表明MMRd-ins癌症上调免疫相关基因,显示高水平的免疫细胞浸润,并显示出升高的新抗原负荷。两种类型的MMRd基因组之间的基因组和转录组差异突出了遗传机制的异质性,以及导致癌症的基因组足迹和转录组变化。具有潜在的临床意义。
    Deficiencies in DNA mismatch repair (MMRd) leave characteristic footprints of microsatellite instability (MSI) in cancer genomes. We used data from the Cancer Genome Atlas and International Cancer Genome Consortium to conduct a comprehensive analysis of MSI-associated cancers, focusing on indel mutational signatures. We classified MSI-high genomes into two subtypes based on their indel profiles: deletion-dominant (MMRd-del) and insertion-dominant (MMRd-ins). Compared with MMRd-del genomes, MMRd-ins genomes exhibit distinct mutational and transcriptomic features, including a higher prevalence of T>C substitutions and related mutation signatures. Short insertions and deletions in MMRd-ins and MMRd-del genomes target different sets of genes, resulting in distinct indel profiles between the two subtypes. In addition, indels in the MMRd-ins genomes are enriched with subclonal alterations that provide clues about a distinct evolutionary relationship between the MMRd-ins and MMRd-del genomes. Notably, the transcriptome analysis indicated that MMRd-ins cancers upregulate immune-related genes, show a high level of immune cell infiltration, and display an elevated neoantigen burden. The genomic and transcriptomic distinctions between the two types of MMRd genomes highlight the heterogeneity of genetic mechanisms and resulting genomic footprints and transcriptomic changes in cancers, which has potential clinical implications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    梅罗芬吉安时期(公元5至8cc)是人口统计学时期,社会经济,文化,西欧的政治重组。这里,我们报告了30个人类骨骼遗骸的全基因组shot弹枪序列数据,这些遗骸来自Koksijde的沿海Merovingian晚期遗址(公元675至750年),与现今佛兰德斯的两个中世纪早期到晚期遗址的18个遗迹一起,比利时。我们发现了两个不同的祖先,一个与中世纪早期的英格兰和荷兰分享,而另一个,次要成分,反映了可能的大陆高雅血统。亲属关系分析发现,没有以精英墓葬为特征的大型家谱,而是揭示了主要祖先群体个体之间遥远关系的高度模块化。相比之下,>90%高雅血统的个体在样本个体之间没有亲属关系。人口结构和主要群体中高利人祖先程度的主要差异的证据,包括一对母女,表明他们埋葬时社区中正在进行的混合。同位素和遗传证据结合在一起支持了一个模型,代表一个成熟的沿海非精英社区,吸收了来自内陆人口的移民。Koksijde的主要墓葬群显示出丰富的>5cM长的共享等位基因间隔,附近的中世纪高地,暗示长期连续性,并暗示与英国类似,中世纪早期祖先的变化对佛兰德人口的遗传构成产生了重大而持久的影响。我们发现两个祖先群体在色素沉着和饮食相关变异中的等位基因频率差异很大,包括那些与乳糖酶持久性有关的,可能反映了祖先的变化,而不是当地的适应。
    The Merovingian period (5th to 8th cc AD) was a time of demographic, socioeconomic, cultural, and political realignment in Western Europe. Here, we report the whole-genome shotgun sequence data of 30 human skeletal remains from a coastal Late Merovingian site of Koksijde (675 to 750 AD), alongside 18 remains from two Early to Late Medieval sites in present-day Flanders, Belgium. We find two distinct ancestries, one shared with Early Medieval England and the Netherlands, while the other, minor component, reflecting likely continental Gaulish ancestry. Kinship analyses identified no large pedigrees characteristic to elite burials revealing instead a high modularity of distant relationships among individuals of the main ancestry group. In contrast, individuals with >90% Gaulish ancestry had no kinship links among sampled individuals. Evidence for population structure and major differences in the extent of Gaulish ancestry in the main group, including in a mother-daughter pair, suggests ongoing admixture in the community at the time of their burial. The isotopic and genetic evidence combined supports a model by which the burials, representing an established coastal nonelite community, had incorporated migrants from inland populations. The main group of burials at Koksijde shows an abundance of >5 cM long shared allelic intervals with the High Medieval site nearby, implying long-term continuity and suggesting that similarly to Britain, the Early Medieval ancestry shifts left a significant and long-lasting impact on the genetic makeup of the Flemish population. We find substantial allele frequency differences between the two ancestry groups in pigmentation and diet-associated variants, including those linked with lactase persistence, likely reflecting ancestry change rather than local adaptation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    霍奇金淋巴瘤(HL)是最常见的淋巴瘤之一,发病率为每10万人中3人。目前的治疗使用基因毒性剂的混合物,包括阿霉素,博来霉素,长春碱,和达卡巴嗪(ABVD),有或没有放射治疗。这种治疗方案已被证明在杀死癌细胞方面是有效的,导致HL患者在五年时的无癌生存率>90%。然而,这种疗法没有特定的细胞靶点,它可以诱导非癌细胞基因组的损伤。先前的研究表明,HL幸存者通常表现出核型,其特征是复杂的染色体异常,难以通过常规条带分析。多色荧光原位杂交(M-FISH)是分析复杂核型的有力工具;我们使用M-FISH研究了5名健康人和5名HL患者外周血淋巴细胞染色体损伤的存在,during,抗癌治疗后一年。我们的结果表明,这种抗癌治疗引起的基因组混乱在完成治疗一年后仍然存在于HL患者的造血干细胞中。这种染色体不稳定性可能在10%的HL幸存者中观察到的第二原发癌的发生中起作用。本章将描述利用M-FISH研究霍奇金淋巴瘤(HL)患者治疗诱导的基因组混沌的协议,在简短的讨论之后。
    Hodgkin lymphoma (HL) is one of the most common lymphomas, with an incidence of 3 per 100,000 persons. Current treatment uses a cocktail of genotoxic agents, including adriamycin, bleomycin, vinblastine, and dacarbazine (ABVD), along with or without radiotherapy. This treatment regimen has proved to be efficient in killing cancer cells, resulting in HL patients having a survival rate of >90% cancer-free survival at five years. However, this therapy does not have a specific cell target, and it can induce damage in the genome of non-cancerous cells. Previous studies have shown that HL survivors often exhibit karyotypes characterized by complex chromosomal abnormalities that are difficult to analyze by conventional banding. Multicolor fluorescence in situ hybridization (M-FISH) is a powerful tool to analyze complex karyotypes; we used M-FISH to investigate the presence of chromosomal damage in peripheral blood lymphocytes from five healthy individuals and five HL patients before, during, and one year after anti-cancer treatment. Our results show that this anti-cancer treatment-induced genomic chaos that persists in the hematopoietic stem cells from HL patients one year after finishing therapy. This chromosomal instability may play a role in the occurrence of second primary cancers that are observed in 10% of HL survivors. This chapter will describe a protocol for utilizing M-FISH to study treatment-induced genome chaos in Hodgkin\'s lymphoma (HL) patients, following a brief discussion.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号