CpG Islands

CpG 群岛
  • 文章类型: Journal Article
    母亲吸烟与新生儿的不良健康结局有关,但其影响新生儿健康的程度尚未通过汇总脐带血DNA甲基化(DNAm)评分进行量化。这里,在欧洲白人和南亚人群中,我们研究了利用脐带血DNAm评分作为发现样本来捕获孕妇吸烟的表观遗传特征及其对新生儿的影响的可行性.
    我们首先检查了个体CpG与怀孕期间吸烟之间的关联,和吸烟暴露在两个欧洲白人出生队列中(n=744)。利用已建立的CpG来吸烟,我们构建了孕妇吸烟的脐带血表观遗传评分,该评分在一个欧洲裔队列中得到验证(n=347).然后测试这个分数与吸烟状况的关系,怀孕期间二次吸烟,在独立的欧洲白人(n=397)和南亚出生队列(n=504)中测量出生后后代的健康结果。
    先前报道的几个母亲吸烟基因得到了支持,具有来自GFI1基因的最强和最一致的关联信号(6个CpG,p<5×10-5)。表观遗传母亲吸烟评分与怀孕期间的吸烟状况密切相关(OR=1.09[1.07,1.10],p=5.5×10-33)以及每周自我报告吸烟暴露的更多小时(1.93[1.27,2.58],p=7.8×10-9)在欧洲白人中。然而,它与南亚人自我报告的暴露无关(p>0.05),可能是由于该组中缺乏吸烟。相同的分数始终与较小的出生尺寸(-0.37±0.12cm,p=0.0023)在南亚队列中,出生体重较低(-0.043±0.013kg,组合队列中的p=0.0011)。
    这项脐带血表观遗传评分可以帮助识别暴露于母亲吸烟的婴儿,并评估其对生长的长期影响。值得注意的是,这些结果表明,孕妇吸烟的DNAm特征与新生儿身材矮小和低出生体重之间存在一致的关联,在欧洲白人母亲谁表现出一定程度的吸烟和南亚母亲谁自己不是积极的吸烟者。
    本研究由加拿大健康研究院代谢组学研究团队资助:MWG-146332。
    UNASSIGNED: Maternal smoking has been linked to adverse health outcomes in newborns but the extent to which it impacts newborn health has not been quantified through an aggregated cord blood DNA methylation (DNAm) score. Here, we examine the feasibility of using cord blood DNAm scores leveraging large external studies as discovery samples to capture the epigenetic signature of maternal smoking and its influence on newborns in White European and South Asian populations.
    UNASSIGNED: We first examined the association between individual CpGs and cigarette smoking during pregnancy, and smoking exposure in two White European birth cohorts (n=744). Leveraging established CpGs for maternal smoking, we constructed a cord blood epigenetic score of maternal smoking that was validated in one of the European-origin cohorts (n=347). This score was then tested for association with smoking status, secondary smoking exposure during pregnancy, and health outcomes in offspring measured after birth in an independent White European (n=397) and a South Asian birth cohort (n=504).
    UNASSIGNED: Several previously reported genes for maternal smoking were supported, with the strongest and most consistent association signal from the GFI1 gene (6 CpGs with p<5 × 10-5). The epigenetic maternal smoking score was strongly associated with smoking status during pregnancy (OR = 1.09 [1.07, 1.10], p=5.5 × 10-33) and more hours of self-reported smoking exposure per week (1.93 [1.27, 2.58], p=7.8 × 10-9) in White Europeans. However, it was not associated with self-reported exposure (p>0.05) among South Asians, likely due to a lack of smoking in this group. The same score was consistently associated with a smaller birth size (-0.37±0.12 cm, p=0.0023) in the South Asian cohort and a lower birth weight (-0.043±0.013 kg, p=0.0011) in the combined cohorts.
    UNASSIGNED: This cord blood epigenetic score can help identify babies exposed to maternal smoking and assess its long-term impact on growth. Notably, these results indicate a consistent association between the DNAm signature of maternal smoking and a small body size and low birth weight in newborns, in both White European mothers who exhibited some amount of smoking and in South Asian mothers who themselves were not active smokers.
    UNASSIGNED: This study was funded by the Canadian Institutes of Health Research Metabolomics Team Grant: MWG-146332.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:血浆代谢组反映了各种生物过程的生理状态,可以作为疾病风险的代表。血浆代谢物变异,受遗传和表观遗传机制的影响,还可以影响细胞微环境和血细胞表观遗传学。血浆代谢组和血细胞表观基因组之间的相互作用仍然难以捉摸。在这项研究中,我们对来自LifeLines-DEEP队列的693名参与者的1183种血浆代谢物进行了全表观基因组关联研究(EWAS),并使用双向孟德尔随机化和中介分析研究了DNA甲基化-代谢物关联中的因果关系.
    结果:在严格调整潜在的混杂因素后,包括遗传学,我们确定了两种血浆代谢物(L-丝氨酸和甘氨酸)与位于两个独立基因组区域(PHGDH中的cg14476101和cg16246545以及SLC1A5中的cg02711608)的3个CpG位点之间的5个强关联,错误发现率小于0.05.进一步的分析揭示了血浆甘氨酸/丝氨酸水平与DNA甲基化之间复杂的双向关系。此外,我们观察到DNA甲基化在甘氨酸/丝氨酸对其代谢/转运基因表达的影响中具有很强的介导作用,介导效应的比例从11.8到54.3%不等。这一结果也在一个独立的基于人群的队列中得到了重复,鹿特丹研究。为了验证我们的发现,我们进行了体外细胞研究,证实了DNA甲基化在PHGDH基因表达调控中的中介作用。
    结论:我们的发现揭示了甘氨酸和丝氨酸通过DNA甲基化调节基因表达的潜在反馈机制。
    BACKGROUND: The plasma metabolome reflects the physiological state of various biological processes and can serve as a proxy for disease risk. Plasma metabolite variation, influenced by genetic and epigenetic mechanisms, can also affect the cellular microenvironment and blood cell epigenetics. The interplay between the plasma metabolome and the blood cell epigenome remains elusive. In this study, we performed an epigenome-wide association study (EWAS) of 1183 plasma metabolites in 693 participants from the LifeLines-DEEP cohort and investigated the causal relationships in DNA methylation-metabolite associations using bidirectional Mendelian randomization and mediation analysis.
    RESULTS: After rigorously adjusting for potential confounders, including genetics, we identified five robust associations between two plasma metabolites (L-serine and glycine) and three CpG sites located in two independent genomic regions (cg14476101 and cg16246545 in PHGDH and cg02711608 in SLC1A5) at a false discovery rate of less than 0.05. Further analysis revealed a complex bidirectional relationship between plasma glycine/serine levels and DNA methylation. Moreover, we observed a strong mediating role of DNA methylation in the effect of glycine/serine on the expression of their metabolism/transport genes, with the proportion of the mediated effect ranging from 11.8 to 54.3%. This result was also replicated in an independent population-based cohort, the Rotterdam Study. To validate our findings, we conducted in vitro cell studies which confirmed the mediating role of DNA methylation in the regulation of PHGDH gene expression.
    CONCLUSIONS: Our findings reveal a potential feedback mechanism in which glycine and serine regulate gene expression through DNA methylation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:虽然已经广泛研究了实足年龄与全血中DNA甲基化(DNAm)的关系,与年龄相关的DNAm变化的组织特异性仍然是一个活跃的研究领域.研究了大脑等组织中年龄与DNAm的关系,皮肤,免疫细胞,脂肪,肝脏已经确定了组织特异性和非特异性效应,因此,激发了对不同人体组织和细胞类型的其他研究。
    结果:这里,我们进行了全表观基因组关联研究,利用来自961个组织样本的DNAM数据(IlluminaEPIC阵列),代表9种组织类型(乳腺,肺,结肠,子房,前列腺,骨骼肌,睾丸,全血,和肾脏)来自基因型-组织表达(GTEx)项目。我们确定了8个组织中与年龄相关的CpG位点(错误发现率<0.05)(除骨骼肌外,n=47)。这包括在所有组织类型中的162,002个独特的高甲基化和90,626个低甲基化CpG位点,在单一组织类型中观察到130,137(80%)高甲基化CpG和74,703(82%)低甲基化CpG位点。虽然大多数与年龄相关的CpG位点表现为组织特异性,基因组特征之间的富集模式,如染色质状态和CpG岛,大多数组织都相似,提示细胞衰老的共同机制。与以前的发现一致,我们观察到,高甲基化的CpG位点在具有抑制的多梳特征和CpG岛的区域中富集,而低甲基化CpG位点优先出现在非CpG岛和增强子中。为了深入了解与年龄相关的DNAm变化的功能效应,我们评估了DNAm与局部基因表达变化之间的相关性,以确定与年龄相关的表达数量性状甲基化(age-eQTM).我们确定了多种组织类型中存在的几种年龄eQTM,包括在CDKN2A中,HENMT1和VCWE区域。
    结论:总体而言,我们的研究结果将有助于未来开发衰老的生物标志物,并了解不同人体组织类型的衰老机制。
    BACKGROUND: While the association of chronological age with DNA methylation (DNAm) in whole blood has been extensively studied, the tissue-specificity of age-related DNAm changes remains an active area of research. Studies investigating the association of age with DNAm in tissues such as brain, skin, immune cells, fat, and liver have identified tissue-specific and non-specific effects, thus, motivating additional studies of diverse human tissue and cell types.
    RESULTS: Here, we performed an epigenome-wide association study, leveraging DNAm data (Illumina EPIC array) from 961 tissue samples representing 9 tissue types (breast, lung, colon, ovary, prostate, skeletal muscle, testis, whole blood, and kidney) from the Genotype-Tissue Expression (GTEx) project. We identified age-associated CpG sites (false discovery rate < 0.05) in 8 tissues (all except skeletal muscle, n = 47). This included 162,002 unique hypermethylated and 90,626 hypomethylated CpG sites across all tissue types, with 130,137 (80%) hypermethylated CpGs and 74,703 (82%) hypomethylated CpG sites observed in a single tissue type. While the majority of age-associated CpG sites appeared tissue-specific, the patterns of enrichment among genomic features, such as chromatin states and CpG islands, were similar across most tissues, suggesting common mechanisms underlying cellular aging. Consistent with previous findings, we observed that hypermethylated CpG sites are enriched in regions with repressed polycomb signatures and CpG islands, while hypomethylated CpG sites preferentially occurred in non-CpG islands and enhancers. To gain insights into the functional effects of age-related DNAm changes, we assessed the correlation between DNAm and local gene expression changes to identify age-related expression quantitative trait methylation (age-eQTMs). We identified several age-eQTMs present in multiple tissue-types, including in the CDKN2A, HENMT1, and VCWE regions.
    CONCLUSIONS: Overall, our findings will aid future efforts to develop biomarkers of aging and understand mechanisms of aging in diverse human tissue types.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    结构变化严重影响癌症的分子格局,部分通过影响DNA甲基化介导的转录调控。这里,使用涉及儿童脑肿瘤网络中不同组织学的超过2400个小儿脑和中枢神经系统肿瘤的多组学数据集,我们报告了数百个基因和相关的CpG岛(CGI),其附近存在的体细胞结构变异(SV)断点与改变的表达或DNA甲基化有关。分别,包括肿瘤抑制基因ATRX和CDKN2A。增强子附近改变的DNA甲基化与附近的体细胞SV断点相关,包括MYC和MYCN。具有SV-CGI甲基化关联的基因子集也与患者生存具有表达关联。包括BCOR,TERT,RCOR2和PDLIM4。与同一患者的初始肿瘤相比,复发性或进行性肿瘤中的DNA甲基化变化可以预测儿科和成人癌症的生存率。我们的全面和全组织学基因组分析揭示了影响癌症基因的非编码改变的机制。
    Structural variation heavily influences the molecular landscape of cancer, in part by impacting DNA methylation-mediated transcriptional regulation. Here, using multi-omic datasets involving >2400 pediatric brain and central nervous system tumors of diverse histologies from the Children\'s Brain Tumor Network, we report hundreds of genes and associated CpG islands (CGIs) for which the nearby presence of somatic structural variant (SV) breakpoints is recurrently associated with altered expression or DNA methylation, respectively, including tumor suppressor genes ATRX and CDKN2A. Altered DNA methylation near enhancers associates with nearby somatic SV breakpoints, including MYC and MYCN. A subset of genes with SV-CGI methylation associations also have expression associations with patient survival, including BCOR, TERT, RCOR2, and PDLIM4. DNA methylation changes in recurrent or progressive tumors compared to the initial tumor within the same patient can predict survival in pediatric and adult cancers. Our comprehensive and pan-histology genomic analyses reveal mechanisms of noncoding alterations impacting cancer genes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:肾细胞癌(RCC),最致命的泌尿系肿瘤之一,约占所有成人癌症的3%,并在诊断时表现出高转移指数和高复发率。根治性或部分肾切除术是非转移性RCC的治愈选择。靶向治疗已被证明可以改善转移性RCC患者的生存率。然而,与RCC发病机制相关的潜在细胞和分子事件尚不清楚.
    方法:探讨转录因子激活蛋白(AP)-2α在RCC中的临床作用,采用甲基化CpG岛回收测定和微阵列分析。进行COBRA和RT-qPCR测定以评估RCC中的AP-2α表达。
    结果:AP-2αmRNA表达水平与甲基化状态呈负相关。多因素分析表明,AP-2αmRNA不仅是RCC总体和无病生存的主要危险因素,也是透明细胞RCC无病生存的主要危险因素。
    结论:我们的结果表明,AP-2α在RCC中的表达下调,并与患者的总生存期和无病生存期相关。提示AP-2α可能在RCC的发病机制中起重要作用。
    BACKGROUND: Renal cell carcinoma (RCC), one of the most fatal urologic tumors, accounts for approximately 3% of all adult cancers and exhibits a high metastatic index at diagnosis and a high rate of relapse. Radical or partial nephrectomy is a curative option for nonmetastatic RCCs. Targeted therapy has been shown to improve the survival of patients with metastatic RCCs. However, the underlying cellular and molecular events associated with RCC pathogenesis are not well known.
    METHODS: To investigate the clinical role of the transcription factor activator protein (AP)-2α in RCC, methylated CpG island recovery assays and microarray analysis were employed. COBRA and RT‒qPCR assays were performed to assess AP-2α expression in RCC.
    RESULTS: A negative correlation was noted between AP-2α mRNA expression levels and methylation status. Multivariate analyses showed that AP-2α mRNA was a major risk factor not only for overall and disease-free survival in RCC but also for disease-free survival in clear cell RCC.
    CONCLUSIONS: Our results indicated that AP-2α expression was deregulated in RCC and associated with overall patient survival and disease-free survival. Such findings suggest that AP-2α might play an important role in the pathogenesis of RCC.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    循环无细胞DNA(cfDNA)正在成为癌症检测的一种途径,但是对血液中cfDNA片段化的特征知之甚少。我们通过对969个个体的分析来评估DNA甲基化和基因表达对全基因组cfDNA片段化的影响。cfDNA片段末端更频繁地含有CC或CG,以CGs或CCGs结尾的片段被富集或耗尽,分别,在甲基化的CpG位置。更高水平和更大尺寸的cfDNA片段与CpG甲基化和降低的基因表达相关。这些作用在具有或不具有突变IDH1的等基因肿瘤的小鼠中得到验证,并且与癌症患者中cfDNA片段化的全基因组变化有关。肿瘤相关的低甲基化和增加的基因表达与cfDNA片段大小的减小相关,这可以解释人类癌症中较小的cfDNA片段。这些结果提供了表观遗传变化和cfDNA片段之间的联系,对疾病检测有意义。
    Circulating cell-free DNA (cfDNA) is emerging as an avenue for cancer detection, but the characteristics of cfDNA fragmentation in the blood are poorly understood. We evaluate the effect of DNA methylation and gene expression on genome-wide cfDNA fragmentation through analysis of 969 individuals. cfDNA fragment ends more frequently contained CCs or CGs, and fragments ending with CGs or CCGs are enriched or depleted, respectively, at methylated CpG positions. Higher levels and larger sizes of cfDNA fragments are associated with CpG methylation and reduced gene expression. These effects are validated in mice with isogenic tumors with or without the mutant IDH1, and are associated with genome-wide changes in cfDNA fragmentation in patients with cancer. Tumor-related hypomethylation and increased gene expression are associated with decrease in cfDNA fragment size that may explain smaller cfDNA fragments in human cancers. These results provide a connection between epigenetic changes and cfDNA fragmentation with implications for disease detection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:柴油废气颗粒(DEP),含有危险化合物,是在柴油燃烧过程中排放的。由于全球约有三分之一的车辆使用柴油,人们越来越担心DEP对人类健康构成的风险。长期暴露于DEP与气道高反应性有关,肺纤维化,和炎症;然而,DEP对呼吸道影响背后的分子机制知之甚少。这种机制可以通过检查转录和DNA甲基化变化来解决。尽管一些研究集中在短期DEP暴露对基因表达的影响上,缺乏对长期DEP暴露引起的转录效应和全基因组DNA甲基化变化的研究。因此,在这项研究中,我们研究了长期暴露于DEP引起的人腺癌肺泡基底上皮A549细胞的转录和DNA甲基化变化,并确定这些变化是否一致.
    结果:使用IlluminaInfinium甲基化EPICBeadChips进行的DNA甲基化分析表明,A549细胞中受DEP影响的CpG位点的甲基化水平呈剂量依赖性变化;仅在暴露于30µg/mlDEP的样品中,变化程度随剂量增加而增加,达到统计学意义。暴露于30µg/ml的DEP四周,在24,464个CpG位点显着诱导DNA低甲基化,它们显著富集了DNase过敏位点,H3K4me1和H3K27ac标记的基因组区域,和几个转录因子结合位点。相比之下,具有增加的DNA甲基化水平的9,436个CpG位点在由H3K27me3以及H3K4me1和H3K27ac标记的基因组区域中显著过量地代表。并行,通过RNA测序的基因表达谱分析表明,长期暴露于DEP改变了2,410个基因的表达水平,富集16个基因集,包括异种生物代谢,炎症反应,和衰老。计算机模拟分析显示,854个基因的表达水平与受DEP影响的顺式CpG位点的甲基化水平相关。
    结论:据我们所知,这是首次报道长期暴露于DEP后A549细胞中的全基因组转录和DNA甲基化变化及其关联.
    BACKGROUND: Diesel exhaust particles (DEP), which contain hazardous compounds, are emitted during the combustion of diesel. As approximately one-third of the vehicles worldwide use diesel, there are growing concerns about the risks posed by DEP to human health. Long-term exposure to DEP is associated with airway hyperresponsiveness, pulmonary fibrosis, and inflammation; however, the molecular mechanisms behind the effects of DEP on the respiratory tract are poorly understood. Such mechanisms can be addressed by examining transcriptional and DNA methylation changes. Although several studies have focused on the effects of short-term DEP exposure on gene expression, research on the transcriptional effects and genome-wide DNA methylation changes caused by long-term DEP exposure is lacking. Hence, in this study, we investigated transcriptional and DNA methylation changes in human adenocarcinoma alveolar basal epithelial A549 cells caused by prolonged exposure to DEP and determined whether these changes are concordant.
    RESULTS: DNA methylation analysis using the Illumina Infinium MethylationEPIC BeadChips showed that the methylation levels of DEP-affected CpG sites in A549 cells changed in a dose-dependent manner; the extent of change increased with increasing dose reaching the statistical significance only in samples exposed to 30 µg/ml DEP. Four-week exposure to 30 µg/ml of DEP significantly induced DNA hypomethylation at 24,464 CpG sites, which were significantly enriched for DNase hypersensitive sites, genomic regions marked by H3K4me1 and H3K27ac, and several transcription factor binding sites. In contrast, 9,436 CpG sites with increased DNA methylation levels were significantly overrepresented in genomic regions marked by H3K27me3 as well as H3K4me1 and H3K27ac. In parallel, gene expression profiling by RNA sequencing demonstrated that long-term exposure to DEP altered the expression levels of 2,410 genes, enriching 16 gene sets including Xenobiotic metabolism, Inflammatory response, and Senescence. In silico analysis revealed that the expression levels of 854 genes correlated with the methylation levels of the DEP-affected cis-CpG sites.
    CONCLUSIONS: To our knowledge, this is the first report of genome-wide transcriptional and DNA methylation changes and their associations in A549 cells following long-term exposure to DEP.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    从受孕到生命第一年的时间是人类发展中最活跃的时期。这段时间对于早产(妊娠<30周;VPT)的婴儿尤为重要,因为他们在正常的发育轨迹上经历了严重的破坏,并且有更高的发育障碍和发育迟缓的风险。在此期间表观遗传景观的变化可能反映了这种破坏,并揭示了衰老之间的相互关系。成熟,和表观基因组。我们评估了新生儿怀孕以来的胎龄(GA)和年龄[月经后年龄(PMA)],与NICU出院时收集的VPT婴儿口腔细胞DNA甲基化有关(n=538)。在调整混杂因素并应用Bonferroni校正后,我们确定了2,366个与GA相关的CpG和14,979个与PMA相关的CpG,以及多个差异甲基化区域。通路富集分析确定了参与轴突发生和神经元投射发育调节的通路,在许多其他生长和发育途径中(FDRq<0.001)。我们的发现与之前的工作一致,并确定了许多新颖的协会,这表明在生长发育中很重要的基因,尤其是神经发育,在出生的VPT儿童的早期发育过程中会发生实质性的表观遗传变化。
    The time from conception through the first year of life is the most dynamic period in human development. This time period is particularly important for infants born very preterm (< 30 weeks gestation; VPT), as they experience a significant disruption in the normal developmental trajectories and are at heightened risk of experiencing developmental impairments and delays. Variations in the epigenetic landscape during this period may reflect this disruption and shed light on the interrelationships between aging, maturation, and the epigenome. We evaluated how gestational age (GA) and age since conception in neonates [post-menstrual age (PMA)], were related to DNA methylation in buccal cells collected at NICU discharge from VPT infants (n = 538). After adjusting for confounders and applying Bonferroni correction, we identified 2,366 individual CpGs associated with GA and 14,979 individual CpGs associated with PMA, as well as multiple differentially methylated regions. Pathway enrichment analysis identified pathways involved in axonogenesis and regulation of neuron projection development, among many other growth and developmental pathways (FDR q < 0.001). Our findings align with prior work, and also identify numerous novel associations, suggesting that genes important in growth and development, particularly neurodevelopment, are subject to substantial epigenetic changes during early development among children born VPT.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    区分前列腺肿瘤与邻近良性组织的差异甲基化CpG位点(dmCpG)可以帮助前列腺癌的诊断和预后。以前,此类dmCpG的鉴定仅在根治性前列腺切除术(RP)样本中进行,而不是在原发性诊断肿瘤样本(穿刺活检或经尿道前列腺切除术)中进行。我们查询了澳大利亚的数据集,该数据集包含125个肿瘤和43个相邻的组织学良性诊断组织样本,包括41个配对的样本,使用InfiniumHumanMethylation450BeadChip。配对肿瘤和邻近良性样本的回归分析确定了2,386个显著的dmCpG(Bonferronip<0.01;δ-β≥40%),LASSO回归选择16个dmCpG,可在完整的澳大利亚诊断数据集中区分肿瘤样本(AUC=0.99)。结果在独立的北美(n配对=19;AUC=0.87)和癌症基因组图谱(TCGA;n配对=50;AUC=0.94)RP数据集中进行了验证。16个dmCpG中有两个在澳大利亚肿瘤样品中显著下调的基因中(Bonferronip<0.01;GSTM2和PRKCB)。10个额外的dmCpG区分低(n=34)和高格里森(n=88)评分的肿瘤诊断澳大利亚数据集(AUC=0.95),但应用于RP数据集时表现不佳(北美:AUC=0.66;TCGA:AUC=0.62)。此处鉴定的DNA甲基化标记可以增强和改善当前的诊断测试和/或形成未来预后测试的基础。
    Differentially methylated CpG sites (dmCpGs) that distinguish prostate tumour from adjacent benign tissue could aid in the diagnosis and prognosis of prostate cancer. Previously, the identification of such dmCpGs has only been undertaken in radical prostatectomy (RP) samples and not primary diagnostic tumour samples (needle biopsy or transurethral resection of the prostate). We interrogated an Australian dataset comprising 125 tumour and 43 adjacent histologically benign diagnostic tissue samples, including 41 paired samples, using the Infinium Human Methylation450 BeadChip. Regression analyses of paired tumour and adjacent benign samples identified 2,386 significant dmCpGs (Bonferroni p < 0.01; delta-β ≥ 40%), with LASSO regression selecting 16 dmCpGs that distinguished tumour samples in the full Australian diagnostic dataset (AUC = 0.99). Results were validated in independent North American (npaired = 19; AUC = 0.87) and The Cancer Genome Atlas (TCGA; npaired = 50; AUC = 0.94) RP datasets. Two of the 16 dmCpGs were in genes that were significantly down-regulated in Australian tumour samples (Bonferroni p < 0.01; GSTM2 and PRKCB). Ten additional dmCpGs distinguished low (n = 34) and high Gleason (n = 88) score tumours in the diagnostic Australian dataset (AUC = 0.95), but these performed poorly when applied to the RP datasets (North American: AUC = 0.66; TCGA: AUC = 0.62). The DNA methylation marks identified here could augment and improve current diagnostic tests and/or form the basis of future prognostic tests.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    将多组学数据集成到预测模型中有可能提高准确性,这对精准医学至关重要。在这项研究中,我们通过采用由先验生物学知识提供信息的神经网络,为多组数据开发了可解释的预测模型,称为可见网络。这些神经网络提供了对决策过程的见解,并可以揭示与性状和复杂疾病相关的潜在生物学机制的新观点。我们测试了性能,推断吸烟状况的可解释性和普遍性,使用来自BIOS联盟血液的全基因组RNA表达和CpG甲基化数据(四个人群队列,Ntotal=2940)。在队列交叉验证设置中,评估了诊断性能和解释的一致性.预测吸烟状况的性能一直很高,总体平均AUC为0.95(95%CI:0.90-1.00),并且解释显示涉及复制良好的基因,例如AHRR,GPR15和LRRN3。LDL水平预测仅在R2为0.07(95%CI:0.05-0.08)的单个队列中得到推广。推断年龄与基因COL11A2,AFAP1,OTUD7A的平均误差为5.16(95%CI:3.97-6.35)年,PTPRN2、ADARB2和CD34一致预测。对于这两个回归任务,我们发现使用多组学网络提高了性能,与可解释的单组学网络相比,稳定性和泛化性。我们认为,可见神经网络在多组学分析中具有巨大的潜力;它们优雅地结合了多组学数据,是可解释的,并很好地推广到来自不同队列的数据。
    Integrating multi-omics data into predictive models has the potential to enhance accuracy, which is essential for precision medicine. In this study, we developed interpretable predictive models for multi-omics data by employing neural networks informed by prior biological knowledge, referred to as visible networks. These neural networks offer insights into the decision-making process and can unveil novel perspectives on the underlying biological mechanisms associated with traits and complex diseases. We tested the performance, interpretability and generalizability for inferring smoking status, subject age and LDL levels using genome-wide RNA expression and CpG methylation data from the blood of the BIOS consortium (four population cohorts, Ntotal = 2940). In a cohort-wise cross-validation setting, the consistency of the diagnostic performance and interpretation was assessed. Performance was consistently high for predicting smoking status with an overall mean AUC of 0.95 (95% CI: 0.90-1.00) and interpretation revealed the involvement of well-replicated genes such as AHRR, GPR15 and LRRN3. LDL-level predictions were only generalized in a single cohort with an R2 of 0.07 (95% CI: 0.05-0.08). Age was inferred with a mean error of 5.16 (95% CI: 3.97-6.35) years with the genes COL11A2, AFAP1, OTUD7A, PTPRN2, ADARB2 and CD34 consistently predictive. For both regression tasks, we found that using multi-omics networks improved performance, stability and generalizability compared to interpretable single omic networks. We believe that visible neural networks have great potential for multi-omics analysis; they combine multi-omic data elegantly, are interpretable, and generalize well to data from different cohorts.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号