Genome, Human

基因组,Human
  • 文章类型: Review
    非编码RNA(ncRNA)是一类不具有编码蛋白质潜力的RNA分子。同时,它们可以占据人类基因组的很大一部分,并通过各种机制参与基因表达调控。妊娠期糖尿病(GDM)是在怀孕期间开始或首次检测到的碳水化合物不耐受的病理状况,使其成为最常见的妊娠并发症之一。虽然GDM的确切发病机制尚不清楚,最近的一些研究表明,ncRNAs在GDM中起着至关重要的调节作用。在这里,我们对ncRNAs在GDM中的多种机制及其作为生物标志物的潜在作用进行了全面综述。此外,我们研究了基于深度学习的模型在发现疾病特异性ncRNA生物标志物和阐明ncRNA的潜在机制方面的贡献。这可能有助于社区范围的努力,以深入了解ncRNAs在疾病中的调控机制,并指导疾病的早期诊断和治疗的新方法。
    Non-coding RNAs (ncRNAs) are a class of RNA molecules that do not have the potential to encode proteins. Meanwhile, they can occupy a significant portion of the human genome and participate in gene expression regulation through various mechanisms. Gestational diabetes mellitus (GDM) is a pathologic condition of carbohydrate intolerance that begins or is first detected during pregnancy, making it one of the most common pregnancy complications. Although the exact pathogenesis of GDM remains unclear, several recent studies have shown that ncRNAs play a crucial regulatory role in GDM. Herein, we present a comprehensive review on the multiple mechanisms of ncRNAs in GDM along with their potential role as biomarkers. In addition, we investigate the contribution of deep learning-based models in discovering disease-specific ncRNA biomarkers and elucidate the underlying mechanisms of ncRNA. This might assist community-wide efforts to obtain insights into the regulatory mechanisms of ncRNAs in disease and guide a novel approach for early diagnosis and treatment of disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    几十年来,众所周知,人类DNA中大量的基因表现出重叠;然而,这些重叠的生物学和进化意义仍然知之甚少。这项研究的重点是调查重叠的特定情况,其中重叠的DNA区域包含蛋白质编码基因的编码DNA序列(CDS)。结果表明,重叠CDS编码的蛋白质比非重叠CDS编码的蛋白质表现出更大的紊乱。此外,这些DNA区域被鉴定为富含GC。这可以部分归因于来自两个不同阅读框而不是一个阅读框的终止密码子的缺失。此外,发现这些区域的单核苷酸多态性(SNP)位点较少,可能是由于重叠状态引起的限制,其中突变可能同时影响两个基因。在阐明这些属性的同时,NR1D1-THRA基因对是一种特殊情况,具有高度结构化的蛋白质,并且在整个欧洲哺乳动物中具有明显保守的序列。NR1D1和THRA都是在其C端缺乏配体结合域的核受体,这是这些基因对重叠的区域。NR1D1基因参与昼夜节律的调节,虽然THRA基因编码甲状腺激素受体,两者都在各种生理过程中起着至关重要的作用。这项研究表明,除了他们既定的功能之外,这些基因的特殊重叠的CDS区域可能编码额外的蛋白质片段,但未被发现,生物学角色。
    For several decades, it has been known that a substantial number of genes within human DNA exhibit overlap; however, the biological and evolutionary significance of these overlaps remain poorly understood. This study focused on investigating specific instances of overlap where the overlapping DNA region encompasses the coding DNA sequences (CDSs) of protein-coding genes. The results revealed that proteins encoded by overlapping CDSs exhibit greater disorder than those from nonoverlapping CDSs. Additionally, these DNA regions were identified as GC-rich. This could be partially attributed to the absence of stop codons from two distinct reading frames rather than one. Furthermore, these regions were found to harbour fewer single-nucleotide polymorphism (SNP) sites, possibly due to constraints arising from the overlapping state where mutations could affect two genes simultaneously.While elucidating these properties, the NR1D1-THRA gene pair emerged as an exceptional case with highly structured proteins and a distinctly conserved sequence across eutherian mammals. Both NR1D1 and THRA are nuclear receptors lacking a ligand-binding domain at their C-terminus, which is the region where these gene pairs overlap. The NR1D1 gene is involved in the regulation of circadian rhythm, while the THRA gene encodes a thyroid hormone receptor, and both play crucial roles in various physiological processes. This study suggests that, in addition to their well-established functions, the specifically overlapping CDS regions of these genes may encode protein segments with additional, yet undiscovered, biological roles.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在过去的十年中,全基因组测序能力得到了巨大的改善,同时成本也大幅下降。导致大量高质量的从头基因组。出于这个原因,基因组质量度量的持续发展势在必行。使用2016年大西洋宽吻海豚NCBIRefSeq注释和基于质谱的六种组织的蛋白质组学分析,我们确认了来自4711个蛋白质组的10,402个蛋白质,构成了近三分之一的可能预测的蛋白质。由于鉴定具有更多鉴定的肽的较大蛋白质意味着减少的数据库片段化和改进的基因注释准确性,我们提出了度量NP10,它试图捕捉这种质量改进。NP10度量是通过首先基于每个蛋白质的肽数量鉴定鉴定鉴定的蛋白质的最高十分位数(或第10个10分位数)来对蛋白质组结果进行分层,然后返回所得蛋白质的中值分子量来计算的。当使用2016年与2012年的Tursiopstruncatus基因组注释来搜索此蛋白质组数据集时,NP10有21%的改善。通过使用公开可用的蛋白质组数据集比较2004年,2013年和2016年的人类基因组注释,进一步证明了这一指标,表明NP10提高了33%。这些结果表明,蛋白质组学可能是一个有用的计量工具,以基准基因组的准确性,尽管需要跨物种参考蛋白质组数据集,以促进新的从头和现有基因组的评估。
    The last decade has witnessed dramatic improvements in whole-genome sequencing capabilities coupled to drastically decreased costs, leading to an inundation of high-quality de novo genomes. For this reason, the continued development of genome quality metrics is imperative. Using the 2016 Atlantic bottlenose dolphin NCBI RefSeq annotation and mass spectrometry-based proteomic analysis of six tissues, we confirmed 10,402 proteins from 4711 protein groups, constituting nearly one-third of the possible predicted proteins. Since the identification of larger proteins with more identified peptides implies reduced database fragmentation and improved gene annotation accuracy, we propose the metric NP10, which attempts to capture this quality improvement. The NP10 metric is calculated by first stratifying proteomic results by identifying the top decile (or 10th 10-quantile) of identified proteins based on the number of peptides per protein and then returns the median molecular weight of the resulting proteins. When using the 2016 versus 2012 Tursiops truncatus genome annotation to search this proteomic data set, there was a 21% improvement in NP10. This metric was further demonstrated by using a publicly available proteomic data set to compare human genome annotations from 2004, 2013 and 2016, which showed a 33% improvement in NP10. These results demonstrate that proteomics may be a useful metrological tool to benchmark genome accuracy, though there is a need for reference proteomic datasets across species to facilitate the evaluation of new de novo and existing genome.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这篇综述认为抗病毒核苷类似物药物,包括利巴韦林,favipiravir,和莫努比拉韦,通过致死诱变作为一种作用方式,在SARS-CoV或SARS-CoV-2中诱导基因组错误突变。体外数据表明,莫那普拉韦作为抗病毒剂的效力可能是利巴韦林或法维吡韦的100倍。Molnupiravir最近在3期临床试验中证明了疗效。由于其预期的全球使用,它的相对效力,及其活性成分的体外“宿主”细胞诱变性报道,β-d-N4-羟胞苷,我们回顾了莫努比拉韦的发展及其遗传毒性安全性评估,以及三种同源物的遗传毒性概况,也就是说,利巴韦林,favipiravir,和5-(2-氯乙基)-2'-脱氧尿苷。我们在所有可用信息的基础上考虑了莫那普拉韦的潜在遗传风险,并着重于对使用莫那普拉韦和类似药物治疗的患者的其他人类遗传毒性数据和随访的需求。此类人类数据与抗病毒NA特别相关,所述抗病毒NA具有永久修饰治疗患者的基因组和/或引起人类致畸性或胚胎毒性的潜力。我们得出的结论是,临床前遗传毒性研究和第一阶段人类临床安全性的结果,耐受性,和药代动力学是药物安全性评估的关键组成部分,也是未预期的不良健康影响的前哨指标。我们提供了在第一阶段临床试验之前和之内进行更彻底的遗传毒性测试的理由。包括在接受抗病毒NA治疗的患者的DNA和线粒体DNA中进行的人类PIG-A和错误校正的下一代测序(双重测序)研究,这些DNA通过致死诱变诱导基因组错误突变。
    This review considers antiviral nucleoside analog drugs, including ribavirin, favipiravir, and molnupiravir, which induce genome error catastrophe in SARS-CoV or SARS-CoV-2 via lethal mutagenesis as a mode of action. In vitro data indicate that molnupiravir may be 100 times more potent as an antiviral agent than ribavirin or favipiravir. Molnupiravir has recently demonstrated efficacy in a phase 3 clinical trial. Because of its anticipated global use, its relative potency, and the reported in vitro \"host\" cell mutagenicity of its active principle, β-d-N4-hydroxycytidine, we have reviewed the development of molnupiravir and its genotoxicity safety evaluation, as well as the genotoxicity profiles of three congeners, that is, ribavirin, favipiravir, and 5-(2-chloroethyl)-2\'-deoxyuridine. We consider the potential genetic risks of molnupiravir on the basis of all available information and focus on the need for additional human genotoxicity data and follow-up in patients treated with molnupiravir and similar drugs. Such human data are especially relevant for antiviral NAs that have the potential of permanently modifying the genomes of treated patients and/or causing human teratogenicity or embryotoxicity. We conclude that the results of preclinical genotoxicity studies and phase 1 human clinical safety, tolerability, and pharmacokinetics are critical components of drug safety assessments and sentinels of unanticipated adverse health effects. We provide our rationale for performing more thorough genotoxicity testing prior to and within phase 1 clinical trials, including human PIG-A and error corrected next generation sequencing (duplex sequencing) studies in DNA and mitochondrial DNA of patients treated with antiviral NAs that induce genome error catastrophe via lethal mutagenesis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Schizophrenia is a polygenic disease; however, the specific risk genetic variants of schizophrenia are still largely unknown. Single nucleotide polymorphism (SNP) is important genetic factor for the susceptibility of schizophrenia. Investigating individual candidate gene contributing to disease risk remains important.
    In a case-control study, five SNPs located in 6p21.3-p23.1 including rs2021722 in human leukocyte antigen (HLA) locus and rs107822, rs383711, rs439205 and rs421446 within the upstream of microRNA-219a-1 were genotyped in 454 schizophrenia patients and 445 healthy controls to investigate the possible association between the loci and schizophrenia in a Han Chinese population.
    Our results showed significant associations between the rs2021722 and schizophrenia in allele (A vs. G: adjusted OR = 1.661, 95%CI = 1.196-2.308), co-dominant (AG vs. GG: OR = 1.760, 95%CI = 1.234-2.510) and dominant genetic model (AG + AA vs. GG: OR = 1.756, 95%CI = 1.237-2.492), respectively. Haplotype analysis showed that TGGT and CAAC were protective factor for schizophrenia compared with TAAC haplotype (OR = 0.324, 95% CI = 0.157-0.672; OR = 0.423, 95% CI = 0.199-0.900).
    These findings indicate that rs2021722 in HLA locus might be involved in pathogenesis of schizophrenia and that genotypes AG and allele A of the locus are risk factors for schizophrenia in the Han Chinese population, confirming the association between immune system and schizophrenia.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Historical Article
    意大利半岛,横跨地中海的天然码头,自从欧洲人类占领开始以来,见证了错综复杂的人口事件。在过去的几年里,国际研究界已经发表了越来越多的来自该地区的现代和古代基因组。这种基因组观点开始揭示意大利的相关性,以了解欧洲的最后一次冰川最大(LGM)重新居住,新石器时代向西迁移的早期阶段,及其在铁器时代之后在东地中海地区和西地中海地区之间的联系作用。然而,许多悬而未决的问题仍在等待更多的数据得到充分解决。通过这次审查,我们总结了从可用的古代意大利人中出现的当前知识,通过一次重新分析它们,我们试图阐明该领域未来研究应涵盖的途径。特别是,悬而未决的问题涉及(1)Villabruna前欧洲人的命运以及LGM后的狩猎采集者在多大程度上吸收了它们的基因组成分;(2)西西里岛和撒丁岛在LGM之前的作用;(3)在何种程度上,新石器时代早期定居者中记录的遗传结构可以被描述为两个独立的迁徙;(4)在新石器时代和类似的伊朗时代的欧洲样本中明显
    The Italian Peninsula, a natural pier across the Mediterranean Sea, witnessed intricate population events since the very beginning of the human occupation in Europe. In the last few years, an increasing number of modern and ancient genomes from the area have been published by the international research community. This genomic perspective started unveiling the relevance of Italy to understand the post-Last Glacial Maximum (LGM) re-peopling of Europe, the earlier phase of the Neolithic westward migrations, and its linking role between Eastern and Western Mediterranean areas after the Iron Age. However, many open questions are still waiting for more data to be addressed in full. With this review, we summarize the current knowledge emerging from the available ancient Italian individuals and, by re-analysing them all at once, we try to shed light on the avenues future research in the area should cover. In particular, open questions concern (1) the fate of pre-Villabruna Europeans and to what extent their genomic components were absorbed by the post-LGM hunter-gatherers; (2) the role of Sicily and Sardinia before LGM; (3) to what degree the documented genetic structure within the Early Neolithic settlers can be described as two separate migrations; (4) what are the population events behind the marked presence of an Iranian Neolithic-like component in Bronze Age and Iron Age Italian and Southern European samples.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    古代人类遗骸有可能解释人类的史前史。由于最近的技术和生物信息学的进步,他们的研究,在古基因组学层面,可以提供有关人口动态的重要信息,文化变迁,以及我们祖先的生活方式.在这项研究中,从人类骨骼遗骸获得的线粒体和核基因组数据与新石器时代的球状两性培养物有关,在Kierzkowo(波兰)的巨石手推车中回收,重新分析,以深入了解考古遗址的社会组织和使用,并提供个人层面的信息。我们能够成功地估计出最小的个体数量,性别,亲属关系,和被埋葬个体的表型特征,尽管骨骼样本的保存水平很低,并且有复杂的解剖学条件。此外,对损害模式的评估使我们能够突出“入侵者”的存在-也就是说,最近的骨骼遗骸不属于原始墓葬。由于其特点,对Kierzkowo手推车的研究对重建利用它的人类社区的生物学特征提出了挑战,也是古代基因组分析可以为考古重建提供贡献的一个很好的例子。
    Ancient human remains have the potential to explain a great deal about the prehistory of humankind. Due to recent technological and bioinformatics advances, their study, at the palaeogenomic level, can provide important information about population dynamics, culture changes, and the lifestyles of our ancestors. In this study, mitochondrial and nuclear genome data obtained from human bone remains associated with the Neolithic Globular Amphorae culture, which were recovered in the Megalithic barrow of Kierzkowo (Poland), were reanalysed to gain insight into the social organisation and use of the archaeological site and to provide information at the individual level. We were able to successfully estimate the minimum number of individuals, sex, kin relationships, and phenotypic traits of the buried individuals, despite the low level of preservation of the bone samples and the intricate taphonomic conditions. In addition, the evaluation of damage patterns allowed us to highlight the presence of \"intruders\"-that is, of more recent skeletal remains that did not belong to the original burial. Due to its characteristics, the study of the Kierzkowo barrow represented a challenge for the reconstruction of the biological profile of the human community who exploited it and an excellent example of the contribution that ancient genomic analysis can provide to archaeological reconstruction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    基因簇正在成为基因鉴定的有希望的工具。该研究揭示了基因的目的基因组分布朝向完整代谢途径/表型的更高遗传率,因此,更高的健身。共同表达的共同本地化,共同互动,功能相关的基因被发现是人类全基因组趋势,鼠标,金鹰,稻米鱼,果蝇,花生,和拟南芥。如预期,分析验证了共定位事件的共隔离。共同定位事件的可能性与基因座间距离之间呈负相关。还发现基因组块的进化沿染色体臂收敛且均匀。因此,建议将负责相邻代谢反应的基因组块用于鉴定候选基因和解释细胞功能。作为一个案例故事,根据Slc25A44的基因组局部信息,提出了一种能量代谢和次生代谢产物的功能。Slc25A44被进一步表征为必需的管家基因,该基因一直处于进化净化压力下,属于SLC25s的系统发育ETC进化枝。途径富集将Slc25A44s映射到能量代谢。花生和人Slc25A44s在卵母细胞和酿酒酵母菌株中的表达证实了次级代谢产物和泛醌的常见前体的运输。这些结果表明,SLC25A44是具有生物技术应用的线粒体-ER-核区转运蛋白。最后,发现在运输腔的胞浆表面上保守的三个氨基酸特征对于SLC25s的合理工程很重要。
    Gene clusters are becoming promising tools for gene identification. The study reveals the purposive genomic distribution of genes toward higher inheritance rates of intact metabolic pathways/phenotypes and, thereby, higher fitness. The co-localization of co-expressed, co-interacting, and functionally related genes was found as genome-wide trends in humans, mouse, golden eagle, rice fish, Drosophila, peanut, and Arabidopsis. As anticipated, the analyses verified the co-segregation of co-localized events. A negative correlation was notable between the likelihood of co-localization events and the inter-loci distances. The evolution of genomic blocks was also found convergent and uniform along the chromosomal arms. Calling a genomic block responsible for adjacent metabolic reactions is therefore recommended for identification of candidate genes and interpretation of cellular functions. As a case story, a function in the metabolism of energy and secondary metabolites was proposed for Slc25A44, based on its genomic local information. Slc25A44 was further characterized as an essential housekeeping gene which has been under evolutionary purifying pressure and belongs to the phylogenetic ETC-clade of SLC25s. Pathway enrichment mapped the Slc25A44s to the energy metabolism. The expression of peanut and human Slc25A44s in oocytes and Saccharomyces cerevisiae strains confirmed the transport of common precursors for secondary metabolites and ubiquinone. These results suggest that SLC25A44 is a mitochondrion-ER-nucleus zone transporter with biotechnological applications. Finally, a conserved three-amino acid signature on the cytosolic face of transport cavity was found important for rational engineering of SLC25s.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    仅病例(CO)研究是揭示复杂人类疾病的基因-环境(G×E)相互作用的有力手段。此外,原则上,此类研究还可以利用基因型插补来进一步提高统计能力。然而,基因型归因通常采用健康对照,如单倍型参考联盟(HRC)数据作为归因基础,这可能会系统地扰乱基因组区域的CO研究,主要影响疾病风险。使用来自719名德国克罗恩病(CD)患者的基因型数据,我们调查了具有或不具有遗传主要效应的单核苷酸多态性(SNP)可实现的填补准确性水平,并且具有变化的次要等位基因频率(MAF)。使用HRC数据作为归因基础,从不同水平的连锁不平衡(LD)的相邻SNP估算基因型到目标SNP。真实基因型和估算基因型的比较显示,SNP的归因准确性较低,具有很强的主要作用。我们还模拟了不同水平的G×E相互作用,以评估由于使用估算基因型而导致的统计有效性和功效的潜在损失。零假设下的模拟表明,基因型归因不会增加G×E的CO研究的I型错误率。但是,发现统计能力因估算而降低,特别是对于具有低MAF的SNP,当驱动归因的SNP的LD水平降低时,统计能力逐渐丧失。因此,我们的研究强调,在G×E相互作用的CO研究中应格外小心地使用基因型归因。
    Case-only (CO) studies are a powerful means to uncover gene-environment (G × E) interactions for complex human diseases. Moreover, such studies may in principle also draw upon genotype imputation to increase statistical power even further. However, genotype imputation usually employs healthy controls such as the Haplotype Reference Consortium (HRC) data as an imputation base, which may systematically perturb CO studies in genomic regions with main effects upon disease risk. Using genotype data from 719 German Crohn Disease (CD) patients, we investigated the level of imputation accuracy achievable for single nucleotide polymorphisms (SNPs) with or without a genetic main effect, and with varying minor allele frequency (MAF). Genotypes were imputed from neighbouring SNPs at different levels of linkage disequilibrium (LD) to the target SNP using the HRC data as an imputation base. Comparison of the true and imputed genotypes revealed lower imputation accuracy for SNPs with strong main effects. We also simulated different levels of G × E interaction to evaluate the potential loss of statistical validity and power incurred by the use of imputed genotypes. Simulations under the null hypothesis revealed that genotype imputation does not inflate the type I error rate of CO studies of G × E. However, the statistical power was found to be reduced by imputation, particularly for SNPs with low MAF, and a gradual loss of statistical power resulted when the level of LD to the SNPs driving the imputation decreased. Our study thus highlights that genotype imputation should be employed with great care in CO studies of G × E interaction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Case Reports
    Ovarian mature cystic teratomas comprise tissues derived from all three germ layers. In rare cases, malignant tumors arise from ovarian mature cystic teratoma. A variety of tumors can arise from mature cystic teratoma, among which primary malignant melanoma (MM), for which no molecular analyses such as genomic sequencing have been reported to date, is exceedingly rare, thereby limiting possible therapeutic options using precision medicine. We used targeted gene sequencing to analyze the status of 160 cancer-related genes in a patient with MM arising from an ovarian mature cystic teratoma (MM-MCT). KRAS amplification and homozygous deletion in PTEN and RB1 were detected in tumor samples collected from the patient. No KRAS amplification has been previously reported in cutaneous MM, indicating that the carcinogenesis of MM-MCT differs from that of primary cutaneous melanomas. A better understanding of the underlying genetic mechanisms will help clarify the carcinogenesis of MM-MCT. In turn, this will enable treatment with novel targeting agents as well as the initial exploration of gene-based precision oncological therapies, which aim to improve treatment outcomes for patients with this disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号