ClinVar

ClinVar
  • 文章类型: Journal Article
    背景:遗传变异数据库有助于临床医生和研究人员解释遗传变异。然而,这些数据库包含一些错误分类的变体。随着这些数据库的迅速发展和实施新的指南,变体错误分类是否正在减弱尚不清楚。
    方法:使用ClinVar和HGMD的档案,我们调查了6年来变异错误分类的变化,跨越不同的祖先群体。我们将在新生儿中筛查的先天性代谢错误(IEM)视为模型系统,因为这些疾病通常对新生儿表型具有高度渗透性。我们使用来自1000基因组计划(1KGP)的样本来鉴定具有被数据库分类为致病性的基因型的个体。由于IEM的稀有性,几乎所有此类分类的致病基因型都表明ClinVar或HGMD中可能存在变异错误分类。
    结果:虽然ClinVar和HGMD的假阳性率随着时间的推移有所改善,HGMD变体目前暗示1KGP中受影响的个体比ClinVar变体多两个数量级。我们观察到,当使用HGMD变体时,非洲血统个体被错误地指示受筛选的IEM影响的机会显着增加。然而,一旦根据最近的变异分类指南去除常见变异,这种影响非洲血统基因组的偏倚就不再显著.我们发现分类为致病性或可能致病性的ClinVar变体的重新分类频率比DM或DM?HGMD中的变体高六倍,这可能导致ClinVar的假阳性率较低。
    结论:考虑到已被重新分类的错误分类变异,揭示了我们对罕见遗传变异的认识不断提高。我们发现,变异分类指南和包含遗传多样性样本的等位基因频率数据库是重新分类的重要因素。我们还发现,在欧洲和南亚个体中常见的ClinVar变体更有可能被重新分类为较低的置信度类别。可能是由于这些变体被多个提交者分类的机会增加。我们讨论了变体分类数据库的功能,这些功能将支持它们的持续改进。
    Curated databases of genetic variants assist clinicians and researchers in interpreting genetic variation. Yet, these databases contain some misclassified variants. It is unclear whether variant misclassification is abating as these databases rapidly grow and implement new guidelines.
    Using archives of ClinVar and HGMD, we investigated how variant misclassification has changed over 6 years, across different ancestry groups. We considered inborn errors of metabolism (IEMs) screened in newborns as a model system because these disorders are often highly penetrant with neonatal phenotypes. We used samples from the 1000 Genomes Project (1KGP) to identify individuals with genotypes that were classified by the databases as pathogenic. Due to the rarity of IEMs, nearly all such classified pathogenic genotypes indicate likely variant misclassification in ClinVar or HGMD.
    While the false-positive rates of both ClinVar and HGMD have improved over time, HGMD variants currently imply two orders of magnitude more affected individuals in 1KGP than ClinVar variants. We observed that African ancestry individuals have a significantly increased chance of being incorrectly indicated to be affected by a screened IEM when HGMD variants are used. However, this bias affecting genomes of African ancestry was no longer significant once common variants were removed in accordance with recent variant classification guidelines. We discovered that ClinVar variants classified as Pathogenic or Likely Pathogenic are reclassified sixfold more often than DM or DM? variants in HGMD, which has likely resulted in ClinVar\'s lower false-positive rate.
    Considering misclassified variants that have since been reclassified reveals our increasing understanding of rare genetic variation. We found that variant classification guidelines and allele frequency databases comprising genetically diverse samples are important factors in reclassification. We also discovered that ClinVar variants common in European and South Asian individuals were more likely to be reclassified to a lower confidence category, perhaps due to an increased chance of these variants being classified by multiple submitters. We discuss features for variant classification databases that would support their continued improvement.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Since RBPs play important roles in the cell, it\'s particularly important to find new RBPs. We performed iRIP-seq and CLIP-seq to verify two proteins, CLIP1 and DMD, predicted by RBPPred whether are RBPs or not. The experimental results confirm that these two proteins have RNA-binding activity. We identified significantly enriched binding motifs UGGGGAGG, CUUCCG and CCCGU for CLIP1 (iRIP-seq), DMD (iRIP-seq) and DMD (CLIP-seq), respectively. The computational KEGG and GO analysis show that the CLIP1 and DMD share some biological processes and functions. Besides, we found that the SNPs between DMD and its RNA partners may be associated with Becker muscular dystrophy, Duchenne muscular dystrophy, Dilated cardiomyopathy 3B and Cardiovascular phenotype. Among the thirteen cancers data, CLIP1 and another 300 oncogenes always co-occur, and 123 of these 300 genes interact with CLIP1. These cancers may be associated with the mutations occurred in both CLIP1 and the genes it interacts with.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Alzheimer\'s disease (AD) is a common neurodegenerative disease with high morbidity among elderly people. A genetic attribution has been extensively proved. Here, we propose to further prioritize genes that harbor single nucleotide variation (SNV) or structural variation (SV) for AD and explore the underlying potential mechanisms through exploiting their expression and methylation spectra. A high-confidence AD-associated candidate gene list was obtained from the ClinVar and Human Gene Mutation Database (HGMD). Genome-wide methylation and expression profiles of AD and normal subjects were downloaded from the Gene Expression Omnibus (GEO). Through comprehensive comparison of expression and methylation levels between AD and normal samples, as well as different stages of AD samples, SORL1 was identified as the most plausible gene for AD incidence and progression. Gene Set Enrichment Analysis (GSEA) revealed significant activation of the ABC (ATP binding cassette) transporter with the aberrant up-regulation of SORL1 within AD samples. This study unfolds the expression and methylation spectra of previously probed genes with SNV or SV in AD for the first time, and reports an aberrant activation of the ABC transporter pathway that might contribute to AD progression. This should shed some light on AD diagnosis and precision treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    The Human Genome Variation Society (HGVS) nomenclature guidelines encourage the accurate and standard description of DNA, RNA, and protein sequence variants in public variant databases and the scientific literature. Inconsistent application of the HGVS guidelines can lead to misinterpretation of variants in clinical settings. Reliable software tools are essential to ensure consistent application of the HGVS guidelines when reporting and interpreting variants. We present the hgvs Python package, a comprehensive tool for manipulating sequence variants according to the HGVS nomenclature guidelines. Distinguishing features of the hgvs package include: (1) parsing, formatting, validating, and normalizing variants on genome, transcript, and protein sequences; (2) projecting variants between aligned sequences, including those with gapped alignments; (3) flexible installation using remote or local data (fully local installations eliminate network dependencies); (4) extensive automated tests; and (5) open source development by a community from eight organizations worldwide. This report summarizes recent and significant updates to the hgvs package since its original release in 2014, and presents results of extensive validation using clinical relevant variants from ClinVar and HGMD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号