variant classification

变体分类
  • 文章类型: Journal Article
    背景:对于人类疾病,变体可以是致病性的或良性的。从良性到致病性的当前分类类别反映了对当前理解的概率总结。变体效应的多重测定(MAVE)的临床效用的主要度量是可以从不确定显著性(VUS)重新分类的变体的数量。然而,这种效用衡量标准的一个差距是,它低估了从MAVE获得的信息。这项研究的目的是为MAVE效用开发一种改进的量化指标。我们建议采用信息内容方法,其中包括不对变体进行重新分类的数据将更好地反映真实的信息增益。我们采用了信息含量的方法来评估信息增益,以位为单位,对于BRCA1,PTEN,TP53。这里,1位表示从无信息开始对单个变体进行完全分类所需的信息量。
    结果:BRCA1MAVE产生了总共831.2位的信息,BRCA1中总误义信息的6.58%,比仅导致VUS重新分类的信息增加了22倍。PTENMAVE产生了2059.6位信息,占PTEN中全部错义信息的32.8%,比促成VUS重新分类的信息增加了85倍。TP53MAVE产生了277.8位的信息,占TP53中全部错义信息的6.22%,比有助于VUS重新分类的信息增加了3.5倍。
    结论:与计算重新分类的变异数量相比,信息内容方法将更准确地描绘通过MAVE作图努力获得的信息。这种信息内容方法还可以帮助定义修改用于对变体组进行分类的信息定义的指南更改的影响。
    BACKGROUND: A variant can be pathogenic or benign with relation to a human disease. Current classification categories from benign to pathogenic reflect a probabilistic summary of the current understanding. A primary metric of clinical utility for multiplexed assays of variant effect (MAVE) is the number of variants that can be reclassified from uncertain significance (VUS). However, a gap in this measure of utility is that it underrepresents the information gained from MAVEs. The aim of this study was to develop an improved quantification metric for MAVE utility. We propose adopting an information content approach that includes data that does not reclassify variants will better reflect true information gain. We adopted an information content approach to evaluate the information gain, in bits, for MAVEs of BRCA1, PTEN, and TP53. Here, one bit represents the amount of information required to completely classify a single variant starting from no information.
    RESULTS: BRCA1 MAVEs produced a total of 831.2 bits of information, 6.58% of the total missense information in BRCA1 and a 22-fold increase over the information that only contributed to VUS reclassification. PTEN MAVEs produced 2059.6 bits of information which represents 32.8% of the total missense information in PTEN and an 85-fold increase over the information that contributed to VUS reclassification. TP53 MAVEs produced 277.8 bits of information which represents 6.22% of the total missense information in TP53 and a 3.5-fold increase over the information that contributed to VUS reclassification.
    CONCLUSIONS: An information content approach will more accurately portray information gained through MAVE mapping efforts than by counting the number of variants reclassified. This information content approach may also help define the impact of guideline changes that modify the information definitions used to classify groups of variants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:研究如果在没有异常的胎儿中检测到有多少新的致病性(P)和可能致病性(LP)的非蛋白截短或非规范剪接变异将被归类为未知意义的变异(VUS)。
    方法:该研究包括156例通过出生后外显子组测序诊断的神经发育障碍患者。在没有特定产前表现的病例中,对导致P/LP非蛋白截短和非规范剪接变异进行回顾性重新分类。忽视产后症状。
    结果:在156名患者中,72具有非截短或非规范剪接变体。六名患者因具有一个以上可能的致病变异而被排除。十二名患者的产前畸形已知与诊断的疾病相关;因此,变体解释保持不变。在剩下的54个案件中,有33个,该变异体以前被报道为P/LP.对其他21种LP/P变体的重新分类显示,如果在产前检测到,有16种将被分类为VUS。
    结论:在我们的队列中,如果在怀孕期间进行测序,则有24%(16/66)的非蛋白质截短/非规范剪接变体将被分类为VUS。假阴性结果的可能性,源于产前可用的表型信息的局限性,应该与未来的父母讨论。在产前设置中分类和报告变体的标准可能需要调整。
    OBJECTIVE: To investigate how many novel pathogenic (P) and likely pathogenic (LP) nonprotein-truncating or noncanonical splicing variants would be classified as variants of unknown significance (VUS) if they were detected in fetuses without abnormalities.
    METHODS: The study included 156 patients with neurodevelopmental disorders diagnosed through postnatal exome sequencing. Causative P/LP nonprotein-truncating and noncanonical splicing variants were retrospectively reclassified in cases without specific prenatal manifestations, disregarding postnatal symptoms.
    RESULTS: Of the 156 patients, 72 had a nontruncating or noncanonical splicing variant. Six patients were excluded for having more than one possible causative variant. Twelve patients had prenatal malformations known to be associated with the diagnosed disorder; therefore, variant interpretation remained unchanged. In 33 of the 54 remaining cases, the variant had been previously reported as P/LP. Reclassification of the other 21 LP/P variants revealed that 16 would have been classified as VUS if detected prenatally.
    CONCLUSIONS: In our cohort, ∼24% (16/66) of causative nonprotein-truncating/noncanonical splicing variants would have been classified as VUS if sequencing had been conducted during pregnancy. The potential for false-negative results, stemming from limitations in the phenotypic information available prenatally, should be discussed with prospective parents. The criteria for classifying and reporting variants in the prenatal setting may require adjustment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    最近,talin-1(TLN1)的变体与自发性冠状动脉夹层(SCAD)有关,SCAD是一种可以在心脏管壁上形成撕裂的疾病,需要立即进行医疗护理。一个talin-1变体,A2013T,有一个广泛的SCAD家族谱系,这导致了筛查,和识别,SCAD患者中的其他talin-1变体。在这里,我们使用常用的致病性预测工具评估了这些变异,发现可靠地对SCAD相关变异进行分类具有挑战性。即使是A2013T,因果关系的证据也很强。使用生化和细胞生物学方法,我们显示了talin-1中的SCAD相关变体,通常被归类为非致病性,仍然会对蛋白质结构和细胞行为产生可测量的影响,包括细胞运动和伤口愈合能力。一起,这表明,即使是中枢机械敏感性接头蛋白的微妙变异,会在个人层面产生重大的健康影响,提示需要重新评估talin变异体致病性预测的评分标准.
    Variants of talin-1 (TLN1) have recently been linked with spontaneous coronary artery dissection (SCAD) a condition where a tear can form in the wall of a heart artery necessitating immediate medical care. One talin-1 variant, A2013T, has an extensive familial pedigree of SCAD, which led to the screening for, and identification of, further talin-1 variants in SCAD patients. Here we evaluated these variants with commonly used pathogenicity prediction tools and found it challenging to reliably classify SCAD-associated variants, even A2013T where the evidence of a causal role is strong. Using biochemical and cell biological methods, we show that SCAD-associated variants in talin-1, which would typically be classified as non-pathogenic, still cause a measurable impact on protein structure and cell behaviour, including cell movement and wound healing capacity. Together, this indicates that even subtle variants in central mechanosensitive adapter proteins, can give rise to significant health impacts at the individual level, suggesting the need for a possible re-evaluation of the scoring criteria for pathogenicity prediction for talin variants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    基因变异与另一个基因中的致病性变异的共同观察解释了疾病的表现,已被指定为针对常用变异分类指南的致病性的证据。多个变体策展专家小组已经指定,从共识来看,这种证据类型不适用于乳腺癌易感性基因变异的分类。对来自BRIDGES测序项目的55,815名被诊断患有乳腺癌的个体的序列数据进行统计分析,以正式评估共同观察数据对种系变体分类的实用性。我们的分析包括11个乳腺癌易感基因的预期功能缺失变异和BRCA1、BRCA2和TP53的致病性错义变异。我们评估了在独立性假设下,两个不同基因中致病性变异的共同观察是否比预期发生得更多或更少。BRCA1,BRCA2和PALB2中每个基因的致病性变异与其余基因的共同观察频率低于预期。在诊断时调整年龄后,这种耗竭的证据仍然存在,研究设计(家族性与人群性),和国家。在BRCA1,BRCA2或PALB2中具有不确定意义的变异体与另一个乳腺癌基因中的致病性变异体的共同观察等同于基于似然比的标准强度分配后针对致病性的支持证据,并显示了在BRIDGES中鉴定的错义BRCA1和BRCA2变异体的重新分类中的实用性。我们的方法适用于评估共同观察在其他临床环境中作为变异致病性预测因子的价值。包括由ClinGen变异固化专家小组开发的基因特异性指南。
    Co-observation of a gene variant with a pathogenic variant in another gene that explains the disease presentation has been designated as evidence against pathogenicity for commonly used variant classification guidelines. Multiple variant curation expert panels have specified, from consensus opinion, that this evidence type is not applicable for the classification of breast cancer predisposition gene variants. Statistical analysis of sequence data for 55,815 individuals diagnosed with breast cancer from the BRIDGES sequencing project was undertaken to formally assess the utility of co-observation data for germline variant classification. Our analysis included expected loss-of-function variants in 11 breast cancer predisposition genes and pathogenic missense variants in BRCA1, BRCA2, and TP53. We assessed whether co-observation of pathogenic variants in two different genes occurred more or less often than expected under the assumption of independence. Co-observation of pathogenic variants in each of BRCA1, BRCA2, and PALB2 with the remaining genes was less frequent than expected. This evidence for depletion remained after adjustment for age at diagnosis, study design (familial versus population-based), and country. Co-observation of a variant of uncertain significance in BRCA1, BRCA2, or PALB2 with a pathogenic variant in another breast cancer gene equated to supporting evidence against pathogenicity following criterion strength assignment based on the likelihood ratio and showed utility in reclassification of missense BRCA1 and BRCA2 variants identified in BRIDGES. Our approach has applicability for assessing the value of co-observation as a predictor of variant pathogenicity in other clinical contexts, including for gene-specific guidelines developed by ClinGen Variant Curation Expert Panels.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:解释遗传变异的临床后果是现代临床基因组学的核心问题,遗传性疾病和肿瘤学.然而,临床验证落后于发现的步伐,导致患者痛苦的不确定性,医生和研究人员。随着证据的积累,这种“解释差距”会随着时间的推移而变化,最初被认为具有不确定(VUS)意义的变体随后可能会被重新分类为致病性/良性。我们之前开发了RENOVO,一种基于随机森林的工具,能够根据来自GnomAD和dbNFSP的公开信息预测变异致病性,并在随时间改变其分类状态的变体上进行测试。这里,我们全面评估了RENOVO对过去4年重新分类的变异体预测的准确性.
    方法:我们检索了ClinVar数据库的16个回顾性实例,自2020年3月至2024年3月,每3个月进行一次,并分析变体分类的时间趋势。我们确定了随着时间的推移而改变其状态的变体,并将2020年产生的RENOVO预测与实际的重新分类进行了比较。
    结果:VUS已成为ClinVar中最具代表性的类别(44.97%与9.75%(可能)致病性和40,33%(可能)良性)。与VUS报告的速率相比,VUS重新分类的速率是线性且缓慢的。指数,目前快~30倍,在可以测序的内容与什么可以解释。在2020年1月的10,196个VUS变体中,到2024年3月进行了有临床意义的重新分类,RENOVO在2020年正确分类了82.6%。此外,RENOVO正确地鉴定了转换为临床意义类别的少数变体中的大多数(例如,从良性到致病性,反之亦然)。我们重点介绍了RENOVO提供特别准确估计的变体类别和临床相关基因。特别是,以高或低影响变异的大流行为特征的基因(例如,POLE,NOTCH1、FANCM等.).次优RENOVO预测主要涉及通过专用联盟验证的基因(例如,BRCA1/2),RENOVO无论如何都会产生有限的影响。
    结论:时间趋势分析表明,当前的变体解释模型无法跟上变体发现的步伐。RENOVO等基于机器学习的工具证实了高准确性,可以帮助临床实践和研究。
    BACKGROUND: Interpreting the clinical consequences of genetic variants is the central problem in modern clinical genomics, for both hereditary diseases and oncology. However, clinical validation lags behind the pace of discovery, leading to distressing uncertainty for patients, physicians and researchers. This \"interpretation gap\" changes over time as evidence accumulates, and variants initially deemed of uncertain (VUS) significance may be subsequently reclassified in pathogenic/benign. We previously developed RENOVO, a random forest-based tool able to predict variant pathogenicity based on publicly available information from GnomAD and dbNFSP, and tested on variants that have changed their classification status over time. Here, we comprehensively evaluated the accuracy of RENOVO predictions on variants that have been reclassified over the last four years.
    METHODS: we retrieved 16 retrospective instances of the ClinVar database, every 3 months since March 2020 to March 2024, and analyzed time trends of variant classifications. We identified variants that changed their status over time and compared RENOVO predictions generated in 2020 with the actual reclassifications.
    RESULTS: VUS have become the most represented class in ClinVar (44.97% vs. 9.75% (likely) pathogenic and 40,33% (likely) benign). The rate of VUS reclassification is linear and slow compared to the rate of VUS reporting, exponential and currently ~ 30x faster, creating a growing divide between what can be sequenced vs. what can be interpreted. Out of 10,196 VUS variants in January 2020 that have undergone a clinically meaningful reclassification to march 2024, RENOVO correctly classified 82.6% in 2020. In addition, RENOVO correctly identified the majority of the few variants that switched clinically meaningful classes (e.g., from benign to pathogenic and vice versa). We highlight variant classes and clinically relevant genes for which RENOVO provides particularly accurate estimates. In particularly, genes characterized by large prevalence of high- or low-impact variants (e.g., POLE, NOTCH1, FANCM etc.). Suboptimal RENOVO predictions mostly concern genes validated through dedicated consortia (e.g., BRCA1/2), in which RENOVO would anyway have a limited impact.
    CONCLUSIONS: Time trend analysis demonstrates that the current model of variant interpretation cannot keep up with variant discovery. Machine learning-based tools like RENOVO confirm high accuracy that can aid in clinical practice and research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着基因检测的采用和范围不断扩大,大规模解释DNA序列变异的临床意义仍然是一个巨大的挑战,被分类为不确定显著性变异(VUS)的比例很高。基因检测实验室历来依赖,在某种程度上,关于学术文献中的功能数据,以支持变体分类。高通量功能测定或变异效应(MAVEs)的多重测定,旨在评估DNA变体对蛋白质稳定性和功能的影响,代表了变体分类的重要且日益可用的证据来源,但是他们的潜力才刚刚开始在临床实验室中实现。这里,我们描述了一个生成的框架,验证并将MAVE的数据纳入应用于临床基因检测的半定量变异分类方法。使用单细胞基因表达测量,建立细胞证据模型来评估44个临床感兴趣基因中DNA变异的影响。该框架还应用于具有先前公布的MAVE数据集的另外22个基因的模型。总的来说,将来自24个基因的建模数据纳入我们的变异分类方法.这些数据为在超过57,000个个体中对4043个观察到的变异进行分类提供了证据。基因检测实验室具有独特的优势,分析,验证,并纳入高通量功能数据的证据,最终能够使用这些数据为更多患者提供明确的临床变异分类。
    As the adoption and scope of genetic testing continue to expand, interpreting the clinical significance of DNA sequence variants at scale remains a formidable challenge, with a high proportion classified as variants of uncertain significance (VUSs). Genetic testing laboratories have historically relied, in part, on functional data from academic literature to support variant classification. High-throughput functional assays or multiplex assays of variant effect (MAVEs), designed to assess the effects of DNA variants on protein stability and function, represent an important and increasingly available source of evidence for variant classification, but their potential is just beginning to be realized in clinical lab settings. Here, we describe a framework for generating, validating and incorporating data from MAVEs into a semi-quantitative variant classification method applied to clinical genetic testing. Using single-cell gene expression measurements, cellular evidence models were built to assess the effects of DNA variation in 44 genes of clinical interest. This framework was also applied to models for an additional 22 genes with previously published MAVE datasets. In total, modeling data was incorporated from 24 genes into our variant classification method. These data contributed evidence for classifying 4043 observed variants in over 57,000 individuals. Genetic testing laboratories are uniquely positioned to generate, analyze, validate, and incorporate evidence from high-throughput functional data and ultimately enable the use of these data to provide definitive clinical variant classifications for more patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    JAG1基因中的致病变异是多系统疾病Alagille综合征的主要原因。尽管这种疾病的变异检出率很高,错义变异分类存在不确定性,导致诊断率降低.因此,高达85%的JAG1错义变异体分类不确定或相互矛盾.我们在外显子1-7中产生了2,832个JAG1核苷酸变体的文库,该区域具有大量报道的错义变体,并设计了一种高通量检测JAG1膜表达的方法,正常功能的要求。在使用变体文库中包含的175个已知或预测的致病性和良性变体的集合进行校准后,486个变体被表征为功能异常(n=277个异常和n=209个可能的异常)。其中439人(90.3%)是错误的。我们确定了在特定残基发生的不同膜表达,表明野生型残基的丢失本身不会驱动致病性,这一发现得到了结构模型数据的支持,对Alagille综合征和全球其他疾病基因的临床变异分类具有广泛意义.在接受临床或研究测试的患者中报告的144种不确定变异中,27例膜表达功能异常,纳入我们的数据导致26人重新分类为可能致病。功能证据增强了基因组变异的分类,减少不确定性并改善诊断。在JAG1变异体重新分类过程中包含此功能证据库将显着影响变异体致病性的分辨率,对Alagille综合征的分子诊断有重要影响。
    Pathogenic variants in the JAG1 gene are a primary cause of the multi-system disorder Alagille syndrome. Although variant detection rates are high for this disease, there is uncertainty associated with the classification of missense variants that leads to reduced diagnostic yield. Consequently, up to 85% of reported JAG1 missense variants have uncertain or conflicting classifications. We generated a library of 2,832 JAG1 nucleotide variants within exons 1-7, a region with a high number of reported missense variants, and designed a high-throughput assay to measure JAG1 membrane expression, a requirement for normal function. After calibration using a set of 175 known or predicted pathogenic and benign variants included within the variant library, 486 variants were characterized as functionally abnormal (n = 277 abnormal and n = 209 likely abnormal), of which 439 (90.3%) were missense. We identified divergent membrane expression occurring at specific residues, indicating that loss of the wild-type residue itself does not drive pathogenicity, a finding supported by structural modeling data and with broad implications for clinical variant classification both for Alagille syndrome and globally across other disease genes. Of 144 uncertain variants reported in patients undergoing clinical or research testing, 27 had functionally abnormal membrane expression, and inclusion of our data resulted in the reclassification of 26 to likely pathogenic. Functional evidence augments the classification of genomic variants, reducing uncertainty and improving diagnostics. Inclusion of this repository of functional evidence during JAG1 variant reclassification will significantly affect resolution of variant pathogenicity, making a critical impact on the molecular diagnosis of Alagille syndrome.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    下一代测序(NGS)的使用极大地改善了罕见疾病的诊断。然而,随着外显子组和基因组测序对变异体的检测越来越多,对基因组数据的分析变得越来越复杂。美国医学遗传学和基因组学学院(ACMG)和分子病理学协会(AMP)于2015年开发了5层分类方案,用于变异解释。此后被广泛采用。尽管努力将这些标准的应用差异降至最低,不一致仍然存在。临床基因组资源(ClinGen)联盟的变体固化专家组(VCEP)开发了单个基因的进一步规范,这也考虑到基因或疾病的特定特征。例如,在具有高度特征的面部完形的疾病中,“表型匹配”(PP4)比非综合征形式的智力障碍具有更高的致病证据。通过用于量化异形特征的相似性的计算方法,现在可以在ACMG/AMP标准的精细贝叶斯框架中使用这种分析的结果。
    The use of next-generation sequencing (NGS) has dramatically improved the diagnosis of rare diseases. However, the analysis of genomic data has become complex with the increasing detection of variants by exome and genome sequencing. The American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) developed a 5-tier classification scheme in 2015 for variant interpretation, that has since been widely adopted. Despite efforts to minimise discrepancies in the application of these criteria, inconsistencies still occur. Further specifications for individual genes were developed by Variant Curation Expert Panels (VCEPs) of the Clinical Genome Resource (ClinGen) consortium, that also take into consideration gene or disease specific features. For instance, in disorders with a highly characerstic facial gestalt a \"phenotypic match\" (PP4) has higher pathogenic evidence than e.g. in a non-syndromic form of intellectual disability. With computational approaches for quantifying the similarity of dysmorphic features results of such analysis can now be used in a refined Bayesian framework for the ACMG/AMP criteria.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:KCNE1编码129个残基的心脏钾通道(IKs)亚基。KCNE1变异与长QT综合征和心房颤动相关。然而,大多数变种的临床后果证据不足,因此限制了临床应用.
    方法:在本研究中,我们利用了变异效应映射的力量,将饱和诱变与高通量测序相结合,以确定数千种编码蛋白质的KCNE1变体的功能。
    结果:我们全面测定了KCNE1变体细胞表面表达(2554/2709可能的单氨基酸变体)和功能(2534个变体)。我们的研究确定了470个损失或部分损失表面表达和574个损失或部分损失功能变体。在574个丧失或部分丧失功能的变体中,152(26.5%)细胞表面表达减少,表明大多数功能有害变体会影响通道门控。残基56-104的无义变体通常具有WT样运输得分,但功能得分降低,表明蛋白质的后半部分对于蛋白质运输是可有可无的,但对于通道功能至关重要。30个KCNE1残基中的22个(73%)高度不耐受变异(具有>70%功能丧失变体)预测与结合配偶体KCNQ1或钙调蛋白紧密接触。我们的功能测定数据与金标准电生理数据一致(ρ=-0.64),人群和患者队列(32/38假定良性或致病性变异,得分一致),和计算预测因子(ρ=-0.62)。我们的数据为美国医学遗传学学院/分子病理学协会良性和致病性变异的功能标准提供了中等强度的证据。
    结论:KCNE1的综合变异效应图既可以提供对IKs通道生物学的了解,又可以帮助对意义不确定的变异进行重新分类。
    KCNE1 encodes a 129-residue cardiac potassium channel (IKs) subunit. KCNE1 variants are associated with long QT syndrome and atrial fibrillation. However, most variants have insufficient evidence of clinical consequences and thus limited clinical utility.
    In this study, we leveraged the power of variant effect mapping, which couples saturation mutagenesis with high-throughput sequencing, to ascertain the function of thousands of protein-coding KCNE1 variants.
    We comprehensively assayed KCNE1 variant cell surface expression (2554/2709 possible single-amino-acid variants) and function (2534 variants). Our study identified 470 loss- or partial loss-of-surface expression and 574 loss- or partial loss-of-function variants. Of the 574 loss- or partial loss-of-function variants, 152 (26.5%) had reduced cell surface expression, indicating that most functionally deleterious variants affect channel gating. Nonsense variants at residues 56-104 generally had WT-like trafficking scores but decreased functional scores, indicating that the latter half of the protein is dispensable for protein trafficking but essential for channel function. 22 of the 30 KCNE1 residues (73%) highly intolerant of variation (with > 70% loss-of-function variants) were in predicted close contact with binding partners KCNQ1 or calmodulin. Our functional assay data were consistent with gold standard electrophysiological data (ρ =  - 0.64), population and patient cohorts (32/38 presumed benign or pathogenic variants with consistent scores), and computational predictors (ρ =  - 0.62). Our data provide moderate-strength evidence for the American College of Medical Genetics/Association of Molecular Pathology functional criteria for benign and pathogenic variants.
    Comprehensive variant effect maps of KCNE1 can both provide insight into I Ks channel biology and help reclassify variants of uncertain significance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    迄今为止,孟德尔疾病变异的临床基因检测主要集中在外显子编码和内含子基因区域.这项多步骤研究旨在为选择和应用用于5个顺式调控区变异的临床分类的计算方法提供证据基础。在人群对照中,临床报告的致病5'顺式调控区变异和来自匹配基因组区域的变异的数据集被用来校准六个生物信息学工具作为变异致病性的预测因子。根据ClinGen建议应用美国医学遗传学和基因组学学院和分子病理学协会(ACMG/AMP)分类方案,将似然比估计值与代码权重对齐。考虑到所有参考数据集变体的代码分配,CADD(81.2%)和REMM(81.5%)的性能最好。优化的阈值为致病性提供了适度的证据(CADD,REMM)和针对致病性的中度(CADD)或支持(REMM)证据。当基于EPDnew定义的启动子区域中的位置对变体进行进一步分类时,预测的灵敏度和特异性都得到改善。结合预测(CADD,REMM,和启动子区域中的位置)以灵敏度为代价增加了特异性。重要的是,分配ACMG/AMP编码PP3(≥10)和BP4(≤8)的最佳CADD阈值与蛋白质编码变异体的建议(PP3≥25.3;BP4≤22.7)有很大不同;CADD<22.7会错误地将>90%的报告的致病顺式调控区变异体分配给BP4.我们的结果表明,有必要考虑分层方法和量身定制的评分阈值,以优化5个顺式调控区变异的临床分类的生物信息学影响预测。
    To date, clinical genetic testing for Mendelian disease variants has focused heavily on exonic coding and intronic gene regions. This multi-step study was undertaken to provide an evidence base for selecting and applying computational approaches for use in clinical classification of 5\' cis-regulatory region variants. Curated datasets of clinically reported disease-causing 5\' cis-regulatory region variants and variants from matched genomic regions in population controls were used to calibrate six bioinformatic tools as predictors of variant pathogenicity. Likelihood ratio estimates were aligned to code weights following ClinGen recommendations for application of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP) classification scheme. Considering code assignment across all reference dataset variants, performance was best for CADD (81.2%) and REMM (81.5%). Optimized thresholds provided moderate evidence toward pathogenicity (CADD, REMM) and moderate (CADD) or supporting (REMM) evidence against pathogenicity. Both sensitivity and specificity of prediction were improved when further categorizing variants based on location in an EPDnew-defined promoter region. Combining predictions (CADD, REMM, and location in a promoter region) increased specificity at the expense of sensitivity. Importantly, the optimal CADD thresholds for assigning ACMG/AMP codes PP3 (≥10) and BP4 (≤8) were vastly different from recommendations for protein-coding variants (PP3 ≥25.3; BP4 ≤22.7); CADD <22.7 would incorrectly assign BP4 for >90% of reported disease-causing cis-regulatory region variants. Our results demonstrate the need to consider a tiered approach and tailored score thresholds to optimize bioinformatic impact prediction for clinical classification of 5\' cis-regulatory region variants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号