genotype-phenotype

基因型 - 表型
  • 文章类型: Journal Article
    背景:解码人类基因组序列需要对DNA序列功能性进行全面分析。通过计算和实验方法,研究人员已经研究了基因型与表型的关系,并生成了有助于解开复杂遗传蓝图的重要数据集。因此,最近开发的人工智能方法可以用来解释这些DNA序列的功能。
    方法:本研究探讨了深度学习的使用,特别是预训练的基因组模型,如DNA_bert_6和human_gpt2-v1,在解释和表示人类基因组序列。最初,我们精心构建了多个连接基因型和表型的数据集,以微调这些模型,从而实现精确的DNA序列分类.此外,我们评估了序列长度对分类结果的影响,并使用HERV数据集分析了模型隐藏层中特征提取的影响.为了增强我们对模型识别的表型特异性模式的理解,我们进行浓缩,具有高平均局部代表权重(ALRW)评分的人内源性逆转录病毒(HERV)序列中特定基序的致病性和保守性分析。
    结果:我们构建了多个基因型-表型数据集,与随机基因组序列相比,这些数据集显示出值得称道的分类性能,特别是在HERV数据集中,实现了二进制和多分类精度,F1值分别超过0.935和0.888。值得注意的是,HERV数据集的微调不仅提高了我们识别和区分DNA序列中不同信息类型的能力,而且还成功地在ALRW评分较高的区域中识别出与神经系统疾病和癌症相关的特定基序.随后对这些基序的分析揭示了物种对环境压力的适应性反应及其与病原体的共同进化。
    结论:这些发现突出了预先训练的基因组模型在学习DNA序列表征方面的潜力。特别是在利用HERV数据集时,并为未来的研究工作提供有价值的见解。这项研究代表了一种创新的策略,将预先训练的基因组模型表示与分析基因组序列功能的经典方法相结合。从而促进基因组学和人工智能之间的交叉受精。
    BACKGROUND: Decoding human genomic sequences requires comprehensive analysis of DNA sequence functionality. Through computational and experimental approaches, researchers have studied the genotype-phenotype relationship and generate important datasets that help unravel complicated genetic blueprints. Thus, the recently developed artificial intelligence methods can be used to interpret the functions of those DNA sequences.
    METHODS: This study explores the use of deep learning, particularly pre-trained genomic models like DNA_bert_6 and human_gpt2-v1, in interpreting and representing human genome sequences. Initially, we meticulously constructed multiple datasets linking genotypes and phenotypes to fine-tune those models for precise DNA sequence classification. Additionally, we evaluate the influence of sequence length on classification results and analyze the impact of feature extraction in the hidden layers of our model using the HERV dataset. To enhance our understanding of phenotype-specific patterns recognized by the model, we perform enrichment, pathogenicity and conservation analyzes of specific motifs in the human endogenous retrovirus (HERV) sequence with high average local representation weight (ALRW) scores.
    RESULTS: We have constructed multiple genotype-phenotype datasets displaying commendable classification performance in comparison with random genomic sequences, particularly in the HERV dataset, which achieved binary and multi-classification accuracies and F1 values exceeding 0.935 and 0.888, respectively. Notably, the fine-tuning of the HERV dataset not only improved our ability to identify and distinguish diverse information types within DNA sequences but also successfully identified specific motifs associated with neurological disorders and cancers in regions with high ALRW scores. Subsequent analysis of these motifs shed light on the adaptive responses of species to environmental pressures and their co-evolution with pathogens.
    CONCLUSIONS: These findings highlight the potential of pre-trained genomic models in learning DNA sequence representations, particularly when utilizing the HERV dataset, and provide valuable insights for future research endeavors. This study represents an innovative strategy that combines pre-trained genomic model representations with classical methods for analyzing the functionality of genome sequences, thereby promoting cross-fertilization between genomics and artificial intelligence.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Letter
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:牙齿发育不全是一种常见的颅面畸形,这通常与基因突变有关。这项研究的目的是调查和发现八个受牙齿发育不全影响的中国家庭中的外生体发育异常蛋白A(EDA)基因变异。
    方法:从牙齿发育不全家族中提取基因组DNA,并使用全外显子组测序进行测序。用westernblot方法研究了胞外发育不良素A1(EDA1)蛋白的表达,通过下拉测试与受体的结合活性,并通过双荧光素酶测定法分析NF-κB转录活性。
    结果:发现了八个EDA错义变体,其中两个(C。T812C,c.A1073G)是新颖的。生物信息学分析表明这些变异可能是致病性的。三级结构分析显示,这8种变体可引起EDA蛋白的结构损伤。体外功能研究表明,这些变体极大地影响蛋白质稳定性或损害EDA-EDAR相互作用;从而显着影响下游NF-κb转录活性。此外,我们总结了EDA变异导致的基因型-表型相关性,发现导致NSTA的EDA突变大多是位于TNF域的错义突变.
    结论:我们的研究结果拓宽了与牙齿发育不全相关的EDA基因变异谱,为未来的遗传咨询提供了有价值的信息。
    OBJECTIVE: Tooth agenesis is a common craniofacial malformation, which is often associated with gene mutations. The purpose of this research was to investigate and uncover ectodysplasin A (EDA) gene variants in eight Chinese families affected with tooth agenesis.
    METHODS: Genomic DNA was extracted from tooth agenesis families and sequenced using whole-exome sequencing. The expression of ectodysplasin A1 (EDA1) protein was studied by western blot, binding activity with receptor was tested by pull-down and the NF-κB transcriptional activity was analyzed by Dual luciferase assay.
    RESULTS: Eight EDA missense variants were discovered, of which two (c.T812C, c.A1073G) were novel. The bioinformatics analysis indicated that these variants might be pathogenic. The tertiary structure analysis revealed that these eight variants could cause structural damage to EDA proteins. In vitro functional studies demonstrated that the variants greatly affect protein stability or impair the EDA-EDAR interaction; thereby significantly affecting the downstream NF-κb transcriptional activity. In addition, we summarized the genotype-phenotype correlation caused by EDA variants and found that EDA mutations leading to NSTA are mostly missense mutations located in the TNF domain.
    CONCLUSIONS: Our results broaden the variant spectrum of the EDA gene associated with tooth agenesis and provide valuable information for future genetic counseling.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    一个中国家庭被确认有两名病患者,一名成年女性和一名男性儿童(先证者),两者均表现出与X连锁低磷酸盐血症病(XLH)相关的体征。基因测序分析显示在PHEX编码基因中的位置1985(c.1985delA)的腺嘌呤缺失。探讨该突变与XLH致病性的关系,以及分析不同剂量PHEX基因突变对临床表型的影响,我们建立了携带PHEX缺失突变的大鼠模型.采用CRISPR/Cas9基因编辑技术构建PHEX基因突变(c.1985delA)大鼠模型。通过生殖程序,获得了五种基因型的大鼠:雌性野生型(X/X),雌性杂合(-/X),雌性纯合野生型(-/-),雄性野生型(X/Y),和男性半合子(-/Y)。对不同基因型的大鼠进行生长分析,血清生化参数,骨微结构。结果表明成功产生了遗传PHEX基因突变的稳定大鼠模型。与野生型大鼠相比,突变大鼠表现出生长延迟,较短的股骨,并显著减少骨量。在雌性老鼠中,纯合个体表现出最小的体型,骨量减少,最短股骨长度,和严重的畸形。此外,突变大鼠的血磷浓度明显降低,FGF23和碱性磷酸酶水平升高,和磷调节剂的表达增加。总之,PHEX基因突变剂量的XLH大鼠模型证明了其对生长发育的影响,血清生化参数,股骨形态。
    A Chinese family was identified to have two patients with rickets, an adult female and a male child (proband), both exhibiting signs related to X-linked hypophosphatemic rickets (XLH). Gene sequencing analysis revealed a deletion of adenine at position 1985 (c.1985delA) in the PHEX-encoding gene. To investigate the relationship between this mutation and the pathogenicity of XLH, as well as analyze the effects of different dosages of PHEX gene mutations on clinical phenotypes, we developed a rat model carrying the PHEX deletion mutation. The CRISPR/Cas9 gene editing technology was employed to construct the rat model with the PHEX gene mutation (c.1985delA). Through reproductive procedures, five genotypes of rats were obtained: female wild type (X/X), female heterozygous (-/X), female homozygous wild type (-/-), male wild type (X/Y), and male hemizygous (-/Y). The rats with different genotypes underwent analysis of growth, serum biochemical parameters, and bone microstructure. The results demonstrated the successful generation of a stable rat model inheriting the PHEX gene mutation. Compared to the wild-type rats, the mutant rats displayed delayed growth, shorter femurs, and significantly reduced bone mass. Among the female rats, the homozygous individuals exhibited the smallest body size, decreased bone mass, shortest femur length, and severe deformities. Moreover, the mutant rats showed significantly lower blood phosphorus concentration, elevated levels of FGF23 and alkaline phosphatase, and increased expression of phosphorus regulators. In conclusion, the XLH rat model with the PHEX gene mutation dosage demonstrated its impact on growth and development, serum biochemical parameters, and femoral morphology.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:MSX1(OMIM#142983)对正常的牙齿发育至关重要,MSX1中的变体与牙齿异常相关。这项研究的目的是表征新型MSX1变体在中国非综合征少牙(NSO)家族中的致病性。
    方法:基因组DNA是从代表35个非综合征型寡核家族的个体中提取的,并通过Sanger测序和全外显子测序进行分析。通过涉及PolyPhen-2的分析筛选致病变异体,从耐受性筛选,和MutationTaster,和变体的保守分析。分析了MSX1相关NSO的模式。MSX1结构变化表明体外功能后果。
    结果:鉴定了三个先前未报道的MSX1杂合变体:一个插入变体(c.576_577insTAG;p.Gln193*)和两个错义变体(c。871T>C;p.Tyr291His和c。644A>C;p.Gln215Pro)。免疫荧光分析显示p.Gln193*MSX1变体的异常亚细胞定位。此外,我们发现这些MSX1变体可能导致第二前磨牙的丢失。
    结论:在中国汉族NSO家族中鉴定出三种新的MSX1变体,扩展MSX1变异谱,并呈现在患者及其家庭中检测到的发病机制的遗传起源。
    BACKGROUND: MSX1 (OMIM #142983) is crucial to normal dental development, and variants in MSX1 are associated with dental anomalies. The objective of this study was to characterize the pathogenicity of novel MSX1 variants in Chinese families with non-syndromic oligodontia (NSO).
    METHODS: Genomic DNA was extracted from individuals representing 35 families with non-syndromic oligodontia and was analyzed by Sanger sequencing and whole-exome sequencing. Pathogenic variants were screened via analyses involving PolyPhen-2, Sorting-Intolerant from Tolerant, and MutationTaster, and conservative analysis of variants. Patterns of MSX1-related NSO were analyzed. MSX1 structural changes suggested functional consequences in vitro.
    RESULTS: Three previously unreported MSX1 heterozygous variants were identified: one insertion variant (c.576_577insTAG; p.Gln193*) and two missense variants (c. 871T>C; p.Tyr291His and c. 644A>C; p.Gln215Pro). Immunofluorescence analysis revealed abnormal subcellular localization of the p.Gln193* MSX1 variant. In addition, we found that these MSX1 variants likely lead to the loss of second premolars.
    CONCLUSIONS: Three novel MSX1 variants were identified in Chinese Han families with NSO, expanding the MSX1 variant spectrum and presenting a genetic origin for the pathogenesis detected in patients and their families.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    AMPA(α-氨基-3-羟基-5-甲基-4-异恶唑丙酸)受体(AMPAR)介导大脑中的快速兴奋性神经传递。AMPAR通过GRIA1-GRIA4基因编码的亚基的同源或异源组装形成,其中只有GRIA3是X染色体。在神经发育障碍(NDD)患者中报告了越来越多的GRIA3错义变异,但只有少数被功能性检查过。这里,我们评估了通过电生理试验在NDD患者中发现的1种移码和43种罕见错义GRIA3变异体对AMPAR功能的影响.31种变体改变受体功能并显示功能丧失(LoF)或功能获得(GoF)特性,而13则表现为中性。我们从25例患者(来自23个家庭)收集了17例这些变异的详细临床数据。所有患者都有整体发育障碍,大多数是中度(9/25)或重度(12/25)。十二个病人有癫痫发作,包括聚焦运动(6/12),未知发作运动(4/12),局灶性意识受损(1/12),(非典型)缺席(2/12),肌阵挛性(5/12),和全身性强直阵挛性(1/12)或无张力(1/12)癫痫发作。癫痫综合征分为发育性脑病和癫痫性脑病8例,13例没有癫痫发作的发展性脑病,4例癫痫患者的智力障碍。13/25报告了肢体肌肉张力减退,10/25报告了高张力。在14/25报告了运动障碍,其中神经早发或非癫痫性不稳定肌阵挛症是最普遍的特征(8/25)。将受体功能表型与临床特征相关联揭示了GRIA3相关NDD的临床特征以及LoF和GoF变体的不同NDD表型。GoF变异与更严重的结局相关:患者在癫痫发作时(中位年龄1个月)更年轻,高渗,更常见的是运动障碍,包括早退.LoF变异的患者在癫痫发作时年龄较大(中位年龄16个月),低渗,并且有睡眠障碍。LoF和GoF变体在两种性别中都是致病的,但受影响的男性通常携带从健康母亲遗传的从头或半合子的LoF变体,而除一名受影响的雌性外,所有雌性都具有从头杂合GoF变体。
    AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptors (AMPARs) mediate fast excitatory neurotransmission in the brain. AMPARs form by homo- or heteromeric assembly of subunits encoded by the GRIA1-GRIA4 genes, of which only GRIA3 is X-chromosomal. Increasing numbers of GRIA3 missense variants are reported in patients with neurodevelopmental disorders (NDD), but only a few have been examined functionally. Here, we evaluated the impact on AMPAR function of one frameshift and 43 rare missense GRIA3 variants identified in patients with NDD by electrophysiological assays. Thirty-one variants alter receptor function and show loss-of-function or gain-of-function properties, whereas 13 appeared neutral. We collected detailed clinical data from 25 patients (from 23 families) harbouring 17 of these variants. All patients had global developmental impairment, mostly moderate (9/25) or severe (12/25). Twelve patients had seizures, including focal motor (6/12), unknown onset motor (4/12), focal impaired awareness (1/12), (atypical) absence (2/12), myoclonic (5/12) and generalized tonic-clonic (1/12) or atonic (1/12) seizures. The epilepsy syndrome was classified as developmental and epileptic encephalopathy in eight patients, developmental encephalopathy without seizures in 13 patients, and intellectual disability with epilepsy in four patients. Limb muscular hypotonia was reported in 13/25, and hypertonia in 10/25. Movement disorders were reported in 14/25, with hyperekplexia or non-epileptic erratic myoclonus being the most prevalent feature (8/25). Correlating receptor functional phenotype with clinical features revealed clinical features for GRIA3-associated NDDs and distinct NDD phenotypes for loss-of-function and gain-of-function variants. Gain-of-function variants were associated with more severe outcomes: patients were younger at the time of seizure onset (median age: 1 month), hypertonic and more often had movement disorders, including hyperekplexia. Patients with loss-of-function variants were older at the time of seizure onset (median age: 16 months), hypotonic and had sleeping disturbances. Loss-of-function and gain-of-function variants were disease-causing in both sexes but affected males often carried de novo or hemizygous loss-of-function variants inherited from healthy mothers, whereas affected females had mostly de novo heterozygous gain-of-function variants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    烟雾病,导致儿童和年轻人中风的脑血管疾病,其特征在于颈内动脉远端的进行性闭塞和侧支血管的形成。改变的基因在烟雾病的病因中起着重要作用,但是在大多数情况下没有发现致病基因。分析了来自84个未解决家庭的151个个体的外显子组测序数据,以确定烟雾病的进一步基因。然后在其他病例中评估候选基因(150个先证者)。两个家族在ANO1中具有相同的罕见变异,编码钙激活的氯化物通道,anocamin-1.单倍型分析发现这些家庭是相关的,和ANO1p.Met658Val在家庭中与烟雾病分离,LOD评分为3.3。在烟雾病家族中发现了另外六个ANO1罕见变体。使用膜片钳记录评估ANO1罕见变异,和大多数变体,包括ANO1p.Met658Val,显示对细胞内Ca2+的敏感性增加。具有这些功能获得ANO1变体的患者具有MMD的经典特征,但也有动脉瘤,狭窄,和/或后循环闭塞。我们的研究支持ANO1功能获得致病变异易患烟雾病,并且与后循环的独特参与有关。
    Moyamoya disease, a cerebrovascular disease leading to strokes in children and young adults, is characterized by progressive occlusion of the distal internal carotid arteries and the formation of collateral vessels. Altered genes play a prominent role in the aetiology of moyamoya disease, but a causative gene is not identified in the majority of cases. Exome sequencing data from 151 individuals from 84 unsolved families were analysed to identify further genes for moyamoya disease, then candidate genes assessed in additional cases (150 probands). Two families had the same rare variant in ANO1, which encodes a calcium-activated chloride channel, anoctamin-1. Haplotype analyses found the families were related, and ANO1 p.Met658Val segregated with moyamoya disease in the family with an LOD score of 3.3. Six additional ANO1 rare variants were identified in moyamoya disease families. The ANO1 rare variants were assessed using patch-clamp recordings, and the majority of variants, including ANO1 p.Met658Val, displayed increased sensitivity to intracellular Ca2+. Patients harbouring these gain-of-function ANO1 variants had classic features of moyamoya disease, but also had aneurysm, stenosis and/or occlusion in the posterior circulation. Our studies support that ANO1 gain-of-function pathogenic variants predispose to moyamoya disease and are associated with unique involvement of the posterior circulation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Case Reports
    最近有报道称,双等位基因TENM3变体会导致非综合征性小眼病伴9型(MCOPCB9)和小眼病和/或发育迟缓型小眼病(MCOPS15)。迄今为止,仅报道了8例隐性TENM3变异的综合征型和非综合征型小眼症病例.在这里,我们报告了两例无关的新病例,在TENM3中具有双等位基因变异,拓宽了分子和临床范围。关于患者1,WES揭示了TENM3基因中的复合杂合变体:c.3847_3855del;p.Leu1283_Ser1285del和c.3698_3699insA;索引患者中的p.Thr1233Thrfs*20,出现双侧小眼症的人,先天性白内障,小头畸形,和全球发展延迟。关于患者2,确定了TENM3基因中的复合错义杂合变体:c.941C>T;p.Ala314Val和c.6464T>C;p.Leu2155Pro在3岁男孩中,患有先天性内斜视的人,说话延迟,和运动发育迟缓。这两个病例的临床特征显示与先前报道的病例高度一致,包括小眼症和发育迟缓。我们患者中小头畸形的存在可能会扩大与TENM3功能变异丧失相关的神经系统表型,因为以前没有描述小头畸形。此外,我们提供证据表明,TENM3中的错义变异与相似,但温和一点,眼部特征。
    Biallelic TENM3 variants were recently reported to cause non-syndromic microphthalmia with coloboma-9 (MCOPCB9) and microphthalmia and/or coloboma with developmental delay (MCOPS15). To date, only eight syndromic and non-syndromic microphthalmia cases with recessive TENM3 variants have been reported. Herein, we report two unrelated new cases with biallelic variants in TENM3, widening the molecular and clinical spectrum. Regarding patient 1, WES revealed compound heterozygous variants in the TENM3 gene: c.3847_3855del; p.Leu1283_Ser1285del and c.3698_3699insA; p.Thr1233Thrfs*20 in the index patient, who was presenting with bilateral microphthalmia, congenital cataract, microcephaly, and global developmental delay. Regarding patient 2, compound missense heterozygous variants in the TENM3 gene were identified: c.941C > T; p.Ala314Val and c.6464T > C; p.Leu2155Pro in the 3-year-old boy, who presented with congenital esotropia, speech delay, and motor developmental delay. The clinical features of these two cases revealed high concordance with the previously reported cases, including microphthalmia and developmental delay. The presence of microcephaly in our patient potentially expands the neurologic phenotype associated with loss of function variants in TENM3, as microcephaly has not previously been described. Furthermore, we present evidence that missense variants in TENM3 are associated with similar, but milder, ocular features.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Editorial
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    动机:当前生物学的中心目标是在基因型和表型之间建立完整的功能联系,被称为所谓的基因型-表型图谱。随着高通量技术的不断发展和测序成本的下降,多组学分析已得到越来越广泛的应用。虽然这给了我们新的机会来揭示单核苷酸多态性(SNP)之间的相关机制,基因,和表型,多元组学仍然面临某些挑战,具体来说:1)当样本量足够大时,组学类型的数量往往不足以满足多组组学分析的要求;2)每个组学的内部相关性往往不清楚,如基因组学中基因之间的相关性;3)分析大量性状时(p),样本量(n)通常小于p,n< Motivation: A central goal of current biology is to establish a complete functional link between the genotype and phenotype, known as the so-called genotype-phenotype map. With the continuous development of high-throughput technology and the decline in sequencing costs, multi-omics analysis has become more widely employed. While this gives us new opportunities to uncover the correlation mechanisms between single-nucleotide polymorphism (SNP), genes, and phenotypes, multi-omics still faces certain challenges, specifically: 1) When the sample size is large enough, the number of omics types is often not large enough to meet the requirements of multi-omics analysis; 2) each omics\' internal correlations are often unclear, such as the correlation between genes in genomics; 3) when analyzing a large number of traits (p), the sample size (n) is often smaller than p, n << p, hindering the application of machine learning methods in the classification of disease outcomes. Results: To solve these issues with multi-omics and build a robust classification model, we propose a graph-embedded deep neural network (G-EDNN) based on expression quantitative trait loci (eQTL) data, which achieves sparse connectivity between network layers to prevent overfitting. The correlation within each omics is also considered such that the model more closely resembles biological reality. To verify the capabilities of this method, we conducted experimental analysis using the GSE28127 and GSE95496 data sets from the Gene Expression Omnibus (GEO) database, tested various neural network architectures, and used prior data for feature selection and graph embedding. Results show that the proposed method could achieve a high classification accuracy and easy-to-interpret feature selection. This method represents an extended application of genotype-phenotype association analysis in deep learning networks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号