RNA secondary structure

RNA 二级结构
  • 文章类型: Journal Article
    Gapmer反义寡核苷酸(ASO)具有等位基因特异性沉默的治疗前景,但在区分突变体和野生型转录物方面面临挑战。这项研究探索了新的设计策略,以提高ASO的特异性,重点研究了与Ullrich先天性肌营养不良相关的COL6A3基因中常见的显性突变。初始gapmerASO设计显示出高效率,但对突变等位基因的特异性较差。然后我们采用了混合器设计,基于突变体和野生型等位基因的二级结构的计算预测,整合额外的RNA碱基,旨在增强ASO对突变转录本的可及性。mixmerASO设计证明与经典gapmer设计相比特异性增加高达3倍。进一步的改进涉及引入核苷酸错配作为结构修饰,与gapmer设计相比,特异性提高了10倍,与mixmer设计相比提高了3倍。此外,我们首次确定了RNA诱导沉默复合物(RISC)的潜在作用,与RNaseH1一起,在gapmer介导的沉默中,与MixmerASO观察到的情况相反,其中仅涉及RNaseH1。总之,这项研究提出了利用mRNA二级结构和核苷酸错配的等位基因特异性ASO的新设计概念,并提示RISC可能参与gapmer介导的沉默.
    Gapmer antisense oligonucleotides (ASOs) hold therapeutic promise for allele-specific silencing, but face challenges in distinguishing between mutant and wild-type transcripts. This study explores new design strategies to enhance ASO specificity, focusing on a common dominant mutation in COL6A3 gene associated with Ullrich congenital muscular dystrophy. Initial gapmer ASO design exhibited high efficiency but poor specificity for the mutant allele. We then adopted a mixmer design, incorporating additional RNA bases based on computational predictions of secondary structures for both mutant and wild-type alleles, aiming to enhance ASO accessibility to mutant transcripts. The mixmer ASO design demonstrated up to a 3-fold increase in specificity compared with the classical gapmer design. Further refinement involved introducing a nucleotide mismatch as a structural modification, resulting in a 10-fold enhancement in specificity compared with the gapmer design and a 3-fold over the mixmer design. Additionally, we identified for the first time a potential role of the RNA-induced silencing complex (RISC), alongside RNase H1, in gapmer-mediated silencing, in contrast with what was observed with mixmer ASOs, where only RNase H1 was involved. In conclusion, this study presents a novel design concept for allele-specific ASOs leveraging mRNA secondary structures and nucleotide mismatching and suggests a potential involvement of RISC in gapmer-mediated silencing.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    甲型肝炎病毒(HAV),肝病毒属(PicornaviridaeHepV)的成员,仍然是一种重要的病毒病原体,经常在全球范围内引起经肠传播的肝炎。在这项研究中,我们对云南省野生小型陆生哺乳动物携带的HepV进行了流行病学调查,中国。利用HepV特异性广谱RT-PCR,下一代测序(NGS),和QNome纳米孔测序(QNS)技术,我们鉴定并表征了两种暂时命名为EpMa-HAV和EpLe-HAV的新型HepV,发现于长尾山sh(Episoriculusmacrurus)和长尾棕齿sh(Episoriculusleucops)中,分别。我们对EpMa-HAV和EpLe-HAV的序列和系统发育分析表明它们属于I型肝病毒(HepV-I)进化枝II,也被称为中国泼妇HepV进化枝。值得注意的是,新型HepV的密码子使用偏倚模式与先前鉴定的中国HepV一致。此外,我们的结构分析表明,与其他哺乳动物HepVs的RNA二级结构不同,并且在关键蛋白位点表现出差异.总的来说,在the中发现了两个新的HepV,扩大了HepV的宿主范围,并强调了HepV属中人类HAV的遗传多样性动物同源物的存在。
    Hepatitis A virus (HAV), a member of the genus Hepatovirus (Picornaviridae HepV), remains a significant viral pathogen, frequently causing enterically transmitted hepatitis worldwide. In this study, we conducted an epidemiological survey of HepVs carried by small terrestrial mammals in the wild in Yunnan Province, China. Utilizing HepV-specific broad-spectrum RT-PCR, next-generation sequencing (NGS), and QNome nanopore sequencing (QNS) techniques, we identified and characterized two novel HepVs provisionally named EpMa-HAV and EpLe-HAV, discovered in the long-tailed mountain shrew (Episoriculus macrurus) and long-tailed brown-toothed shrew (Episoriculus leucops), respectively. Our sequence and phylogenetic analyses of EpMa-HAV and EpLe-HAV indicated that they belong to the species Hepatovirus I (HepV-I) clade II, also known as the Chinese shrew HepV clade. Notably, the codon usage bias pattern of novel shrew HepVs is consistent with that of previously identified Chinese shrew HepV. Furthermore, our structural analysis demonstrated that shrew HepVs differ from other mammalian HepVs in RNA secondary structure and exhibit variances in key protein sites. Overall, the discovery of two novel HepVs in shrews expands the host range of HepV and underscores the existence of genetically diverse animal homologs of human HAV within the genus HepV.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:编码和非编码RNA分子参与许多重要的生物学过程。非编码RNA折叠成明确定义的二级结构以发挥其功能。然而,从原始RNA序列对二级结构的计算预测是一个长期未解决的问题,在经历了几十年几乎不变的性能之后,由于深度学习,现在又重新出现了。传统的RNA二级结构预测算法大多基于热力学模型和动态规划来实现自由能最小化。最近,与经典的方法相比,深度学习方法已经显示出竞争力。但仍有很大的改善空间。
    结果:在这项工作中,一种端到端的深度学习方法,仅使用RNA序列作为输入来预测核苷酸接触矩阵。该模型基于1D和2D残差神经网络,可以学习短期和长期的交互模式。我们证明了结构可以用最少的物理假设来准确预测。在几个基准数据集上进行了广泛的实验,考虑序列同源性和交叉家族验证。将其与经典方法和最近的深度学习模型进行了比较,表明它可以胜过最先进的方法。
    BACKGROUND: Coding and noncoding RNA molecules participate in many important biological processes. Noncoding RNAs fold into well-defined secondary structures to exert their functions. However, the computational prediction of the secondary structure from a raw RNA sequence is a long-standing unsolved problem, which after decades of almost unchanged performance has now re-emerged due to deep learning. Traditional RNA secondary structure prediction algorithms have been mostly based on thermodynamic models and dynamic programming for free energy minimization. More recently deep learning methods have shown competitive performance compared with the classical ones, but there is still a wide margin for improvement.
    RESULTS: In this work we present sincFold, an end-to-end deep learning approach, that predicts the nucleotides contact matrix using only the RNA sequence as input. The model is based on 1D and 2D residual neural networks that can learn short- and long-range interaction patterns. We show that structures can be accurately predicted with minimal physical assumptions. Extensive experiments were conducted on several benchmark datasets, considering sequence homology and cross-family validation. sincFold was compared with classical methods and recent deep learning models, showing that it can outperform the state-of-the-art methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    N6-甲基腺苷(m6A)甲基化是一种内部转录后修饰,与病毒增殖和致病性有关。为了阐明禽白血病病毒J亚群(ALV-J)中潜在的5'-DRACH-3'基序的保护模式,149株ALV-J菌株(来自中国的139株;来自英国的ALV-J原型HPRS-103;来自美国的9株,俄罗斯,印度,和巴基斯坦)在2023年12月之前在GenBank中可用。根据SRAMPWeb服务器的预测结果,这些ALV-J基因组含有潜在的戏剧图案,总数从43到64,这不是根据隔离区和时间确定的。保守分析表明,37个基序表现出>80%的保守性,包括17个等级高于“高置信度”的图案。尽管这些基序分布在LTR的U5区和主要编码区,它们富集在p27,p68,p32和gp85的编码区。ALV-J基因组中DRACH基序最常见的m6A基序序列是GGACU。SRAMP和RNA结构网络服务器预测的每个保守基序的RNA二级结构主要是基于核心腺苷的两种类型-A-U对(21/37)和发夹环(16/37)。考虑到本研究中进行的系统比较分析,有必要进行未来的彻底生化研究,以确定m6A修饰在ALV-J复制和感染过程中的作用。ALV-J中潜在m6A位点的DRACH基序的这些保存和分布分析将为ALV-J感染和m6A修饰的未来干预提供基础。
    N6-methyladenosine (m6A) methylation is an internal post-transcriptional modification that has been linked to viral multiplication and pathogenicity. To elucidate the conservation patterns of potential 5\'-DRACH-3\' motifs in avian leukosis virus subgroup J (ALV-J), 149 ALV-J strains (139 isolates from China; ALV-J prototype HPRS-103 from the UK; and 9 strains from the USA, Russia, India, and Pakistan) available in GenBank before December 2023 were retrieved. According to the prediction results of the SRAMP web-server, these ALV-J genomes contained potential DRACH motifs, with the total number ranging from 43 to 64, which were not determined based on the isolation region and time. Conservative analysis suggested that 37 motifs exhibited a conservation of >80%, including 17 motifs with a grading above \"high confidence.\" Although these motifs were distributed in the U5 region of LTRs and major coding regions, they were enriched in the coding regions of p27, p68, p32, and gp85. The most common m6A-motif sequence of the DRACH motif in the ALV-J genome was GGACU. The RNA secondary structure of each conserved motif predicted by SRAMP and RNAstructure web-server was mainly of two types-A-U pair (21/37) and hairpin loop (16/37)-based on the core adenosine. Considering the systematic comparative analysis performed in this study, future thorough biochemical research is warranted to determine the role of m6A modification during the replication and infection of ALV-J. These conservation and distribution analysis of the DRACH motif for potential m6A sites in ALV-J would provide a foundation for the future intervention of ALV-J infection and m6A modification.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    不同的细菌种类有明显不同的世代时间,从大肠杆菌中20-30分钟到麻风分枝杆菌中大约两周。细胞中的翻译机制需要在每一代中合成新细胞的所有蛋白质。翻译的三个子过程,即,initiation,伸长率,和终止,与长代麻风分枝杆菌相比,短代细菌(SGB)(例如纳氏弧菌)有望在更强的选择压力下进行优化。起始效率取决于起始tRNA解码的起始密码子,由小亚基rRNA上的抗SD(aSD)序列解码的最佳Shine-Dalgarno(SD),以及可以嵌入启动信号并防止它们被解码的二级结构。延伸效率取决于tRNA库和密码子使用。在细菌中的终止效率主要取决于终止密码子的性质和紧邻终止密码子下游的核苷酸。通过将SGB与长代细菌(LGB)进行对比,我们预测(1)SGB有更多的核糖体RNA操纵子来产生核糖体,和更多的tRNA基因携带氨基酸到核糖体,(2)SGB使用AUG作为起始密码子和UAA作为终止密码子的基因百分比高于LGB,(3)SGB表现出比LGB更好的密码子和反密码子适应,和(4)SGB在翻译起始信号附近具有比LGB更弱的二级结构。SGB和LGB之间的这些差异在高表达基因中应该比其余基因更明显。我们提供了支持这些预测的经验证据。
    Different bacterial species have dramatically different generation times, from 20-30 min in Escherichia coli to about two weeks in Mycobacterium leprae. The translation machinery in a cell needs to synthesize all proteins for a new cell in each generation. The three subprocesses of translation, i.e., initiation, elongation, and termination, are expected to be under stronger selection pressure to optimize in short-generation bacteria (SGB) such as Vibrio natriegens than in the long-generation Mycobacterium leprae. The initiation efficiency depends on the start codon decoded by the initiation tRNA, the optimal Shine-Dalgarno (SD) decoded by the anti-SD (aSD) sequence on small subunit rRNA, and the secondary structure that may embed the initiation signals and prevent them from being decoded. The elongation efficiency depends on the tRNA pool and codon usage. The termination efficiency in bacteria depends mainly on the nature of the stop codon and the nucleotide immediately downstream of the stop codon. By contrasting SGB with long-generation bacteria (LGB), we predict (1) SGB to have more ribosome RNA operons to produce ribosomes, and more tRNA genes for carrying amino acids to ribosomes, (2) SGB to have a higher percentage of genes using AUG as the start codon and UAA as the stop codon than LGB, (3) SGB to exhibit better codon and anticodon adaptation than LGB, and (4) SGB to have a weaker secondary structure near the translation initiation signals than LGB. These differences between SGB and LGB should be more pronounced in highly expressed genes than the rest of the genes. We present empirical evidence in support of these predictions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Preprint
    RNA二级结构的预测对于理解其基本原理和在不同领域的应用至关重要。包括分子诊断和基于RNA的治疗策略。然而,搜索空间的复杂性提出了挑战。这项工作提出了用于预测RNA二级结构的图卷积网络(GCNfold)。GCNfold将RNA序列视为图结构数据,并在给定先前碱基配对概率的情况下预测后验碱基配对概率,使用McCaskill的分区函数计算。GCNfold的性能超过了最先进的折叠算法,由于我们将最小自由能信息集成到丰富的参数化网络中,增强其预测非同源RNA二级结构的稳健性。对称Argmax后处理算法确保GCNfold形成有效结构。为了验证我们的算法,我们将其应用于SARS-CoV-2E基因,并确定了跨Betacoronavirus亚属的E基因的二级结构。
    The prediction of RNA secondary structures is essential for understanding its underlying principles and applications in diverse fields, including molecular diagnostics and RNA-based therapeutic strategies. However, the complexity of the search space presents a challenge. This work proposes a Graph Convolutional Network (GCNfold) for predicting the RNA secondary structure. GCNfold considers an RNA sequence as graph-structured data and predicts posterior base-pairing probabilities given the prior base-pairing probabilities, calculated using McCaskill\'s partition function. The performance of GCNfold surpasses that of the state-of-the-art folding algorithms, as we have incorporated minimum free energy information into the richly parameterized network, enhancing its robustness in predicting non-homologous RNA secondary structures. A Symmetric Argmax Post-processing algorithm ensures that GCNfold formulates valid structures. To validate our algorithm, we applied it to the SARS-CoV-2 E gene and determined the secondary structure of the E-gene across the Betacoronavirus subgenera.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:从等位基因特异性角度揭示功能性遗传变异对于促进我们对基因调控和遗传疾病的理解至关重要。最近,各种等位基因特异性事件,如等位基因特异性基因表达,等位基因特异性甲基化,和等位基因特异性结合,由于高通量测序方法的发展,已经在全基因组范围内进行了探索。RNA二级结构,在RNA修饰等多个RNA相关过程中起着至关重要的作用,翻译和拼接,已成为相关研究的重要焦点。然而,鉴定与等位基因特异性RNA二级结构相关的遗传变异的工具仍然缺乏。
    结果:这里,我们开发了一种称为“AStruct”的计算工具,使我们能够从基于RT-stop的结构探测数据中检测等位基因特异性RNA二级结构(ASRS)。AStruct在模拟数据集和公共icSHAPE数据集中均显示出强大的性能。我们发现,具有较高AStruct评分的单核苷酸多态性(SNP)在编码区富集并倾向于功能性。这些SNP是高度保守的,有可能破坏参与m6A修饰或蛋白质结合的位点,并且经常与疾病相关。
    结论:AStruct是一种工具,专门用于在基于RT-stop的结构探测数据中的杂合SNP处调用等位基因特异性RNA二级结构事件。它利用等位基因变体,不同小区条件下的碱基配对和RT-stop信息,以检测动态和功能性ASRS。与基于序列的工具相比,AStruct考虑了动态细胞条件,并在检测功能变体方面表现出色。AStruct以JAVA实现,可在以下网址免费访问:https://github.com/canceromics/AStruct。
    BACKGROUND: Uncovering functional genetic variants from an allele-specific perspective is of paramount importance in advancing our understanding of gene regulation and genetic diseases. Recently, various allele-specific events, such as allele-specific gene expression, allele-specific methylation, and allele-specific binding, have been explored on a genome-wide scale due to the development of high-throughput sequencing methods. RNA secondary structure, which plays a crucial role in multiple RNA-associated processes like RNA modification, translation and splicing, has emerged as an essential focus of relevant research. However, tools to identify genetic variants associated with allele-specific RNA secondary structures are still lacking.
    RESULTS: Here, we develop a computational tool called \'AStruct\' that enables us to detect allele-specific RNA secondary structure (ASRS) from RT-stop based structuromic probing data. AStruct shows robust performance in both simulated datasets and public icSHAPE datasets. We reveal that single nucleotide polymorphisms (SNPs) with higher AStruct scores are enriched in coding regions and tend to be functional. These SNPs are highly conservative, have the potential to disrupt sites involved in m6A modification or protein binding, and are frequently associated with disease.
    CONCLUSIONS: AStruct is a tool dedicated to invoke allele-specific RNA secondary structure events at heterozygous SNPs in RT-stop based structuromic probing data. It utilizes allelic variants, base pairing and RT-stop information under different cell conditions to detect dynamic and functional ASRS. Compared to sequence-based tools, AStruct considers dynamic cell conditions and outperforms in detecting functional variants. AStruct is implemented in JAVA and is freely accessible at: https://github.com/canceromics/AStruct .
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    变异效应的多重分析是分析罕见变异对基因表达和生物体适应性的影响的有力方法。然而,很少有研究整合了几个多重分析来绘制编码序列中变异对基因表达的影响。这里,我们开创了一种基于多聚体谱分析的多重检测方法,以测量尺度上对翻译的变异效应,发现增加或减少核糖体负载的单核苷酸变体。通过将高通量核糖体负荷数据与多重mRNA和蛋白质丰度读数相结合,我们绘制了从RNA到蛋白质的数千种儿茶酚-O-甲基转移酶(COMT)变体的顺式调控图,并发现了许多改变COMT表达的编码变体。最后,我们训练了机器学习模型以映射COMT基因表达的变异效应特征,并揭示了跨表达层的方向性和发散性影响。我们的分析揭示了COMT中数千个变体的表达表型,并强调了变体对单层和多层表达的影响。我们的发现促使未来的研究整合了几种用于基因表达读出的多重测定。
    Multiplexed assays of variant effect are powerful methods to profile the consequences of rare variants on gene expression and organismal fitness. Yet, few studies have integrated several multiplexed assays to map variant effects on gene expression in coding sequences. Here, we pioneered a multiplexed assay based on polysome profiling to measure variant effects on translation at scale, uncovering single-nucleotide variants that increase or decrease ribosome load. By combining high-throughput ribosome load data with multiplexed mRNA and protein abundance readouts, we mapped the cis-regulatory landscape of thousands of catechol-O-methyltransferase (COMT) variants from RNA to protein and found numerous coding variants that alter COMT expression. Finally, we trained machine learning models to map signatures of variant effects on COMT gene expression and uncovered both directional and divergent impacts across expression layers. Our analyses reveal expression phenotypes for thousands of variants in COMT and highlight variant effects on both single and multiple layers of expression. Our findings prompt future studies that integrate several multiplexed assays for the readout of gene expression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Letter
    背景:ATM激酶构成了DNA损伤的主要调控中心,并通过磷酸化MDM2蛋白激活p53反应途径,对p53mRNA二级结构产生亲和力。这种相互作用的破坏阻止了新生p53的激活。MDM2蛋白-p53mRNA与上游DNA损伤传感器ATM激酶相互作用的联系以及p53mRNA在DNA损伤感知机制中的作用,仍然备受期待。
    方法:邻近连接测定(PLA)已被广泛用于揭示蛋白质-mRNA和蛋白质-蛋白质相互作用的亚细胞定位。ELISA和免疫共沉淀证实了体外和细胞中的相互作用。
    结果:这项研究提供了一种新的机制,使p53mRNA与ATM激酶相互作用,并表明L22L同义突变体,已知改变p53mRNA的二级结构,阻止互动。在DNA损伤感知途径中的相关机制作用,这与下游的DNA损伤反应有关,正在探索。DNA损伤(双链DNA断裂激活ATM)后,激活的MDMX蛋白竞争ATM-p53mRNA相互作用,并阻止p53mRNA与NBS1(MRN复合物)的关联。这些数据还揭示了ATM上的结合结构域和磷酸化事件,其调节复合物到细胞质的相互作用和运输。
    结论:提出的模型显示了ATM与p53mRNA的新型相互作用,并描述了DNA损伤传感与下游p53激活途径之间的联系;支持改变二级mRNA结构的同义突变的功能含义上升。
    The ATM kinase constitutes a master regulatory hub of DNA damage and activates the p53 response pathway by phosphorylating the MDM2 protein, which develops an affinity for the p53 mRNA secondary structure. Disruption of this interaction prevents the activation of the nascent p53. The link of the MDM2 protein-p53 mRNA interaction with the upstream DNA damage sensor ATM kinase and the role of the p53 mRNA in the DNA damage sensing mechanism, are still highly anticipated.
    The proximity ligation assay (PLA) has been extensively used to reveal the sub-cellular localisation of the protein-mRNA and protein-protein interactions. ELISA and co-immunoprecipitation confirmed the interactions in vitro and in cells.
    This study provides a novel mechanism whereby the p53 mRNA interacts with the ATM kinase enzyme and shows that the L22L synonymous mutant, known to alter the secondary structure of the p53 mRNA, prevents the interaction. The relevant mechanistic roles in the DNA Damage Sensing pathway, which is linked to downstream DNA damage response, are explored. Following DNA damage (double-stranded DNA breaks activating ATM), activated MDMX protein competes the ATM-p53 mRNA interaction and prevents the association of the p53 mRNA with NBS1 (MRN complex). These data also reveal the binding domains and the phosphorylation events on ATM that regulate the interaction and the trafficking of the complex to the cytoplasm.
    The presented model shows a novel interaction of ATM with the p53 mRNA and describes the link between DNA Damage Sensing with the downstream p53 activation pathways; supporting the rising functional implications of synonymous mutations altering secondary mRNA structures.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号