GC-rich

富含 GC
  • 文章类型: Journal Article
    随着生物合成基因簇(BGC)的快速自动化计算机识别的出现,基因组学为加速天然产物(NP)的发现提供了巨大的机会。然而,多产的NP生产者,链霉菌,特别富含GC(>80%)并且在BGC内高度重复。这些在测序和高质量基因组组装中提出了挑战,目前通过密集测序得以规避。这里,我们概述了使用多重Illumina和OxfordNanopore测序以及混合长短阅读组装算法生成高质量基因组的更具成本效益的工作流程。我们的方案涉及对长读取衍生的组件进行长达4轮的短读取抛光,以产生准确的BGC预测。我们成功地测序并组装了8个富含GC的链霉菌基因组,其长度范围为7.1至12.1Mb,N50中位数为8.2Mb。分类学分析揭示了这些菌株以前的错误陈述,并允许我们提出一个潜在的新物种,Sydneybrenneri链霉菌。进一步全面表征其生物合成,泛基因组和抗生素抗性特征,尤其是源自I型聚酮合成酶(PKS)BGC的分子,反映了它们作为替代NP宿主的潜力。因此,这里提出的基因组组装和见解被设想为科学界扩大NP发现途径的门户。
    With the advent of rapid automated in silico identification of biosynthetic gene clusters (BGCs), genomics presents vast opportunities to accelerate natural product (NP) discovery. However, prolific NP producers, Streptomyces, are exceptionally GC-rich (>80%) and highly repetitive within BGCs. These pose challenges in sequencing and high-quality genome assembly which are currently circumvented via intensive sequencing. Here, we outline a more cost-effective workflow using multiplex Illumina and Oxford Nanopore sequencing with hybrid long-short read assembly algorithms to generate high quality genomes. Our protocol involves subjecting long read-derived assemblies to up to 4 rounds of polishing with short reads to yield accurate BGC predictions. We successfully sequenced and assembled 8 GC-rich Streptomyces genomes whose lengths range from 7.1 to 12.1 Mb with a median N50 of 8.2 Mb. Taxonomic analysis revealed previous misrepresentation among these strains and allowed us to propose a potentially new species, Streptomyces sydneybrenneri. Further comprehensive characterization of their biosynthetic, pan-genomic and antibiotic resistance features especially for molecules derived from type I polyketide synthase (PKS) BGCs reflected their potential as alternative NP hosts. Thus, the genome assemblies and insights presented here are envisioned to serve as gateway for the scientific community to expand their avenues in NP discovery.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:手足生殖器综合征(HFGS)是一种常染色体显性遗传疾病,其特征是具有广泛的表型谱。HOXA13基因变异与HFGS相关。迄今为止,据报道,只有20个家庭患有HFGS。然而,HFGS的挑战是有限的样本量和表型异质性.下一代测序的出现允许鉴定HOXA13变异的患者,这些患者没有表现出完整的HFGS综合征特征。
    方法:本研究进行了Trio(父母先证者)全外显子组测序(WES)和全基因组测序(WGS),以研究具有多种临床异常的新生儿的潜在致病遗传因素。
    结果:Trio-WES未检测到可能的致病变异,和HOXA13中的重复变体(c.360_377dup,p.Ala128_Ala133dup),从她母亲那里继承下来,随后的WGS在先证者中发现营养不良,喂养困难,电解质紊乱,代谢性酸中毒,复发性尿路感染,肾积水,肾结石,输尿管形态异常,胆石症,子宫didelphys。变异区(外显子1)的序列分析表明73.92%的高GC含量。此外,对家族史的进一步调查显示,该家族4代中有5名成员有手脚异常。
    结论:通过基因分析,新生儿被诊断为HFGS。与WES分析相比,GC含量对WGS中序列覆盖率的影响较小。这是用于HFGS基因诊断的Trio-WGS研究的第一份报告,揭示了随后的WGS对于鉴定无法解释的遗传疾病中的潜在致病变异是必要的。
    BACKGROUND: Hand-Foot-Genital Syndrome (HFGS) is an autosomal dominant disorder characterized by a broad phenotypic spectrum. Variants in HOXA13 gene were associated with HFGS. To date, only twenty families with HFGS have been reported. However, the challenge in HFGS is the limited sample sizes and phenotypic heterogeneity. The advent of next-generation sequencing has permitted the identification of patients with HOXA13 variants who do not manifest with the full HFGS syndromic features.
    METHODS: Trio (parents-proband) Whole-exome sequence(WES) and whole-genome sequencing(WGS) was carried out in this study to investigate the underlying pathogenic genetic factor of the neonate with a wide variety of clinical abnormalities.
    RESULTS: No possible pathogenetic variation was detected by trio-WES, and a duplication variant in HOXA13 (c.360_377dup, p.Ala128_Ala133dup), inherited from her mother, was identified by the subsequent WGS in the proband with malnutrition, feeding difficulties, electrolyte disorders, metabolic acidosis, recurrent urinary tract infections, hydronephrosis, nephrolithiasis, abnormal ureter morphology, cholelithiasis, uterus didelphys. Sequence analysis of the variant region (exon1) indicated a high GC content of 73.92%. In addition, further enquiry of the family history revealed that 5 members of the family in 4 generations had hand and foot anomalies.
    CONCLUSIONS: The neonate was diagnosed with HFGS by genetic analysis. GC content had less influence on sequence coverage in WGS than WES analysis. This was the first report of trio-WGS study for HFGS genetic diagnosis, revealed that subsequent WGS was necessary for identification of potentially pathogenic variants in unexplained genetic disorders.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    短发夹RNA,或短发夹RNA(shRNA),是基因敲低的成熟工具,也是抑制疾病相关基因的有前途的治疗方法。由于shRNA发夹序列和结构的复杂性,shRNA表达载体的高效制备有时会成为瓶颈,特别是对于重复或高GC含量的目标。这里,我们提出了改进的shRNA克隆和验证方法,这些方法能够高效快速地克隆针对疾病相关重复扩增的几种shRNA,包括GGGGCC,CAG,CTG,CCTG,和CGG进入修饰的pLKO.1载体。改进包括shRNA插入物设计和制备,基于重组的克隆,和基于测序的验证,包括Sanger和纳米孔长读数测序。这种改进的方法应该能够实用,几乎任何shRNA序列的有效克隆。
    Short hairpin RNAs, or short hairpin RNAs (shRNAs), are a proven tool for gene knockdown and a promising therapeutic approach for suppression of disease-associated genes. The efficient preparation of shRNA-expressing vectors can sometimes become a bottleneck due to the complexity of shRNA hairpin sequence and structure, especially for repetitive or high GC-content targets. Here, we present improved shRNA cloning and validation methods that enabled efficient and rapid cloning of several shRNAs targeting disease-associated repeat expansions, including GGGGCC, CAG, CTG, CCTG, and CGG into modified pLKO.1 vectors. Improvements included shRNA insert design and preparation, recombination-based cloning, and sequencing-based validation that included Sanger and nanopore long-read sequencing. This improved method should enable practical, efficient cloning of nearly any shRNA sequence.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    The nonsense-mediated mRNA decay (NMD) pathway rapidly detects and degrades mRNA containing premature termination codons (PTCs). UP-frameshift 1 (UPF1), the master regulator of the NMD process, has two alternatively-spliced isoforms; one carries 353-GNEDLVIIWLR-363 insertion in the \'regulatory loop (involved in mRNA binding)\'. Such insertion can induce catalytic and/or ATPase activity, as determined experimentally; however, the kinetics and molecular level information are not fully understood. Herein, applying all-atom molecular dynamics, we probe the binding specificity of UPF1 with different GC- and AU-rich mRNA motifs and the influence of insertion to the viable control over UPF1 catalytic activity. Our results indicate two distinct conformations between 1B and RecA2 domains of UPF1: \'open (isoform_2; without insertion)\' and \'closed (isoform_1; with insertion)\'. These structural movements correspond to an important stacking pattern in mRNA motifs, i.e., absence of stack formation in mRNA, with UPF1 isoform_2 results in the \'open conformation\'. Particularly, for UPF1 isoform_1, the increased distance between 1B and RecA2 domains has resulted in reducing the mRNA-UPF1 interactions. Lower fluctuating GC-rich mRNA motifs have better binding with UPF1, compared with AU-rich sequences. Except CCUGGGG, all other GC-rich motifs formed a 4-stack pattern with UPF1. High occupancy R363, D364, T627, and G862 residues were common binding GC-rich motifs, as were R363, N535, and T627 for the AU-rich motifs. The GC-rich motifs behave distinctly when bound to either of the isoforms; lower stability was observed with UPF1 isoform_2. The cancer-associated UPF1 variants (P533L/T and A839T) resulted in decreased protein-mRNA binding efficiency. Lack of mRNA stacking poses in the UPF1P533T system significantly decreased UPF1-mRNA binding efficiency and increased distance between 1B-RecA2. These novel findings can serve to further inform NMD-associated mechanistic and kinetic studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Peste des petits ruminants virus (PPRV) causes a highly devastating disease of sheep and goats, that threatens the conservation of small wild ruminants. The development of PPRV vaccines, diagnostics and therapeutics, greatly depends on in-depth genomic data. Yet, high guanine-cytosine (GC) content between matrix (M) and fusion (F) genes of PPRV poses difficulty for both primer design and nucleotide amplification. In turn, this has led into absence or low nucleotide sequence coverage in this region. This poses a risk of missing important part of the genome that could help to infer viral evolution. Here, an overlapping long-read primer-based amplification strategy was developed to amplify the GC-rich fragments between M-F gene junction using nexus gradient polymerase chain reaction (PCR). The resulting amplicons were sequenced by dideoxynucleotide cycle sequencing and compared with other PPRV nucleotide sequences available at GenBank. Our findings indicate clear PCR amplification products with expected size of the GC-rich fragments on agarose gel electrophoresis. The sequencing results of these fragments indicate 99.5 % nucleotide identity with PPRV strain KY628761. An extremely difficult PCR target of 67.4 % GC contents was successfully amplified and sequenced using this long-read primer approach. The long-read primer set may be used in tiling multiplex PCR for complete genome sequencing of PPRV.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Сonstitutive heterochromatin areas are revealed by differential staining as C-positive chromosomal regions. These C-positive bands may greatly vary by location, size, and nucleotide composition. CBG-banding is the most commonly used method to detect structural heterochromatin in animals. The difficulty in identification of individual chromosomes represents an unresolved problem of this method as the body of the chromosome is stained uniformly and does not have banding pattern beyond C-bands. Here, we present the method that we called CDAG for sequential heterochromatin staining after differential GTG-banding. The method uses G-banding followed by heat denaturation in the presence of formamide with consecutive fluorochrome staining. The new technique is valid for the concurrent revealing of heterochromatin position due to differential banding of chromosomes and heterochromatin composition (AT-/GC-rich) in animal karyotyping.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Misidentification of the chicken leptin gene has hampered research of leptin signaling in this species for almost two decades. Recently, the genuine leptin gene with a GC-rich (~70%) repetitive-sequence content was identified in the chicken genome but without indicating its genomic position. This suggests that such GC-rich sequences are difficult to sequence and therefore substantial regions are missing from the current chicken genome assembly.
    A radiation hybrid panel of chicken-hamster Wg3hCl2 cells was used to map the genome location of the chicken leptin gene. Contrary to our expectations, based on comparative genome mapping and sequence characteristics, the chicken leptin was not located on a microchromosome, which are known to contain GC-rich and repetitive regions, but at the distal tip of the largest chromosome (1p). Following conserved synteny with other vertebrates, we also mapped five additional genes to this genomic region (ARF5, SND1, LRRC4, RBM28, and FLNC), bridging the genomic gap in the current Galgal5 build for this chromosome region. All of the short scaffolds containing these genes were found to consist of GC-rich (54 to 65%) sequences comparing to the average GC-content of 40% on chromosome 1. In this syntenic group, the RNA-binding protein 28 (RBM28) was in closest proximity to leptin. We deduced the full-length of the RBM28 cDNA sequence and profiled its expression patterns detecting a negative correlation (R = - 0.7) between the expression of leptin and of RBM28 across tissues that expressed at least one of the genes above the average level. This observation suggested a local regulatory interaction between these genes. In adipose tissues, we observed a significant increase in RBM28 mRNA expression in breeds with lean phenotypes.
    Mapping chicken leptin together with a cluster of five syntenic genes provided the final proof for its identification as the true chicken ortholog. The high GC-content observed for the chicken leptin syntenic group suggests that other similar clusters of genes in GC-rich genomic regions are missing from the current genome assembly (Galgal5), which should be resolved in future assemblies of the chicken genome.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • DOI:
    文章类型: Journal Article
    BACKGROUND: Myotonic dystrophies (DMs) are a group of autosomal dominant neuromuscular disorders which are caused by large CTG/CCTG-repeat expansions in untranslated regions of DMPK/ZNF9 gene. The \"phenotypic overlap\" in DMs creates complication in distinguishing patients with DM1 from patients with DM2 and underscores the need for these patients to undergo genetic test; therefore, detection and accurate sizing of the CTG/CCTG-repeat expansions are necessary. Templates with long CTG/CCTG tandem repeats are difficult to amplify by convention PCR.
    OBJECTIVE: The aim of our study was to develop an efficient, economic amplification method which based on combination of primer design, modified annealing, and extension conditions in PCR amplification.
    METHODS: We detected and analyzed the CTG-repeat expansions in patients having clinical, electrophysiological, and muscle pathology features indicative of DMs by optimization PCR. If no CTG-repeat expansions were detected, we subsequently analyzed the CCTG-repeat expansions in the remaining patients.
    RESULTS: 42 participants included 25 DMs patients and 17 family members. 22 patients showed CTG-repeat expansions, the CTG-repeat ranged from 53 to 683 and the average was 535; 3 patients showed CCTG-repeat expansions, the CCTG-repeat ranged from 400 to 450 and the average was 416.
    CONCLUSIONS: Molecular genetic tests are essential for DMs diagnosis; Optimization PCR under the optimal conditions of primer design, modified annealing, and extension conditions can be used for efficient PCR in DMs diagnosis; Optimization PCR can greatly improve the positive detection of DMs, provide an economic, accurate, and rapid method for routine diagnostic use.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    Allelic discrimination using TaqMan 5\'-nuclease assay chemistry has been in routine use for many years, and the catalog of Life Technologies\' predesigned SNP genotyping assays now exceeds 4 million entries. However, predesigned assays are often not available for genomic regions with a high GC content, nor can an assay necessarily be designed in this type of region using the manufacturer\'s design pipelines. Additionally, when an assay is available, the performance can be poor when using standard protocols. Here we report a modified allelic discrimination protocol for variants that reside in extremely GC-rich (GC > 75%) regions. The approach resolves fluorescent signal from reference and variant alleles, allowing all samples to be successfully assigned a genotype call. This protocol modification adds an extra step to the standard workflow, but the increased time is a productive compromise to generate high-quality data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    背景:聚合酶链反应(PCR)是一种极其敏感的方法,通常需要优化,特别是当困难的模板需要放大。本研究的目的是优化PCR条件,以扩增具有极高鸟嘌呤-胞嘧啶(GC)含量的表皮生长因子受体(EGFR)启动子序列,以检测单核苷酸多态性-216G>T和-191C>A。
    方法:从福尔马林固定石蜡包埋的肺癌组织中提取用于扩增的基因组DNA,并通过琼脂糖凝胶电泳检测PCR产物。
    结果:结果表明,添加5%二甲基亚砜(DMSO),以及PCR反应中的DNA浓度至少为2μg/ml,是成功扩增所必需的。由于GC含量高,最佳退火温度比计算温度高7°C,而适当的MgCl2浓度范围为1.5至2.0mM。
    结论:结论:EGFR启动子区是一个困难的PCR目标,但是在DMSO和可接受浓度的DNA模板存在下,优化MgCl2浓度和退火温度后可以扩增。
    BACKGROUND: Polymerase chain reaction (PCR) is an extremely sensitive method that often demands optimization, especially when difficult templates need to be amplified. The aim of the present study was to optimize the PCR conditions for amplification of the epidermal growth factor receptor (EGFR) promoter sequence featuring an extremely high guanine-cytosine (GC) content in order to detect single nucleotide polymorphisms -216G>T and -191C>A.
    METHODS: Genomic DNA used for amplification was extracted from formalin-fixed paraffin-embedded lung tumor tissue and PCR products were detected by agarose gel electrophoresis.
    RESULTS: Results showed that addition of 5% dimethyl sulfoxide (DMSO), as well as DNA concentration in PCR reaction of at least 2 μg/ml, were necessary for successful amplification. Due to high GC content, optimal annealing temperature was 7°C higher than calculated, while adequate MgCl2 concentration ranged from 1.5 to 2.0 mM.
    CONCLUSIONS: In conclusion, EGFR promoter region is a difficult PCR target, but it could be amplified after optimization of MgCl2 concentration and annealing temperature in the presence of DMSO and the DNA template of acceptable concentration.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号