Orthologous

直系
  • 文章类型: Review
    棉属是理解多倍体和遗传进化模式的模型属。本研究旨在研究不同棉种的SPLs的特征及其在纤维发育中的作用。根据系统发育分析,将一种典型的单子叶植物和十种双子叶植物的891个基因自然分为三类。棉花中的SCPL基因家族经历了强烈的纯化选择,并具有一些功能变异。分段复制和全基因组复制被证明是棉花进化过程中基因数量增加的两个主要原因。在特定组织中表现出差异表达或对环境刺激的响应的Gh_SCPL基因的鉴定为深入表征所选重要基因提供了新的手段。Ga09G1039参与了纤维和胚珠的发育过程,它在系统发育方面与其他棉花品种的蛋白质有很大不同,基因结构,保守的蛋白质基序和三级结构。Ga09G1039的过表达显着增加了茎毛状体的长度。Ga09G1039可能是一种具有水解酶活性的丝氨酸羧肽酶蛋白,根据功能区域,原核表达,和西方印迹分析。该结果全面概述了棉花中SCPL的遗传基础,并进一步了解了棉花中SCPL的关键方面及其在纤维发育和抗逆性中的潜在作用。
    The Gossypium is a model genus for understanding polyploidy and the evolutionary pattern of inheritance. This study aimed to investigate the characteristics of SCPLs in different cotton species and their role in fiber development. A total of 891 genes from one typical monocot and ten dicot species were naturally divided into three classes based on phylogenetic analysis. The SCPL gene family in cotton has undergone intense purifying selection with some functional variation. Segmental duplication and whole genome duplication were shown to be the two main reasons for the increase in the number of genes during cotton evolution. The identification of Gh_SCPL genes exhibiting differential expression in particular tissues or response to environmental stimuli provides a new measure for the in-depth characterization of selected genes of importance. Ga09G1039 was involved in the developmental process of fibers and ovules, and it is significantly different from proteins from other cotton species in terms of phylogenetic, gene structure, conserved protein motifs and tertiary structure. Overexpression of Ga09G1039 significantly increased the length of stem trichomes. Ga09G1039 may be a serine carboxypeptidase protein with hydrolase activity, according to functional region, prokaryotic expression, and western blotting analysis. The results provide a comprehensive overview of the genetic basis of SCPLs in Gossypium and further our knowledge in understanding the key aspects of SCPLs in cotton with their potential role in fiber development and stress resistance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    考虑到赛马的个性特征(例如,飞行,焦虑,和亲和力)被认为对提高培训效率和减少事故频率至关重要,特别是在参加第二职业的再培训时,可能涉及从赛车退役后与缺乏经验的人员接触。人类人格相关基因的研究经常进行;然而,这样的研究在马匹中很少见,因为缺乏一致的人格评估方法。利用最近公布的101匹纯种马的全基因组变异数据库,我们比较了与大五人格特质相关的马基因和人类基因,并在马中鉴定出18个与人格相关的候选基因。这些基因包括55种变体,其涉及高度影响编码的蛋白质的非同义取代。此外,我们根据参考氨基酸和改变氨基酸之间的分子量和疏水性水平的差异评估了等位基因频率和对蛋白质的功能影响。我们确定了15个新发现的可能影响马人格的基因,但他们与人格的联系仍不清楚。尽管需要更多的研究来比较遗传和行为信息来验证这种方法,在有限的条件下,它可能对人格评估有用。
    Considering the personality traits of racehorses (e.g., flightiness, anxiety, and affability) is considered essential to improve training efficiency and decrease accident frequency, especially when retraining for a second career that may involve contact with inexperienced personnel after retiring from racing. Studies on human personality-related genes are frequently conducted; however, such studies are rare in horses because a consistent methodology for personality evaluation is lacking. Using the recently published whole genome variant database of 101 Thoroughbred horses, we compared horse genes orthologous to human genes related to the Big Five personality traits, and identified 18 personality-related candidate genes in horses. These genes include 55 variants that involve non-synonymous substitutions that highly impact the encoded protein. Moreover, we evaluated the allele frequencies and functional impact on the proteins in terms of the difference in molecular weights and hydrophobicity levels between reference and altered amino acids. We identified 15 newly discovered genes that may affect equine personality, but their associations with personality are still unclear. Although more studies are required to compare genetic and behavioral information to validate this approach, it may be useful under limited conditions for personality evaluation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    由于遗传数据有限,Sciaenidae家族的系统发育状态存在不确定性和争议。在这项研究中,一个69,098bp的数据集,涵盖309个共享直系同源基因,从属于Sciaenidae家族的12种物种的18个基因组和5个转录组中提取,并用于系统发育分析。使用最大似然(ML)和贝叶斯方法(BA)方法来重建系统发育树。解析的ML和BA树显示出相似的拓扑结构,从而揭示了Sciaenidae家族中的两个主要进化谱系,即,西大西洋(WA)和东大西洋-印度-西太平洋(EIP)。WA组包括属于四个属的四个物种:Cynoscion nbulosus,马尾刺,Sciaenopsocellatus,和微孢子虫。同时,EIP组形成了一个单系进化枝,藏有8种(Argylesomusregius,A.刺槐,Pennahiaanea,Nibeaalbiflora,Miichthysmiiuy,Collichthyslucidus,Larimichthyspolyactis,和大黄鱼)来自六个属。我们的结果表明,在所研究的物种中,西大西洋(WA)群更为古老,而东大西洋-印度-西太平洋(EIP)组是一个年轻的组。在所研究的物种中,Collichthys和Larmichthys属是最年轻的血统,我们不建议将Collichthys和Larmichthys视为一个属。然而,由于缺乏基因组,Sciaenidae家族的起源和有关基础属的问题尚未解决。因此,需要进一步的采样和测序工作.
    Uncertainty and controversy exist in the phylogenetic status of the Sciaenidae family because of the limited genetic data availability. In this study, a data set of 69,098 bp, covering 309 shared orthologous genes, was extracted from 18 genomes and 5 transcriptomes of 12 species belonging to the Sciaenidae family and used for phylogenetic analysis. The maximum likelihood (ML) and Bayesian approach (BA) methods were used to reconstruct the phylogenetic trees. The resolved ML and BA trees showed similar topology, thus revealing two major evolutionary lineages within the Sciaenidae family, namely, Western Atlantic (WA) and Eastern Atlantic−Indo−West Pacific (EIP). The WA group included four species belonging to four genera: Cynoscion nebulosus, Equetus punctatus, Sciaenops ocellatus, and Micropogonias undulatus. Meanwhile, the EIP group formed one monophyletic clade, harboring eight species (Argyrosomus regius, A. japonicus, Pennahia anea, Nibea albiflora, Miichthys miiuy, Collichthys lucidus, Larimichthys polyactis, and L. crocea) from six genera. Our results indicated that the Western Atlantic (WA) group was more ancient in the studied species, while the Eastern Atlantic−Indo−West Pacific (EIP) group was a younger group. Within the studied species, the genera Collichthys and Larmichthys were the youngest lineages, and we do not suggest that Collichthys and Larmichthys should be considered as one genus. However, the origin of the Sciaenidae family and problems concerning the basal genus were not resolved because of the lack of genomes. Therefore, further sampling and sequencing efforts are needed.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    GF14蛋白是一个保守蛋白家族,参与许多细胞过程,包括转运,增长,新陈代谢,和应激反应。然而,关于马铃薯中14-3-3基因的报道很少。在这项研究中,在马铃薯基因组中检测到12个14-3-3基因。基于它们的系统发育关系,StGF14家族成员分为两类.基因表达分析表明,StGF14h,StGF14a,StGF14k的基因表达量最高,在所有三个组织中由非生物和生物胁迫诱导。14-3-3基因的外显子数量为4至7个,同一亚家族中的大多数这些基因具有相似的外显子-内含子模式。我们的研究结果表明,每组中大多数蛋白质的保守基序相似。内含子-外显子模式和保守基序的组成验证了14-3-3基因的系统发育分类。根据基因组分布结果,14-3-3基因不均匀地位于12条马铃薯染色体上。我们发现了马铃薯和拟南芥之间的97个直系同源基因对以及马铃薯基因组中的15个同源基因。我们的结果表明,GF-14基因在响应环境胁迫的功能和分子机制中具有有效作用。
    GF14 proteins are a family of conserved proteins involved in many cellular processes including transport, growth, metabolism, and stress response. However, only few reports are available regarding the 14-3-3 genes in potato. In this study, twelve 14-3-3 genes were detected in the potato genome. Based on their phylogenetic relationships, the StGF14 family members were categorized into two classes. Gene expression analysis demonstrated that StGF14h, StGF14a, and StGF14k had the highest gene expression, induced by abiotic and biotic stresses in all three tissues. The number of exons in 14-3-3 genes ranged from four to seven and most of these genes in the same subfamily had similar exon-intron patterns. The results of our study showed that the conserved motifs are similar in most of the proteins in each group. The intron-exon patterns and the composition of conserved motifs validated the 14-3-3 gene phylogenetic classification. According to the genome distribution results, 14-3-3 genes were located unevenly on the 12 Solanum tuberosum chromosomes. We find out 97 orthologous gene pairs between potato and Arabidopsis as well as 15 paralogous genes among potato genomes. Our results showed that GF-14 genes have an effective role in functional and molecular mechanisms in response to environmental stresses.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    结论:在小麦三个主要品质性状的Meta分析中,鉴定出110个置信区间(CI)降低的meta-QTL(MQTL)。五个GWAS验证的MQTL(即,1A.1,1B.2,3B.4,5B.2和6B.2),每个涉及超过20个初始QTL和减少的CI(95%)(<2cM),被选入优质育种计划。包括候选基因挖掘和表达分析在内的功能表征发现了44个与质量性状相关的高置信度候选基因。与面团流变学特性相关的数量性状位点(QTL)的荟萃分析,营养性状,并对小麦的加工品质性状进行了研究。为此,从2013-2020年发表的50项区间作图研究中收集了多达2458个QTL.在总QTL中,将1126个QTL投影到具有249,603个标记的共有图谱上,从而鉴定出110个元QTL(MQTL)。与初始QTL的平均CI相比,这些MQTL的平均CI降低了18.84倍(范围为14.87至95.55cM,平均为40.35cM)。在110个MQTL中,108个MQTL被物理锚定到小麦参考基因组,包括通过早期全基因组关联研究报告的标记-性状关联(MTA)验证的51个MQTL。候选基因(CG)挖掘允许从MQTL区域鉴定2533个独特的基因模型。计算机表达分析发现439个差异表达基因模型,在谷物和相关组织中每百万表达>2个转录本,其中还包括44个高置信度CG,涉及与质量性状相关的各种细胞和生化过程。与谷物蛋白质含量相关的九个功能特征小麦基因,高分子量谷蛋白,还发现淀粉合酶与一些MQTL共定位。小麦和水稻MQTL区域之间的合成分析确定了23个小麦MQTL与16个水稻MQTL与品质性状相关。此外,在44个MQTL区域检测到30个已知水稻基因的64个小麦直系同源物。本研究中鉴定的MQTL侧翼标记可用于标记辅助育种,并在基因组选择模型中用作固定效应,以提高优质育种过程中的预测准确性。来自MQTL的水稻基因和其他CGs的小麦直系同源物可以成为进一步功能验证和更好地了解小麦品质性状背后的分子机制的有希望的目标。
    CONCLUSIONS: Meta-analysis in wheat for three major quality traits identified 110 meta-QTL (MQTL) with reduced confidence interval (CI). Five GWAS validated MQTL (viz., 1A.1, 1B.2, 3B.4, 5B.2, and 6B.2), each involving more than 20 initial QTL and reduced CI (95%) (< 2 cM), were selected for quality breeding programmes. Functional characterization including candidate gene mining and expression analysis discovered 44 high confidence candidate genes associated with quality traits. A meta-analysis of quantitative trait loci (QTL) associated with dough rheology properties, nutritional traits, and processing quality traits was conducted in wheat. For this purpose, as many as 2458 QTL were collected from 50 interval mapping studies published during 2013-2020. Of the total QTL, 1126 QTL were projected onto the consensus map saturated with 249,603 markers which led to the identification of 110 meta-QTL (MQTL). These MQTL exhibited an 18.84-fold reduction in the average CI compared to the average CI of the initial QTL (ranging from 14.87 to 95.55 cM with an average of 40.35 cM). Of the 110, 108 MQTL were physically anchored to the wheat reference genome, including 51 MQTL verified with marker-trait associations (MTAs) reported from earlier genome-wide association studies. Candidate gene (CG) mining allowed the identification of 2533 unique gene models from the MQTL regions. In-silico expression analysis discovered 439 differentially expressed gene models with > 2 transcripts per million expressions in grains and related tissues, which also included 44 high-confidence CGs involved in the various cellular and biochemical processes related to quality traits. Nine functionally characterized wheat genes associated with grain protein content, high-molecular-weight glutenin, and starch synthase enzymes were also found to be co-localized with some of the MQTL. Synteny analysis between wheat and rice MQTL regions identified 23 wheat MQTL syntenic to 16 rice MQTL associated with quality traits. Furthermore, 64 wheat orthologues of 30 known rice genes were detected in 44 MQTL regions. Markers flanking the MQTL identified in the present study can be used for marker-assisted breeding and as fixed effects in the genomic selection models for improving the prediction accuracy during quality breeding. Wheat orthologues of rice genes and other CGs available from MQTLs can be promising targets for further functional validation and to better understand the molecular mechanism underlying the quality traits in wheat.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    棘球青霉2HH和草酸青霉114-2是众所周知的纤维素酶真菌生产者。然而,到目前为止,针对这两种重要生物基因调控的全球机制的研究很少。最近发现的2HH野生型与草酸疟原虫密切相关,这导致了对这两个物种的联合研究。首先,我们提供了基于TF-TG正交关系的棘球蚴2HH和草酸草114-2的全球基因调控网络,考虑具有众所周知的调节相互作用的三个相关物种,并结合TFBS预测。然后根据拓扑结构分析了网络,将TFs识别为集线器,和模块。基于这种方法,我们探索了许多确定的模块,如纤维素分解和木聚糖分解系统的表达,其中XlnR在木聚糖分解系统的正向调节中起关键作用。它还通过纤维糊精诱导系统间接作用而积极调节纤维素分解系统。这一显著的发现表明,XlnR-依赖性纤维素分解和木聚糖分解调节系统可能在棘球芽孢杆菌和草酸芽孢杆菌中都是保守的。最后,我们探索了在群落方面聚集的基因的功能一致性,与细胞氮相关的基因,复合代谢过程和大分子代谢过程最为丰富。因此,我们的方法使我们能够对每个推断的相互作用的存在赋予一定程度的准确性。
    Penicillium echinulatum 2HH and Penicillium oxalicum 114-2 are well-known cellulase fungal producers. However, few studies addressing global mechanisms for gene regulation of these two important organisms are available so far. A recent finding that the 2HH wild-type is closely related to P. oxalicum leads to a combined study of these two species. Firstly, we provide a global gene regulatory network for P. echinulatum 2HH and P. oxalicum 114-2, based on TF-TG orthology relationships, considering three related species with well-known regulatory interactions combined with TFBSs prediction. The network was then analyzed in terms of topology, identifying TFs as hubs, and modules. Based on this approach, we explore numerous identified modules, such as the expression of cellulolytic and xylanolytic systems, where XlnR plays a key role in positive regulation of the xylanolytic system. It also regulates positively the cellulolytic system by acting indirectly through the cellodextrin induction system. This remarkable finding suggests that the XlnR-dependent cellulolytic and xylanolytic regulatory systems are probably conserved in both P. echinulatum and P. oxalicum. Finally, we explore the functional congruency on the genes clustered in terms of communities, where the genes related to cellular nitrogen, compound metabolic process and macromolecule metabolic process were the most abundant. Therefore, our approach allows us to confer a degree of accuracy regarding the existence of each inferred interaction.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    BACKGROUND: DELLAs play key roles in plant gibberellin signaling pathways and are generally important in plant development and growth. However, DELLAs in many plant taxa have not yet been systematically analyzed.
    RESULTS: In our study, we searched for DELLA genes across 58 green plant genomes and found 181 DELLAs. Structure analysis showed some DELLA domains do not contain \"D-E-L-L-A\" sequences and instead contain similar domains, including DGLLA and DSLLH domains. \"VHYNP\" motifs in plant DELLAs comprise 23 types of sequences, while some DELLAs did not contain GRAS domains. In grape, we found that the DELLA protein GSVIVT01015465001 contains an F-box domain, while apple DELLA proteins MDP0000220512 and MDP0000403162 contain a WW domain and a BCIP domain, respectively. These DELLAs can be divided into 22 homologous groups and 17 orthologous groups, and 35 paralogous genes were identified. In total, 35 positively selected genes (PSGs) and 121 negatively selected genes (NSGs) were found among DELLAs based on selective pressure analysis, with an average Ks of NSGs that was significantly higher than that of PSGs (P < 0.05). Among the paralogous groups, CBI and Fop were significantly positively correlated with GC, GC1, GC2, GC12, and GC3, while CAI was significantly positively correlated with GC, GC1, GC12, and GC. The paralogous groups with ω values exceeding 1 had significantly higher Ka values. We also found some paralogous groups with ω values exceeding 1 that differed in their motifs.
    CONCLUSIONS: This study provides helpful insights into the evolution of DELLA genes and offers exciting opportunities for the investigation of DELLA functions in different plants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

  • 文章类型: Journal Article
    小热休克蛋白(sHSPs)是昆虫生存的重要调节剂。先前的研究表明,昆虫中只有一个直系同源的shsps簇。这里,通过比较分析,我们在昆虫中发现了另一个新的直系同源shsps簇。在储存的谷物害虫Triboliumcastaneum中,对属于该直系同源簇的Tchsp21.8a和7种特定物种的shsps进行了多次胁迫实验和功能研究。结果表明,Tchsp21.8a的表达对不同胁迫的反应较弱。然而,大多数物种特异性shsps的表达对热应激表现出过度反应,所有物种特异性shsps的表达在其他压力下表现出不同的反应,以合作方式保护甲虫。此外,Tchsp21.8a和物种特异性Tcshsp19.7在蓖麻的发育中起着重要作用,所有Tcshsps对繁殖力都有一定的影响。我们的工作创建了一个全面可靠的昆虫shsps支架,可以进一步为害虫生物防治提供有益的见解。
    Small heat shock proteins (sHSPs) are important modulators of insect survival. Previous research revealed that there is only one orthologous cluster of shsps in insects. Here, we identified another novel orthologous cluster of shsps in insects by comparative analysis. Multiple stress experiments and function investigation of Tchsp21.8a belonging to this orthologous cluster and seven species-specific shsps were performed in the stored-grain pest Tribolium castaneum. The results indicated that expression of Tchsp21.8a showed weak responses to different stresses. However, expressions of most species-specific shsps exhibited hyper-responses to heat stress, and expressions of all species-specific shsps displayed diverse responses during other stresses to protect beetles in a cooperative manner. Additionally, Tchsp21.8a and species-specific Tcshsp19.7 played important roles in the development of T. castaneum, and all Tcshsps had a certain impact on the fecundity. Our work created a comprehensive reliable scaffold of insect shsps that can further provide instructive insights to pest bio-control.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Circular RNAs (circRNAs) are a new type of intracellular regulator that have been widely identified in animals and plants by high-throughput sequencing. However, there are still few functional studies on circRNAs in plants. To better understand maize circRNAs and their potential functions, we identified 1199 circRNAs in maize from RiboMinus RNA-Seq transcriptome data, and found distinct features of splicing site selection bias, longer flanking introns, and miniature inverted-repeat transposable element (MITE) insertions in flanking introns in maize circRNAs compared to other plant circRNAs. In total, 31 and 36 orthologous circRNAs were identified in rice and maize, respectively, but the orthologous parental genes could not produce orthologous circRNAs, mostly because of long-sequence insertions/deletions at flanking introns and approximately 24.3% of them contained MITE sequences. The majority of maize circRNAs showed high diversity of expression under different treatments and/or in different genetic backgrounds, implying that circRNAs could be involved in various regulatory networks. Twenty-six ecircRNAs were predicted to contain one or more target mimics, and 229 circRNAs had high coding potential, indicating that circRNAs could perform peptide-encoding functions in plants. These results will broaden understanding of the roles of circRNAs in plants and support further functional work on maize.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    Essential proteins have vital functions, when they are destroyed in cells, the cells will die or stop reproducing. Therefore, it is very important to identify essential proteins from a large number of other proteins. Due to the time-consuming, expensive, and inefficient process in biological experimental methods, computational methods become more and more popular to recognize them. In the early stages, these methods mainly rely on protein-protein interaction (PPI) information, which limits their discovery capacities. Researchers find novel methods by fusing multi-information to improve prediction accuracy. According to these features, essential proteins are more important and conservative in the evolution process, their neighbors in PPI networks are usually likely to be essential, there are many false positives in PPI data, whether a protein is essential can be assessed by the importance of a protein itself, the relevance of neighbors and the reliability of PPIs. The importance of neighbors and the reliability of PPIs can be further integrated into neighborhood feature. In the study, orthologous information, edge-clustering coefficient and gene expression information are used to measure the importance of a protein itself, the importance of the neighbors and the reliability of PPIs, respectively. Then, a novel expanded POC model, E_POC, is proposed to fuse the above information to discover essential proteins, a weighted PPI network is constructed. The proteins ranked high according to their weights are treated as candidate essential proteins. This novel method is named as E_POC. E_POC outperforms the existing classical methods on S. cerevisiae and E. coli data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号