Sequence features

序列特征
  • 文章类型: Journal Article
    推断基因调控网络(GRN)是系统生物学的重要挑战之一。和许多优秀的计算方法已经被提出;然而,仍然存在一些挑战,特别是在真实的数据集。在这项研究中,我们提出了基于有向图卷积神经网络的GRN推断方法(DGCGRN)。为了更好地理解和处理GRN的有向图结构数据,进行了有向图卷积神经网络,在保留有向图结构信息的同时,还充分利用了邻居节点特征。图神经网络采用局部增广策略解决了GRN中大量低度节点导致预测精度差的问题。此外,对于像大肠杆菌这样的真实数据,利用Bi-GRU提取隐藏特征,计算基因序列的统计理化特征,得到序列特征。在训练阶段,采用动态更新策略,将得到的边预测分数转换为边权重,指导模型后续的训练过程。在合成基准数据集和真实数据集上的结果表明,DGCGRN的预测性能明显优于现有模型。此外,膀胱尿路上皮癌和肺癌细胞的案例研究也说明了所提出模型的性能。
    Inferring gene regulatory network (GRN) is one of the important challenges in systems biology, and many outstanding computational methods have been proposed; however there remains some challenges especially in real datasets. In this study, we propose Directed Graph Convolutional neural network-based method for GRN inference (DGCGRN). To better understand and process the directed graph structure data of GRN, a directed graph convolutional neural network is conducted which retains the structural information of the directed graph while also making full use of neighbor node features. The local augmentation strategy is adopted in graph neural network to solve the problem of poor prediction accuracy caused by a large number of low-degree nodes in GRN. In addition, for real data such as E.coli, sequence features are obtained by extracting hidden features using Bi-GRU and calculating the statistical physicochemical characteristics of gene sequence. At the training stage, a dynamic update strategy is used to convert the obtained edge prediction scores into edge weights to guide the subsequent training process of the model. The results on synthetic benchmark datasets and real datasets show that the prediction performance of DGCGRN is significantly better than existing models. Furthermore, the case studies on bladder uroepithelial carcinoma and lung cancer cells also illustrate the performance of the proposed model.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蛋白质被认为是必不可少的促进生物体的生存能力,生殖能力,和其他基本生理功能。传统的生物测定的特点是持续时间延长,广泛的劳动力需求,和财务费用,以确定必需的蛋白质。因此,人们普遍认为,采用计算方法是成功识别必需蛋白质的最迅速和有效的方法。尽管是机器学习(ML)应用程序中的热门选择,由于正样本和负样本的高质量训练集的可用性有限,因此不建议将深度学习(DL)方法用于基于序列特征的特定研究工作。然而,一些关于有限的数据可用性的DL工作也在最近执行,这将是我们未来的工作范围。因此,与DL方法相比,由于其优越的性能,因此在这项工作中使用了常规的ML技术。考虑到上述问题,这里提出了一种称为EPI-SF的技术,它使用ML来识别蛋白质-蛋白质相互作用网络(PPIN)中的必需蛋白质。蛋白质序列是蛋白质结构和功能的主要决定因素。所以,最初,从PPIN内的蛋白质中提取相关的蛋白质序列特征。这些特征随后被用作各种机器学习模型的输入,包括XGB增强分类器,AdaBoost分类器,逻辑回归(LR),支持向量分类(SVM),决策树模型(DT),随机森林模型(RF)和朴素贝叶斯模型(NB)。目的是检测PPIN内的必需蛋白。对酵母进行的初步调查检查了酵母PPIN的各种ML模型的性能。在这些模型中,射频模型技术的有效性最高,正如它的精确度所表明的,召回,F1分数,AUC值分别为0.703、0.720、0.711和0.745。与基于传统中心性的其他国家相比,也发现性能更好,例如中间性中心性(BC),接近中心性(CC),等。深度学习方法也像DeepEP,正如结果部分所强调的那样。由于其良好的性能,EPI-SF后来被用于预测人PPIN内部的新型必需蛋白。由于病毒倾向于选择性靶向参与人类PPIN内疾病传播的必需蛋白,进行调查以评估这些蛋白质可能参与COVID-19和其他相关严重疾病。
    Proteins are considered indispensable for facilitating an organism\'s viability, reproductive capabilities, and other fundamental physiological functions. Conventional biological assays are characterized by prolonged duration, extensive labor requirements, and financial expenses in order to identify essential proteins. Therefore, it is widely accepted that employing computational methods is the most expeditious and effective approach to successfully discerning essential proteins. Despite being a popular choice in machine learning (ML) applications, the deep learning (DL) method is not suggested for this specific research work based on sequence features due to the restricted availability of high-quality training sets of positive and negative samples. However, some DL works on limited availability of data are also executed at recent times which will be our future scope of work. Conventional ML techniques are thus utilized in this work due to their superior performance compared to DL methodologies. In consideration of the aforementioned, a technique called EPI-SF is proposed here, which employs ML to identify essential proteins within the protein-protein interaction network (PPIN). The protein sequence is the primary determinant of protein structure and function. So, initially, relevant protein sequence features are extracted from the proteins within the PPIN. These features are subsequently utilized as input for various machine learning models, including XGB Boost Classifier, AdaBoost Classifier, logistic regression (LR), support vector classification (SVM), Decision Tree model (DT), Random Forest model (RF), and Naïve Bayes model (NB). The objective is to detect the essential proteins within the PPIN. The primary investigation conducted on yeast examined the performance of various ML models for yeast PPIN. Among these models, the RF model technique had the highest level of effectiveness, as indicated by its precision, recall, F1-score, and AUC values of 0.703, 0.720, 0.711, and 0.745, respectively. It is also found to be better in performance when compared to the other state-of-arts based on traditional centrality like betweenness centrality (BC), closeness centrality (CC), etc. and deep learning methods as well like DeepEP, as emphasized in the result section. As a result of its favorable performance, EPI-SF is later employed for the prediction of novel essential proteins inside the human PPIN. Due to the tendency of viruses to selectively target essential proteins involved in the transmission of diseases within human PPIN, investigations are conducted to assess the probable involvement of these proteins in COVID-19 and other related severe diseases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    RNA修饰在各种生物过程和疾病中起着至关重要的作用。准确预测RNA修饰位点对于理解其功能至关重要。在这项研究中,我们提出了一种混合方法,该方法将预先训练的序列表示与各种序列特征融合,以在一个组合的预测框架中预测多种类型的RNA修饰。我们开发了MRM-BERT,一种深度学习方法,该方法结合了预训练的DNABERT深度序列表示模块和卷积神经网络(CNN),利用四种传统的序列特征编码来提高预测性能。MRM-BERT在12种常见RNA修饰的多个数据集上进行了评估,包括M6A,m5C,m1A等等。结果表明,对于所有12种类型的RNA修饰,我们的混合模型在接受者工作特征曲线(AUC)下面积方面优于其他模型。MRM-BERT可作为在线工具(http://117.122.208.21:8501)或源代码(https://github.com/abhhba999/MRM-BERT),它允许用户预测RNA修饰位点并可视化结果。总的来说,我们的研究提供了一种有效的方法来预测多种RNA修饰,有助于理解RNA生物学和发展治疗策略。
    RNA modifications play crucial roles in various biological processes and diseases. Accurate prediction of RNA modification sites is essential for understanding their functions. In this study, we propose a hybrid approach that fuses a pre-trained sequence representation with various sequence features to predict multiple types of RNA modifications in one combined prediction framework. We developed MRM-BERT, a deep learning method that combined the pre-trained DNABERT deep sequence representation module and the convolutional neural network (CNN) exploiting four traditional sequence feature encodings to improve the prediction performance. MRM-BERT was evaluated on multiple datasets of 12 commonly occurring RNA modifications, including m6A, m5C, m1A and so on. The results demonstrate that our hybrid model outperforms other models in terms of area under receiver operating characteristic curve (AUC) for all 12 types of RNA modifications. MRM-BERT is available as an online tool (http://117.122.208.21:8501) or source code (https://github.com/abhhba999/MRM-BERT), which allows users to predict RNA modification sites and visualize the results. Overall, our study provides an effective and efficient approach to predict multiple RNA modifications, contributing to the understanding of RNA biology and the development of therapeutic strategies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    热稳定的蛋白质在工业生产中得到了广泛的应用,制药开发,并作为蛋白质工程中高度进化的起点。蛋白质的热稳定性通常由它们的解链温度(Tm)表征。然而,由于实验确定的Tm数据的可用性有限以及现有计算方法在预测Tm时的准确性不足,迫切需要一种计算方法来准确预测嗜热蛋白的Tm值。这里,我们提出了一个基于深度学习的模型,叫做DeepTM,它仅利用蛋白质序列作为输入,并在由7790个嗜热蛋白条目组成的数据集上准确预测目标嗜热蛋白的Tm值。在一组1550个样品上,DeepTM表现出优异的性能,其决定系数(R2)为0.75,皮尔逊相关系数(P)为0.87,均方根误差(RMSE)为6.24℃。我们进一步分析了决定嗜热蛋白热稳定性的序列特征,发现二肽频率,宿主生物的最佳生长温度(OGT),蛋白质的进化信息会显著影响其解链温度。我们将DeepTM的性能与最近报道的方法进行了比较,ProTstab2和DeepSTABp,在预测两个盲测试数据集上的Tm值。一个数据集包括22种PET塑料降解酶,而其他包括29种更广泛分类的热稳定蛋白质。在PET塑料降解酶数据集中,DeepTM达到8.25℃的RMSE。与ProTstab2(20.05℃)和DeepSTABp(20.97℃)相比,DeepTM的RMSE降低了58.85%和60.66%,分别。在热稳定蛋白质的数据集中,与ProTstab2(RMSE=15.87℃)相比,DeepTM(RMSE=7.66℃)显示RMSE降低51.73%。DeepTM,只需要蛋白质序列信息,准确预测熔融温度,实现完全端到端预测过程,从而为进一步的蛋白质工程提供了增强的便利性和权宜之计。
    Thermally stable proteins find extensive applications in industrial production, pharmaceutical development, and serve as a highly evolved starting point in protein engineering. The thermal stability of proteins is commonly characterized by their melting temperature (Tm). However, due to the limited availability of experimentally determined Tm data and the insufficient accuracy of existing computational methods in predicting Tm, there is an urgent need for a computational approach to accurately forecast the Tm values of thermophilic proteins. Here, we present a deep learning-based model, called DeepTM, which exclusively utilizes protein sequences as input and accurately predicts the Tm values of target thermophilic proteins on a dataset consisting of 7790 thermophilic protein entries. On a test set of 1550 samples, DeepTM demonstrates excellent performance with a coefficient of determination (R2) of 0.75, Pearson correlation coefficient (P) of 0.87, and root mean square error (RMSE) of 6.24 ℃. We further analyzed the sequence features that determine the thermal stability of thermophilic proteins and found that dipeptide frequency, optimal growth temperature (OGT) of the host organisms, and the evolutionary information of the protein significantly affect its melting temperature. We compared the performance of DeepTM with recently reported methods, ProTstab2 and DeepSTABp, in predicting the Tm values on two blind test datasets. One dataset comprised 22 PET plastic-degrading enzymes, while the other included 29 thermally stable proteins of broader classification. In the PET plastic-degrading enzyme dataset, DeepTM achieved RMSE of 8.25 ℃. Compared to ProTstab2 (20.05 ℃) and DeepSTABp (20.97 ℃), DeepTM demonstrated a reduction in RMSE of 58.85% and 60.66%, respectively. In the dataset of thermally stable proteins, DeepTM (RMSE=7.66 ℃) demonstrated a 51.73% reduction in RMSE compared to ProTstab2 (RMSE=15.87 ℃). DeepTM, with the sole requirement of protein sequence information, accurately predicts the melting temperature and achieves a fully end-to-end prediction process, thus providing enhanced convenience and expediency for further protein engineering.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    血凝素(HA)通过促进宿主膜与病毒之间的融合来促进病毒进入和感染。鉴于其在流感病毒侵染过程中的重要性,医管局已成为流感药物和疫苗开发的目标。因此,准确识别HA对于开发靶向疫苗药物至关重要。然而,仍然缺乏使用计算机方法鉴定HA。本研究旨在设计一个计算模型来识别HA。
    在这项研究中,从UniProt获得包含106个HA和106个非HA序列的基准数据集。使用各种基于序列的特征来配制样品。通过进行特征优化并输入四种机器学习方法,我们使用堆叠算法构建了一个集成分类器模型。
    该模型实现了95.85%的精度,并且在5倍交叉验证中接收器工作特征(ROC)曲线下的面积为0.9863。在独立测试中,该模型的准确率为93.18%,ROC曲线下面积为0.9793.代码可以从https://github.com/Zouxidan/HA_predict找到。git.该模型具有良好的预测性能。该模型将为生化学者对HA的研究提供便利。
    UNASSIGNED: Hemagglutinin (HA) is responsible for facilitating viral entry and infection by promoting the fusion between the host membrane and the virus. Given its significance in the process of influenza virus infestation, HA has garnered attention as a target for influenza drug and vaccine development. Thus, accurately identifying HA is crucial for the development of targeted vaccine drugs. However, the identification of HA using in-silico methods is still lacking. This study aims to design a computational model to identify HA.
    UNASSIGNED: In this study, a benchmark dataset comprising 106 HA and 106 non-HA sequences were obtained from UniProt. Various sequence-based features were used to formulate samples. By perform feature optimization and inputting them four kinds of machine learning methods, we constructed an integrated classifier model using the stacking algorithm.
    UNASSIGNED: The model achieved an accuracy of 95.85% and with an area under the receiver operating characteristic (ROC) curve of 0.9863 in the 5-fold cross-validation. In the independent test, the model exhibited an accuracy of 93.18% and with an area under the ROC curve of 0.9793. The code can be found from https://github.com/Zouxidan/HA_predict.git. The proposed model has excellent prediction performance. The model will provide convenience for biochemical scholars for the study of HA.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    沉默子是位于基因组上的抑制基因表达的非编码DNA序列片段。特定细胞中沉默子的变异与基因表达和癌症的发展密切相关。完全依赖于DNA序列信息进行沉默子鉴定的计算方法无法考虑沉默子的细胞特异性,导致准确性下降。尽管在基因组上发现了几种与沉默子相关的转录因子和表观遗传修饰,仍然没有明确的生物信号或其组合来完全表征消音器,在选择合适的生物信号进行识别时面临挑战。因此,我们提出了一个复杂的深度学习框架,称为DeepICSH,基于多个生物数据源。具体来说,DeepICSH利用深度卷积神经网络自动捕获与消音器强烈相关的生物相关信号组合,源于各种各样的生物信号。此外,注意力机制的利用促进了这些信号组合的评分和可视化,而跳过连接的使用促进了多级序列特征和信号组合的融合,从而增强了特定细胞内消音器的准确识别。对HepG2和K562细胞系数据集的大量实验表明,DeepICSH在消音器鉴定方面优于最先进的方法。值得注意的是,我们首次引入了基于多组数据的深度学习框架,用于对强弱消音器进行分类,实现良好的性能。总之,DeepICSH在推进复杂疾病中消音器的研究和分析方面显示出巨大的希望。源代码可在https://github.com/lyli1013/DeepICSH获得。
    Silencers are noncoding DNA sequence fragments located on the genome that suppress gene expression. The variation of silencers in specific cells is closely related to gene expression and cancer development. Computational approaches that exclusively rely on DNA sequence information for silencer identification fail to account for the cell specificity of silencers, resulting in diminished accuracy. Despite the discovery of several transcription factors and epigenetic modifications associated with silencers on the genome, there is still no definitive biological signal or combination thereof to fully characterize silencers, posing challenges in selecting suitable biological signals for their identification. Therefore, we propose a sophisticated deep learning framework called DeepICSH, which is based on multiple biological data sources. Specifically, DeepICSH leverages a deep convolutional neural network to automatically capture biologically relevant signal combinations strongly associated with silencers, originating from a diverse array of biological signals. Furthermore, the utilization of attention mechanisms facilitates the scoring and visualization of these signal combinations, whereas the employment of skip connections facilitates the fusion of multilevel sequence features and signal combinations, thereby empowering the accurate identification of silencers within specific cells. Extensive experiments on HepG2 and K562 cell line data sets demonstrate that DeepICSH outperforms state-of-the-art methods in silencer identification. Notably, we introduce for the first time a deep learning framework based on multi-omics data for classifying strong and weak silencers, achieving favorable performance. In conclusion, DeepICSH shows great promise for advancing the study and analysis of silencers in complex diseases. The source code is available at https://github.com/lyli1013/DeepICSH.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    非编码RNA(ncRNA)是一种功能性RNA分子,在各种基本生物过程中起着关键作用。比如基因调控。因此,研究ncRNA与蛋白质之间的联系对于探索ncRNA的功能具有重要意义。尽管现代生物科学家已经开发了许多有效和准确的方法,准确的预测仍然对各种问题构成重大挑战。在我们的方法中,我们利用多头注意力机制来合并剩余连接,允许ncRNA和蛋白质序列特征的自动学习。具体来说,该方法基于多头注意力机制将节点特征投影到多个空间,从而在这些空间中获得不同的特征交互模式。通过堆叠交互层,可以导出高阶交互模式,同时仍然通过剩余连接保留初始特征信息。这种策略有效地利用了ncRNA和蛋白质的序列信息,实现隐藏的高阶特征的捕获。最后的实验结果证明了该方法的有效性,AUC值为97.4%,98.5%,在NPInterv2.0、RPI807和RPI488数据集上实现了94.8%,分别。这些令人印象深刻的结果巩固了我们的方法作为探索ncRNAs和蛋白质之间联系的强大工具。我们已经在GitHub上上传了实现代码:https://github.com/ZZCrazy00/MHAM-NPI。
    Non-coding RNA (ncRNA) is a functional RNA molecule that plays a key role in various fundamental biological processes, such as gene regulation. Therefore, studying the connection between ncRNA and proteins holds significant importance in exploring the function of ncRNA. Although many efficient and accurate methods have been developed by modern biological scientists, accurate predictions still pose a major challenge for various issues. In our approach, we utilize a multi-head attention mechanism to merge residual connections, allowing for the automatic learning of ncRNA and protein sequence features. Specifically, the proposed method projects node features into multiple spaces based on multi-head attention mechanism, thereby obtaining different feature interaction patterns in these spaces. By stacking interaction layers, higher-order interaction modes can be derived, while still preserving the initial feature information through the residual connection. This strategy effectively leverages the sequence information of ncRNA and protein, enabling the capture of hidden high-order features. The final experimental results demonstrate the effectiveness of our method, with AUC values of 97.4%, 98.5%, and 94.8% achieved on the NPInter v2.0, RPI807, and RPI488 datasets, respectively. These impressive results solidify our method as a powerful tool for exploring the connection between ncRNAs and proteins. We have uploaded the implementation code on GitHub: https://github.com/ZZCrazy00/MHAM-NPI.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    CRISPR/Cas9技术能够精确编辑基因组,是近年来各种科学和医学进步的核心。生物医学研究的进展受到阻碍,因为当使用基因组编辑器时,基因组的无意负担-脱靶效应。尽管检测脱靶的实验屏幕已经允许理解Cas9的活性,但由于规则不能很好地推断到新的靶序列,因此该知识仍然不完整。最近开发的脱靶预测工具越来越依赖于机器学习和深度学习技术来可靠地理解可能脱靶的完整威胁,因为驱动Cas9活动的规则尚未完全理解。在这项研究中,我们提出了一种基于计数和基于深度学习的方法来推导序列特征,这些特征对于决定序列中的Cas9活动非常重要。在脱靶确定中存在两个主要挑战-Cas9活性的可能位点的鉴定和Cas9活性在该位点的程度的预测。开发了混合多任务CNN-biLSTM模型,名为CRISP-RCNN,同时预测脱靶和脱靶活动的程度。采用集成梯度和加权内核的方法进行特征重要性逼近,核苷酸和位置偏好分析,并且已经执行了失配容限。
    CRISPR/Cas9 technology is capable of precisely editing genomes and is at the heart of various scientific and medical advances in recent times. The advances in biomedical research are hindered because of the inadvertent burden on the genome when genome editors are employed-the off-target effects. Although experimental screens to detect off-targets have allowed understanding the activity of Cas9, that knowledge remains incomplete as the rules do not extrapolate well to new target sequences. Off-target prediction tools developed recently have increasingly relied on machine learning and deep learning techniques to reliably understand the complete threat of likely off-targets because the rules that drive Cas9 activity are not fully understood. In this study, we present a count-based as well as deep-learning-based approach to derive sequence features that are important in deciding on Cas9 activity at a sequence. There are two major challenges in off-target determination-the identification of a likely site of Cas9 activity and the prediction of the extent of Cas9 activity at that site. The hybrid multitask CNN-biLSTM model developed, named CRISP-RCNN, simultaneously predicts off-targets and the extent of activity on off-targets. Employing methods of integrated gradients and weighting kernels for feature importance approximation, analysis of nucleotide and position preference, and mismatch tolerance have been performed.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    为解决自然环境干扰造成的路面裂缝检测精度低的问题,本文设计了一个轻量级的检测框架PCDETR(路面裂缝检测检测器)网络,基于卷积特征与序列特征的融合,提出了一种高效的路面裂缝检测方法。首先,将可扩展的Swin-Transformer网络和剩余网络作为骨干网络的两个并行通道,提取路面裂缝的长序列全局特征和潜在的视觉局部特征,分别,它们被串联和融合以丰富提取的特征信息。然后,对变压器检测框架的编码器和解码器进行了优化;利用集合预测可以直接获得路面裂缝的位置和类别信息,它提供了一种低代码的方法来降低实现的复杂性。研究结果表明,该方法在COCO数据集上的平均精度(AP)达到45.8%,显着高于DETR及其变体模型条件DETR的AP值分别为36.9%和42.8%,分别。在自收集的路面裂缝数据集上,拟议方法的AP达到45.6%,这比MaskR-CNN(基于区域的卷积神经网络)高3.8%,比FasterR-CNN高8.8%。因此,该方法是一种有效的路面裂缝检测算法。
    To solve the problem of low accuracy of pavement crack detection caused by natural environment interference, this paper designed a lightweight detection framework named PCDETR (Pavement Crack DEtection TRansformer) network, based on the fusion of the convolution features with the sequence features and proposed an efficient pavement crack detection method. Firstly, the scalable Swin-Transformer network and the residual network are used as two parallel channels of the backbone network to extract the long-sequence global features and the underlying visual local features of the pavement cracks, respectively, which are concatenated and fused to enrich the extracted feature information. Then, the encoder and decoder of the transformer detection framework are optimized; the location and category information of the pavement cracks can be obtained directly using the set prediction, which provided a low-code method to reduce the implementation complexity. The research result shows that the highest AP (Average Precision) of this method reaches 45.8% on the COCO dataset, which is significantly higher than that of DETR and its variants model Conditional DETR where the AP values are 36.9% and 42.8%, respectively. On the self-collected pavement crack dataset, the AP of the proposed method reaches 45.6%, which is 3.8% higher than that of Mask R-CNN (Region-based Convolution Neural Network) and 8.8% higher than that of Faster R-CNN. Therefore, this method is an efficient pavement crack detection algorithm.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    哺乳动物进化保守的信号传导中间体Toll途径(ECSIT)是一种重要的细胞内蛋白,涉及先天免疫,胚胎发生,以及线粒体复合物I的组装或稳定性。在本研究中,ECSIT的特征是鱼(Lizahematocheila)。鱼ECSIT的全长cDNA为1860bp,编码449个氨基酸。MulletECSIT与其teleost同行共享60.4%~78.2%的序列身份。两个保守的蛋白质结构域,ECSIT域和C末端域,是在乌鱼ECSIT中发现的。实时qPCR分析显示,鱼ECSIT分布在所有检查的组织中,在脾脏中高表达,头肾(香港)和ill。进一步分析表明,在无乳链球菌感染后6h至48h,脾脏中的鱼ECSIT上调。此外,免疫共沉淀(co-IP)试验证实,MulletECSIT可与肿瘤坏死因子受体相关因子6(TRAF6)相互作用.分子对接表明,极性相互作用和疏水相互作用在ECSIT-TRAF6复合物的形成中起着至关重要的作用。参与ECSIT和TRAF6相互作用的乌鱼ECSIT的残基是Arg107,Glu113,Phe114,Glu124,Lys120和Lys121,它们主要位于ECSIT域。我们的结果表明,鱼ECSIT参与针对细菌的免疫防御和通过与TRAF6相互作用调节TLRs信号通路。据我们所知,这是关于鱼ECSIT的第一份报告,从而加深了对ECSIT及其在硬骨鱼免疫应答中作用的认识。
    Mammalian evolutionary conserved signaling intermediate in Toll pathways (ECSIT) is an important intracellular protein that involves in innate immunity, embryogenesis, and assembly or stability of the mitochondrial complex I. In the present study, the ECSIT was characterized in soiny mullet (Liza haematocheila). The full-length cDNA of mullet ECSIT was 1860 bp, encoding 449 amino acids. Mullet ECSIT shared 60.4%∼78.2% sequence identities with its teleost counterparts. Two conserved protein domains, ECSIT domain and C-terminal domain, were found in mullet ECSIT. Realtime qPCR analysis revealed that mullet ECSIT was distributed in all examined tissues with high expressions in spleen, head kidney (HK) and gill. Further analysis showed that mullet ECSIT in spleen was up-regulated from 6 h to 48 h after Streptococcus dysgalactiae infection. In addition, the co-immunoprecipitation (co-IP) assay confirmed that mullet ECSIT could interact with tumor necrosis factor receptor-associated factor 6 (TRAF6). Molecular docking revealed that the polar interaction and hydrophobic interaction play crucial roles in the forming of ECSIT-TRAF6 complex. The resides of mullet ECSIT that involved in the interaction between ECSIT and TRAF6 were Arg107, Glu113, Phe114, Glu124, Lys120 and Lys121, which mainly located in the ECSIT domain. Our results demonstrated that mullet ECSIT involved in the immune defense against bacterial and regulation of TLRs signaling pathway by interaction with TRAF6. To the best of our knowledge, this is the first report on ECSIT of soiny mullet, which deepen the understanding of ECSIT and its functions in the immune response of teleosts.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号