脓毒症,危及生命的综合症,仍然是全球范围内的重大公共卫生问题。唾液酸化是影响多种细胞表面的热势标记。然而,唾液酸化和败血症相关基因的作用尚未得到充分探索。从开放存取数据库GEO获得大量RNA-seq数据集(GSE66099和GSE65682)。通过在大量RNA-seq数据上使用R包“ConsensusClusterPlus”来将败血症样品分类为亚型。通过应用R包“limma”和单变量回归分析来辨别集线器基因,使用R包“survminer”进行风险评分的计算。确定最佳学习方法并构建预后模型,我们使用了21种不同的机器学习组合,并显示了这些组合的C指数排名结果。ROC曲线,时间依赖性ROC曲线,和Kaplan-Meier曲线用于评估模型的诊断准确性。R包“ESTIMATE”和“GSVA”用于定量每个样品中免疫细胞浸润的分数。利用14个预后相关唾液酸化基因将大量RNA-seq样品分类为两种不同的脓毒症亚型。总共20个差异表达基因(DEGs)被鉴定为与脓毒症和唾液酸化之间的关系相关。RSF用于鉴定重要性得分高于0.01的关键基因。九个hub基因(SLA2A1,TMCC2,TFRC,RHAG,FKBP1B,KLF1,PILRA,ARL4A,选择重要性值大于0.01的GYPA)用于构建预后模型。这项研究为脓毒症和唾液酸化之间的关系提供了一些理解。此外,它包含一个可能发展成为脓毒症诊断生物标志物的预测模型.
Sepsis, a life-threatening syndrome, continues to be a significant public health issue worldwide. Sialylation is a hot potential marker that affects the surface of a variety of cells. However, the role of genes related to sialylation and sepsis has not been fully explored. Bulk RNA-seq data sets (GSE66099 and GSE65682) were obtained from the open-access databases GEO. The classification of sepsis samples into subtypes was achieved by employing the R package \"ConsensusClusterPlus\" on the bulk RNA-seq data. Hub genes were discerned through the application of the R package \"limma\" and univariate regression analysis, with the calculation of risk scores carried out using the R package \"survminer\". To identify the best learning method and construct a prognostic model, we used 21 different combinations of machine learning, and C-index ranking results of these combinations have been showed. ROC curves, time-dependent ROC curves, and Kaplan-Meier curves were utilized to evaluate the diagnostic accuracy of the model. The R packages \"ESTIMATE\" and \"GSVA\" were employed to quantify the fractions of immune cell infiltration in each sample. The bulk RNA-seq samples were categorized into two distinct sepsis subtypes utilizing 14 prognosis-related sialylation genes. A total of 20 differentially expressed genes (DEGs) were identified as being associated with the relationship between sepsis and sialylation. The RSF was used to identify key genes with importance scores higher than 0.01. The nine hub genes (SLA2A1, TMCC2, TFRC, RHAG, FKBP1B, KLF1, PILRA, ARL4A, and GYPA) with the importance values greater than 0.01 was selected for constructing the prognostic model. This research offers some understanding of the relationship between sepsis and sialylation. Besides, it contains one predictive model that might develop into diagnostic biomarkers for sepsis.