LASSO regression

LASSO 回归
  • 文章类型: Journal Article
    背景:早期识别顺铂诱导的肾毒性(CIN)高危个体对于避免CIN和改善预后至关重要。在这项研究中,我们基于一般临床数据开发并验证了aCIN预测模型,实验室适应症,肺癌患者化疗前的遗传特征。
    方法:我们回顾性纳入了2019年6月至2021年6月使用铂类化疗方案的696例肺癌患者作为使用绝对收缩和选择算子(LASSO)回归构建预测模型的追踪集,交叉验证,和Akaike的信息准则(AIC)来选择重要变量。我们前瞻性选择了2021年7月至2022年12月的283名独立肺癌患者作为测试集,以评估模型的性能。
    结果:预测模型显示出良好的判别和校准,AUC分别为0.9217和0.8288,灵敏度分别为79.89%和45.07%,特异性为94.48%和94.81%,分别在训练集和测试集中。临床决策曲线分析表明,当风险阈值在0.1和0.9之间时,该模型具有临床应用价值。以0.5到0.75的召回间隔显示的精度-召回(PR)曲线:随着召回的增加,精度逐渐下降,到0.9。
    结论:基于实验室和人口统计学变量的预测模型可以作为识别CIN高危人群的有益补充工具。
    BACKGROUND: Early identification of high-risk individuals with cisplatin-induced nephrotoxicity (CIN) is crucial for avoiding CIN and improving prognosis. In this study, we developed and validated a CIN prediction model based on general clinical data, laboratory indications, and genetic features of lung cancer patients before chemotherapy.
    METHODS: We retrospectively included 696 lung cancer patients using platinum chemotherapy regimens from June 2019 to June 2021 as the traing set to construct a predictive model using Absolute shrinkage and selection operator (LASSO) regression, cross validation, and Akaike\'s information criterion (AIC) to select important variables. We prospectively selected 283 independent lung cancer patients from July 2021 to December 2022 as the test set to evaluate the model\'s performance.
    RESULTS: The prediction model showed good discrimination and calibration, with AUCs of 0.9217 and 0.8288, sensitivity of 79.89% and 45.07%, specificity of 94.48% and 94.81%, in the training and test sets respectively. Clinical decision curve analysis suggested that the model has value for clinical use when the risk threshold ranges between 0.1 and 0.9. Precision-Recall (PR) curve shown in recall interval from 0.5 to 0.75: precision gradually declines with increasing Recall, up to 0.9.
    CONCLUSIONS: Predictive models based on laboratory and demographic variables can serve as a beneficial complementary tool for identifying high-risk populations with CIN.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项研究的主要目的是使用数据驱动的方法分析饮食模式,并探索肾结石疾病(KSD)的预防或危险饮食因素。在上海郊区的成年人(n=6396)中进行了病例对照匹配研究。食物频率问卷被用来评估各类食物的消费量,用B超鉴别肾结石。采用主成分分析和回归方法生成膳食模式,并进一步探讨膳食模式与KSD的关系。使用LASSO回归和选择后推断来识别与KSD最相关的食物组。在男性中,“平衡但不含糖的饮料模式”(OR=0.78,p<0.05)和“坚果和泡菜模式”(OR=0.84,p<0.05)是保护性饮食模式。在女性中,“高蔬菜和低糖饮料模式”(OR=0.83,p<0.05)和“高甲壳类和低蔬菜模式”(OR=0.79,p<0.05)是保护性饮食模式,而“偏爱肉类的综合模式”(OR=1.06,p<0.05)和“含糖饮料模式”(OR=1.16,p<0.05)是风险饮食模式。我们进一步推断含糖饮料(p<0.05)是危险因素,咸菜(p<0.05)和甲壳类动物(p<0.05)是保护因素。
    The main objective of this study was to analyze dietary patterns using data-driven approaches and to explore preventive or risk dietary factors for kidney stone disease (KSD). A case-control matching study was conducted in adults (n = 6396) from a suburb of Shanghai. A food frequency questionnaire was used to assess the consumption of various types of food, and B-ultrasound was used to identify kidney stones. Principal component analysis and regression were used to generate dietary patterns and further explore the relationship between dietary patterns and KSD. LASSO regression and post-selection inference were used to identify food groups most associated with KSD. Among males, the \"balanced but no-sugary-beverages pattern\" (OR = 0.78, p < 0.05) and the \"nuts and pickles pattern\" (OR = 0.84, p < 0.05) were protective dietary patterns. Among females, \"high vegetables and low-sugary-beverages pattern\" (OR = 0.83, p < 0.05) and \"high-crustaceans and low-vegetables pattern\" (OR = 0.79, p < 0.05) were protective dietary patterns, while the \"comprehensive pattern with a preference for meat\" (OR = 1.06, p < 0.05) and \"sugary beverages pattern\" (OR = 1.16, p < 0.05) were risk dietary patterns. We further inferred that sugary beverages (p < 0.05) were risk factors and pickles (p < 0.05) and crustaceans (p < 0.05) were protective factors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    发展性阅读障碍(DD)在最近几十年被普遍认为是一种多因素心理障碍。然而,阅读和学习环境研究,影响中国发展性阅读障碍(DD)的社会和人口因素仍然很少。本研究旨在探讨与出生前后DD相关的多维家庭影响因素。
    汕头共招募了60名阅读障碍学生和252名2-5级的正常小学生,中国。使用最小绝对收缩和选择算子(LASSO)回归模型进行社会和人口统计学变量筛选。通过多变量逻辑回归模型估计DD与相关因素之间关联的几率(OR)和95%置信区间(CI)。
    通过LASSO回归,我们最终确定了13个关键变量,包括产妇教育水平和家庭月收入,在其他人中。Logistic回归分析显示,母亲受教育程度较低的儿童发生DD的风险较高。与一致的父母教养方式相反,不同的父母教养方式可能是发展DD的风险因素(OR=4.93,95CI:1.11-21.91)。母亲在怀孕期间营养不良的儿童更容易发生DD(OR=10.31,95CI:1.84-37.86),以及每天在家中接触二手烟(OR=5.33,95CI:1.52-18.66)。有趣的是,儿童的主动阅读(OR=0.26,95CI:0.08-0.84;OR=0.17,95CI:0.04-0.76表示“有时”和“经常”与无相比,分别),有课外阅读童话书的孩子(OR=0.37,95CI:0.15-0.90),和有课外阅读作文书籍的儿童(OR=0.25,95CI:0.09-0.69)是DD的显着保护因素。
    家庭阅读环境,几个教育,社会计量和人口统计学因素可能会影响阅读障碍的发展。我们应该注意这些因素对阅读障碍的发展,从而提供良好的社会和家庭环境,以确保儿童的健康发展。
    UNASSIGNED: Developmental dyslexia (DD) has been generally recognized as a multifactorial psychological disorder in recent decades. However, studies on reading and learning environment, social and demographic factors affecting Chinese developmental dyslexia (DD) are still scarce in China. This study aims to explore multidimensional home influencing factors associated with DD before and after birth.
    UNASSIGNED: A total of 60 dyslexic and 252 normal elementary school students graded 2-5 were recruited in Shantou, China. The Least Absolute Shrinkage and Selection Operator (LASSO) regression model was used for the social and demographic variables screening. Odds ratios (ORs) with 95 % confidence intervals (CIs) for associations between DD and related factors were estimated by multivariate logistic regression models.
    UNASSIGNED: Through LASSO regression, we ultimately identified 13 key variables, including maternal education level and family monthly income, among others. The logistic regression analyses showed that the risk of DD was higher in children with lower maternal education levels. Divergent parenting styles may be a risk factor for developing DD as opposed to consistent parenting styles (OR = 4.93, 95%CI: 1.11-21.91). Children whose mothers suffered from malnutrition during pregnancy were more likely to develop DD (OR = 10.31, 95%CI: 1.84-37.86), as well as exposure to second-hand smoking at home every day (OR = 5.33, 95%CI: 1.52-18.66). Interestingly, children\'s active reading (OR = 0.26, 95%CI: 0.08-0.84; OR = 0.17, 95%CI: 0.04-0.76 for \"sometimes\" and \"often\" compared to none, respectively), children having extracurricular reading fairy tale books (OR = 0.37, 95%CI: 0.15-0.90), and children having extracurricular reading composition books (OR = 0.25, 95%CI: 0.09-0.69) were significant protective factors for DD.
    UNASSIGNED: Home reading environment, several educational, sociometric and demographic factors may influence the development of dyslexia. We should pay attention to these factors on the development of dyslexia, so as to provide the well social and familial environment to ensure the healthy development of children.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    UNASSIGNED:尽管已确定了小耳症的各种危险因素,这些研究的局限性,然而,是风险因素没有相互比较或来自不同领域的估计。我们的研究旨在通过横向和meachine学习统计方法揭示哪些因素应优先用于预防和干预非综合征性小耳畸形。
    UNASSIGNED:在2017-2019年期间,在上海第九人民医院就诊的293对1:1匹配的非综合征性小耳症病例和对照纳入本研究。测量了四个领域的39个风险因素(即,父母的社会人口统计学特征,产妇妊娠史,父母的健康状况和生活方式,和父母的环境和职业暴露)。进行了Lasso回归模型和多变量条件逻辑回归模型,以确定四个领域中微观结构的主要预测因子。曲线下面积(AUC)用于计算预测概率。
    未经评估:套索回归确定了八个预测因子,包括异常妊娠史,生殖系统感染,致畸药物的使用,补充叶酸,父系慢性病史,父母接触室内装修,父亲职业暴露于噪声和母亲急性呼吸道感染。通过多因素条件逻辑回归模型确定的其他预测因素是孕产妇年龄和孕产妇职业暴露于重金属。从条件逻辑回归和套索回归中选择的预测因子均产生0.83(0.79-0.86)的AUC(95%CIs)。
    UNASSIGNED:这项研究的结果表明,无论应用何种统计方法,多个领域的一些因素都是非综合征性微耳症的关键驱动因素。这些因素可用于生成进一步观察和临床研究的假设,并指导预防和干预策略。
    UNASSIGNED: Although a wide range of risk factors for microtia were identified, the limitation of these studies, however, is that risk factors were not estimated in comparison with one another or from different domains. Our study aimed to uncover which factors should be prioritized for the prevention and intervention of non-syndromic microtia via tranditonal and meachine-learning statistical methods.
    UNASSIGNED: 293 pairs of 1:1 matched non-syndromic microtia cases and controls who visited Shanghai Ninth People\'s Hospital were enrolled in the current study during 2017-2019. Thirty-nine risk factors across four domains were measured (i.e., parental sociodemographic characteristics, maternal pregnancy history, parental health conditions and lifestyles, and parental environmental and occupational exposures). Lasso regression model and multivariate conditional logistic regression model were performed to identify the leading predictors of microtia across the four domains. The area under the curve (AUC) was used to calculate the predictive probabilities.
    UNASSIGNED: Eight predictors were identified by the lasso regression, including abnormal pregnancy history, genital system infection, teratogenic drugs usage, folic acid supplementation, paternal chronic conditions history, parental exposure to indoor decoration, paternal occupational exposure to noise and maternal acute respiratory infection. The additional predictors identified by the multivariate conditional logistic regression model were maternal age and maternal occupational exposure to heavy metal. Predictors selected from the conditional logistic regression and lasso regression both yielded AUCs (95% CIs) of 0.83 (0.79-0.86).
    UNASSIGNED: The findings from this study suggest some factors across multiple domains are key drivers of non-syndromic microtia regardless of the applied statistical methods. These factors could be used to generate hypotheses for further observational and clinical studies on microtia and guide the prevention and intervention strategies for microtia.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:这项研究调查了参与18个月行为减肥干预的严重精神疾病患者体重减轻的预测因素,使用Lasso回归选择最强大的预测因子。
    方法:分析ACHIEVE试验干预组的数据,对患有严重精神疾病的成年人进行为期18个月的行为减肥干预。Lasso回归用于确定干预时间跨度中至少5磅体重减轻的预测因子。一旦确定了预测因子,我们创建了分类树,以显示如何根据基线和干预期间的特征将参与者分类为可能结果的示例.
    结果:分析的样本包含137名参与者。从基线到18个月,71例(51.8%)个体的净重减轻至少5磅。Lasso回归选择从基线到6个月的体重减轻作为至少5磅18个月体重减轻的主要预测指标,标准化系数为0.51(95%CI:-0.37,1.40)。在回归中还选择了其他三个变量,但增加了最小的预测能力。
    结论:本文的分析表明,在干预期间逐步跟踪体重减轻作为总体体重减轻指标的重要性。以及使用临床试验中常见的其他变量预测长期体重减轻的挑战。本文使用的方法还举例说明了如何有效地分析包含许多变量的临床试验数据集并确定与期望结果相关的因素。
    OBJECTIVE: This study investigates predictors of weight loss among individuals with serious mental illness participating in an 18-month behavioral weight loss intervention, using Lasso regression to select the most powerful predictors.
    METHODS: Data were analyzed from the intervention group of the ACHIEVE trial, an 18-month behavioral weight loss intervention in adults with serious mental illness. Lasso regression was employed to identify predictors of at least five-pound weight loss across the intervention time span. Once predictors were identified, classification trees were created to show examples of how to classify participants into having likely outcomes based on characteristics at baseline and during the intervention.
    RESULTS: The analyzed sample contained 137 participants. Seventy-one (51.8%) individuals had a net weight loss of at least five pounds from baseline to 18 months. The Lasso regression selected weight loss from baseline to 6 months as a primary predictor of at least five pound 18-month weight loss, with a standardized coefficient of 0.51 (95% CI: -0.37, 1.40). Three other variables were also selected in the regression but added minimal predictive ability.
    CONCLUSIONS: The analyses in this paper demonstrate the importance of tracking weight loss incrementally during an intervention as an indicator for overall weight loss, as well as the challenges in predicting long-term weight loss with other variables commonly available in clinical trials. The methods used in this paper also exemplify how to effectively analyze a clinical trial dataset containing many variables and identify factors related to desired outcomes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号