LR

LR
  • 文章类型: Journal Article
    这项研究旨在开发一种利用临床血液标志物的人工智能模型,超声数据,和乳腺活检病理信息来预测乳腺癌患者的远处转移。
    利用了两个医疗中心的数据,临床血液标志物,超声数据,分别提取和选择乳腺活检病理信息。使用Spearman相关和LASSO回归进行特征降维。使用LR和LightGBM机器学习算法构建预测模型,并在内部和外部验证集上进行验证。对两个模型进行了特征相关性分析。
    LR模型在训练中获得了0.892、0.816和0.817的AUC值,内部验证,和外部验证队列,分别。LightGBM模型在相同的队列中获得了0.971、0.861和0.890的AUC值,分别。临床决策曲线分析显示,LightGBM模型在预测乳腺癌远处转移方面优于LR模型。鉴定的关键特征包括肌酸激酶同工酶(CK-MB)和α-羟基丁酸脱氢酶。
    这项研究使用临床血液标志物开发了一种人工智能模型,超声数据,和病理信息来识别乳腺癌患者的远处转移。LightGBM模型表现出优越的预测准确性和临床适用性,表明它是乳腺癌远处转移的早期诊断工具。
    UNASSIGNED: This study aims to develop an artificial intelligence model utilizing clinical blood markers, ultrasound data, and breast biopsy pathological information to predict the distant metastasis in breast cancer patients.
    UNASSIGNED: Data from two medical centers were utilized, Clinical blood markers, ultrasound data, and breast biopsy pathological information were separately extracted and selected. Feature dimensionality reduction was performed using Spearman correlation and LASSO regression. Predictive models were constructed using LR and LightGBM machine learning algorithms and validated on internal and external validation sets. Feature correlation analysis was conducted for both models.
    UNASSIGNED: The LR model achieved AUC values of 0.892, 0.816, and 0.817 for the training, internal validation, and external validation cohorts, respectively. The LightGBM model achieved AUC values of 0.971, 0.861, and 0.890 for the same cohorts, respectively. Clinical decision curve analysis showed a superior net benefit of the LightGBM model over the LR model in predicting distant metastasis in breast cancer. Key features identified included creatine kinase isoenzyme (CK-MB) and alpha-hydroxybutyrate dehydrogenase.
    UNASSIGNED: This study developed an artificial intelligence model using clinical blood markers, ultrasound data, and pathological information to identify distant metastasis in breast cancer patients. The LightGBM model demonstrated superior predictive accuracy and clinical applicability, suggesting it as a promising tool for early diagnosis of distant metastasis in breast cancer.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    创伤性脑损伤(TBI)是年轻人死亡的主要原因,并且以其高死亡率和高发病率而闻名。本文旨在预测TBI患者的24h生存率。
    本次分析共涉及1224个样品,涉及的临床指标包括年龄,性别,血压,MGAP和其他字段,其中目标变量是“结果”,这是一个二进制变量。本文主要涉及的方法包括数据可视化分析,单因素分析,特征工程分析,随机森林模型(RF),K-近邻(KNN)模型,等等。Logistic回归模型(LR)和深度神经网络模型(DNN)。我们将使用SMOTE方法对训练集进行过采样,因为样本本身的标记非常不平衡。
    尽管所有模型的准确性都很高,召回率相对较低。性能最好的DNN模型仅达到0.17,对应的AUC为0.80。重新采样后,我们发现所有模型的阳性样本的召回率都提高了很多,但一些模型的AUC有所下降。最后,最优模型是LR,其阳性样本召回率为0.67,AUC为0.82。
    通过重采样,我们得到了最好的模型是射频模型,其召回率和AUC最好,且AUC水平约为0.87,说明模型的精度表现仍较好。
    UNASSIGNED: Traumatic brain injury (TBI) is the major reason for the death of young people and is well known for its high mortality and morbidity. This paper aim to predict the 24h survival of patients with TBI.
    UNASSIGNED: A total of 1224 samples were involved in this analysis, and the clinical indicators involved included age, gender, blood pressure, MGAP and other fields, among which the target variable was \"outcome\", which was a binary variable. The methods mainly involved in this paper include data visualization analysis, single factor analysis, feature engineering analysis, random forest model (RF), K-Nearst Neighbors (KNN) model, and so on. Logistic regression model (LR) and deep neural network model (DNN). We will oversample the training set using the SMOTE method because of the very unbalanced labeling of the sample itself.
    UNASSIGNED: Although the accuracy of all models is very high, the recall rate is relatively low. The DNN model with the best performance only reaches 0.17, and the corresponding AUC is 0.80. After resampling, we find that the recall rate of positive samples of all models has increased a lot, but the AUC of some models has decreased. Finally, the optimal model is LR, whose positive sample recall rate is 0.67 and AUC is 0.82.
    UNASSIGNED: Through resampling, we obtained that the best model is the RF model, whose recall rate and AUC are the best, and the AUC level is about 0.87, indicating that the accuracy performance of the model is still good.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在这项研究中,我们专注于使用胰腺来源的微阵列基因数据来检测糖尿病。使用降维(DR)技术来减少高维微阵列基因数据。像贝塞尔函数这样的DR方法,离散余弦变换(DCT),最小二乘线性回归(LSLR),并使用人工藻类算法(AAA)。随后,我们应用元启发式算法,如蜻蜓优化算法(DOA)和大象羊群优化算法(EHO)进行特征选择。分类器,如非线性回归(NLR),线性回归(LR),高斯混合模型(GMM)期望最大值(EM),贝叶斯线性判别分类器(BLDC),Logistic回归(LoR),Softmax判别分类器(SDC),以及具有三种类型内核的支持向量机(SVM),线性,多项式,和径向基函数(RBF),被用来检测糖尿病。分类器的性能是根据精度等参数进行分析的,F1得分,MCC,错误率,FM度量,还有Kappa.如果没有功能选择,SVM(RBF)分类器使用AAADR方法实现了90%的高准确率。使用AAADR方法进行EHO特征选择的SVM(RBF)分类器优于其他分类器,准确率为95.714%。分类器性能精度的提高强调了特征选择方法的作用。
    In this study, we focused on using microarray gene data from pancreatic sources to detect diabetes mellitus. Dimensionality reduction (DR) techniques were used to reduce the dimensionally high microarray gene data. DR methods like the Bessel function, Discrete Cosine Transform (DCT), Least Squares Linear Regression (LSLR), and Artificial Algae Algorithm (AAA) are used. Subsequently, we applied meta-heuristic algorithms like the Dragonfly Optimization Algorithm (DOA) and Elephant Herding Optimization Algorithm (EHO) for feature selection. Classifiers such as Nonlinear Regression (NLR), Linear Regression (LR), Gaussian Mixture Model (GMM), Expectation Maximum (EM), Bayesian Linear Discriminant Classifier (BLDC), Logistic Regression (LoR), Softmax Discriminant Classifier (SDC), and Support Vector Machine (SVM) with three types of kernels, Linear, Polynomial, and Radial Basis Function (RBF), were utilized to detect diabetes. The classifier\'s performance was analyzed based on parameters like accuracy, F1 score, MCC, error rate, FM metric, and Kappa. Without feature selection, the SVM (RBF) classifier achieved a high accuracy of 90% using the AAA DR methods. The SVM (RBF) classifier using the AAA DR method for EHO feature selection outperformed the other classifiers with an accuracy of 95.714%. This improvement in the accuracy of the classifier\'s performance emphasizes the role of feature selection methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    肝炎是地球上最致命的疾病之一。机器学习方法可以基于一些特征来诊断肝炎疾病。在UCI数据集上,作者评估了不同分类器的性能,以制定肝炎疾病诊断的系统策略。使用的分类器是支持向量机,逻辑回归(LR),K-最近邻,和随机森林。分类器在没有类别平衡的情况下使用,并使用SMOTE策略与类别平衡结合使用。两项研究,没有类平衡和类平衡的分类,在不同的性能参数方面进行了比较。采用类平衡后,分类器的效率明显提高。具有SMOTE的LR提供最高水平的准确度(93.18%)。
    Hepatitis is among the deadliest diseases on the planet. Machine learning approaches can contribute toward diagnosing hepatitis disease based on a few characteristics. On the UCI dataset, authors assessed distinct classifiers\' performance in order to develop a systematic strategy for hepatitis disease diagnosis. The classifiers used are support vector machine, logistic regression (LR), K-nearest neighbor, and random forest. The classifiers were employed without class balancing and in conjunction with class balancing using SMOTE strategy. Both studies, classification without class balancing and with class balancing, were compared in terms of different performance parameters. After adopting class balancing, the efficiency of classifiers improved significantly. LR with SMOTE provided the highest level of accuracy (93.18%).
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基于机器学习(ML)的分类模型广泛用于使用各种生理信号(例如心电图(ECG),心磁图(MCG),心音(HS),和阻抗心动图(ICG)信号。然而,基于ECG的HD识别是临床医生最常用的一种。在目前的调查中,已对ECG记录或受试者进行了采样,并将其用作分类模型的输入,以区分正常和异常患者。该研究采用了不平衡数量的ECG样本来训练各种分类模型。少数机器学习方法,如支持向量机(SVM),逻辑回归(LR),和自适应增强(AdaBoost)已经选择了很少用于HD检测。已在准确性方面评估了开发模型的性能,F1分数,和使用公开提供的受试者的ECG信号的曲线下面积(AUC)值(PTB-ECG,MIT-BIH)数据集。已基于这些性能指标分配了模型的排名,并且发现AdaBoost和LR分类器处于第一和第二位置。这两个模型已经基于多数投票原则进行了整合,并且还确定了该整合模型的性能度量。是的,总的来说,观察到所提出的集成模型在准确性方面展示了PTB-ECG数据集的0.946、0.949和0.951以及MIT-BIH数据集的0.921、0.926和0.950的最佳HD检测性能,F1分数,AUC,分别。所提出的方法也可以用于使用ICG对HD进行分类,MCG,和HS信号作为输入。Further,所提出的方法也可以应用于其他疾病的检测。
    The machine learning (ML)-based classification models are widely utilized for the automated detection of heart diseases (HDs) using various physiological signals such as electrocardiogram (ECG), magnetocardiography (MCG), heart sound (HS), and impedance cardiography (ICG) signals. However, ECG-based HD identification is the most common one used by clinicians. In the current investigation, the ECG records or subjects have been sampled and are used as inputs to the classification model to distinguish between normal and abnormal patients. The study has employed an imbalanced number of ECG samples for training the various classification models. Few ML methods such as support vector machine (SVM), logistic regression (LR), and adaptive boosting (AdaBoost) which have been rarely used for HD detection have been selected. The performance of the developed model has been evaluated in terms of accuracy, F1-score, and area under curve (AUC) values using ECG signals of subjects given in publicly available (PTB-ECG, MIT-BIH) datasets. Ranking of the models has been assigned based on these performance metrics and it is found that the AdaBoost and LR classifiers stand in first and second positions. These two models have been ensembled based on the majority voting principle and the performance measure of this ensemble model has also been determined. It is, in general, observed that the proposed ensemble model demonstrates the best HD detection performance of 0.946, 0.949, and 0.951 for the PTB-ECG dataset and 0.921, 0.926, and 0.950 for the MIT-BIH dataset in terms of accuracy, F1-score, and AUC, respectively. The proposed methodology can also be employed for the classification of HD using ICG, MCG, and HS signals as inputs. Further, the proposed methodology can also be applied to the detection of other diseases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    我们提出了一种基于加速度计和音频信号的自动非侵入性检测咳嗽事件的方法。加速度信号是由固定在患者床上的智能手机捕获的,使用其集成的加速度计。音频信号由同一智能手机使用外部麦克风同时捕获。我们已经编译了一个手动注释的数据集,其中包含来自14名成年男性患者的大约6000个咳嗽和68000个非咳嗽事件的这种同时捕获的加速度和音频信号。Logistic回归(LR),支持向量机(SVM)和多层感知器(MLP)分类器提供了一个基线,并与三种深度架构进行了比较,卷积神经网络(CNN)长短期记忆(LSTM)网络,以及使用留一交叉验证方案的基于残差的体系结构(Resnet50)。我们发现,可以使用加速度或音频信号来区分咳嗽和其他活动,包括打喷嚏,清嗓子,和运动在床上的高精度。然而,在所有情况下,深度神经网络的性能明显优于浅层分类器,而Resnet50提供了最佳性能,对于加速度和音频信号,ROC曲线下面积(AUC)分别超过0.98和0.99。虽然基于音频的分类始终比基于加速的分类提供更好的性能,我们观察到,对于最好的系统,差异非常小。由于加速度信号需要较少的处理能力,由于录制音频的需要被回避,因此隐私本质上是安全的,并且由于记录设备连接到床上并且没有磨损,基于加速度计的高精度非侵入性咳嗽检测器可以代表长期咳嗽监测中更方便和更容易接受的方法。
    We present an automatic non-invasive way of detecting cough events based on both accelerometer and audio signals. The acceleration signals are captured by a smartphone firmly attached to the patient\'s bed, using its integrated accelerometer. The audio signals are captured simultaneously by the same smartphone using an external microphone. We have compiled a manually-annotated dataset containing such simultaneously-captured acceleration and audio signals for approximately 6000 cough and 68000 non-cough events from 14 adult male patients. Logistic regression (LR), support vector machine (SVM) and multilayer perceptron (MLP) classifiers provide a baseline and are compared with three deep architectures, convolutional neural network (CNN), long short-term memory (LSTM) network, and residual-based architecture (Resnet50) using a leave-one-out cross-validation scheme. We find that it is possible to use either acceleration or audio signals to distinguish between coughing and other activities including sneezing, throat-clearing, and movement on the bed with high accuracy. However, in all cases, the deep neural networks outperform the shallow classifiers by a clear margin and the Resnet50 offers the best performance, achieving an area under the ROC curve (AUC) exceeding 0.98 and 0.99 for acceleration and audio signals respectively. While audio-based classification consistently offers better performance than acceleration-based classification, we observe that the difference is very small for the best systems. Since the acceleration signal requires less processing power, and since the need to record audio is sidestepped and thus privacy is inherently secured, and since the recording device is attached to the bed and not worn, an accelerometer-based highly accurate non-invasive cough detector may represent a more convenient and readily accepted method in long-term cough monitoring.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    This study aimed to determine the risk factors of ischemic/hemorrhagic stroke in patients suffering moyamoya disease (MMD), as well as to compare the effects of six analysis methods.
    In the present retrospective study, the data originated from the database of Jiang Xi Province Medical Big Data Engineering & Technology Research Center. In addition, the information of patients with MMD that were admitted to the second affiliated hospital of Nanchang university from January 1st, 2012 to December 31st, 2019 was acquired. Six different machine learning methods were adopted to build the models, and XGboost, Logistic regression (LR) and Support vector machine (SVM) models were adopted to determine the risk factors of ischemic/hemorrhagic stroke in patients with MMD because of their excellent performance. Next, the effects of the built models were compared and validated in internal and independent external validation sets. The external validation set involving 204 cases from January 1st, 2018 to December 31st, 2019.
    On the whole, 790 patients with MMD were screened, i.e., 397 patients with cerebral infarction and 393 patients with cerebral hemorrhage. In the internal validation set, XGboost model exhibited significant discrimination (AUC>0.75), with its area under the curve (AUC) reaching 0.874 (95% CI: 0.859, 0.889). Compared with the LR and SVM models, the XGboost model in the internal validation set achieved the improved accuracy by 3.2% and 3.1%, respectively, whereas no significant difference was identified.
    XGboost model could be more efficient in analyzing the risk factors of ischemic/hemorrhagic stroke in patients with MMD; the risk factors of hemorrhagic stroke in MMD might be closely related to Suzuki stages, presence of an aneurysm, rural residence, hospitalization times and age of onset.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    对使用大规模平行测序(MPS)技术对传统常染色体STR标记进行分型的兴趣日益增加,这引发了有关通过概率基因分型对结果进行解释的多个问题。为了开始解决其中的一些问题,我们检查了使用不同程度的序列信息的影响,预过滤,和数据建模,以在概率基因分型软件中解释复杂的MPS-STR混合物。对于两到四个贡献者的混合物,60个ForenSeq分型结果为:1)使用三种单独的格式表示,这些格式捕获了不同程度的序列信息,和2)在概率解释之前使用三种不同的过滤方法进行分析。随后根据十个参考概况解释了不同格式和过滤变体的所有混合物。使用定性(LRmix)和定量(EuroForMix)模型计算似然比(LR)。与常规毛细管电泳重复单元(RU)相比,当STR命名法基于最长的不间断延伸(LUS)时,LR结果表明信息增益适中,而当利用完整的序列信息时,额外的增益非常小。与动态(基于百分比)阈值相比,使用静态分析阈值进行数据预过滤改进了LRs,因为静态阈值防止了对源自次要贡献者的等位基因的过度过滤。对于使用定量模型执行的解释,如果采用口吃模型而不是使用口吃阈值预过滤数据,则可以观察到性能的微小改进,而正如预期的那样,当口吃没有预先过滤时,在定性模型下性能会大大恶化。鉴于本研究中的经验和理论发现,我们讨论了使用MPS系统利用序列级信息和潜在路径来增加信息增益的价值。
    The increased interest in the use of Massively Parallel Sequencing (MPS) technologies to type traditional autosomal STR markers raises multiple questions regarding interpretation of the results via probabilistic genotyping. To begin to address some of those questions, we examined the effects of using differing degrees of sequence information, pre-filtering, and data modeling to interpret complex MPS-STR mixtures in a probabilistic genotyping software. Sixty ForenSeq typing results for mixtures of from two to four contributors were: 1) represented using three separate formats that captured different degrees of sequence information, and 2) were analyzed using three different filtering approaches prior to probabilistic interpretation. All mixtures for the different format and filtering variants were subsequently interpreted with respect to ten reference profiles, using both qualitative (LRmix) and quantitative (EuroForMix) models to calculate the likelihood ratio (LR). The LR results indicated moderate information gain when the STR nomenclature was based upon the longest uninterrupted stretch (LUS) compared with conventional capillary electrophoresis repeat units (RU), whereas additional gains were very small when the complete sequence information was utilised. Use of a static analytical threshold for data pre-filtering improved LRs compared to a dynamic (percentage-based) threshold, as the static threshold prevented excessive filtering of alleles originating from minor contributors. For interpretations performed using a quantitative model, a small improvement in performance was observed if a stutter model was employed instead of using stutter thresholds to pre-filter the data, whereas - as expected - performance worsened considerably under the qualitative model when stutter was not pre-filtered. Given the empirical and theoretical findings in this study we discuss the value of utilizing sequence-level information and potential paths forward to increase information gain using MPS systems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    BACKGROUND: Basal cell carcinoma (BCC) recurrence and metastatic rates are known to be very low. The risk factors for these rare outcomes are subsequently not well studied.
    OBJECTIVE: To identify risk factors independently associated with local recurrence (LR) and metastasis and/or death (M/D) in large (≥2 cm) BCC.
    METHODS: BCCs histologically confirmed between 2000 and 2009 were retrospectively screened for tumor diameter at 2 academic centers. Medical records of all large BCCs and an equal number of randomly selected small BCCs were reviewed for LR and M/D.
    RESULTS: Included were 248 large BCC and 248 small BCC tumors. Large BCCs had a significantly higher risk of LR and M/D than small BCCs (LR: 8.9% vs 0.8%, P < .001; M/D: 6.5% vs. 0%, P < .001). Because the risks were so low in small BCCs, they were excluded from further analysis. On multivariable logistic regression, head/neck location (odds ratio [OR], 9.7; 95% confidence interval [CI], 3.0-31.3) and depth beyond fat (OR, 3.1; 95% CI, 1.0-9.6) were associated with LR in large BCCs. Risk of LR was lower with Mohs micrographic surgery (OR, 0.14; 95% CI, 0.04-0.5). Head/neck location (OR, 5.3; 95% CI, 1.2-23.2), tumor diameter ≥4 cm (OR, 11.9; 95% CI, 2.4-59.4), and depth beyond fat (OR, 28.6; 95% CI, 6.7-121) were significant predictors of M/D in large BCCs.
    CONCLUSIONS: Retrospective cohort design.
    CONCLUSIONS: Large BCCs, particularly those with additional risk factors, have a high enough risk of recurrence and metastasis to warrant further investigation to optimize management.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

  • 文章类型: Journal Article
    High-grade bone osteosarcoma has a high relapse rate. The best treatment of local recurrence (LR) is under discussion. The aim of this study is to analyze LR patterns and factors prognostic for survival.
    LR diagnostic modality (clinical or imaging), pattern of recurrence, and post-LR survival (PLRS) were assessed.
    Sixty-two patients were identified, with median age 21 years (range, 9-75 years), including 11 (18%) ≤15 years, 30 (48%) from 16 to 29 years; 21 (34%) were older. Patterns of relapse were LR only 58%, LR + distant metastases (DM) 42%. Seventy-nine percent of patients relapsed within 24 months, and diagnosis was clinical in 88%. LR treatment was surgery 85%, chemotherapy 55%, chemotherapy + surgery 45%. Surgical complete remission after LR (CR2) was achieved in 60% (LR 86%; LR + DM 23%). With a median follow-up of 43 months (range, 5-235 months), the five-year PLRS was 37%, significantly better for patients with longer LR-free interval (LRFI; ≤24 months 31% vs > 24 months 61.5%, P = 0.03), absence of DM (no DM 56% vs DM 11.5%, P = 0.0001), and achievement of CR2 (no CR2 0% vs CR2 58.5%, P = 0.0001). No difference was found according to age and chemotherapy (LR only: five-year PLRS: 53% without chemotherapy vs 58% with chemotherapy, P = 0.9; LR + DM: five-year PLRS: 25% without chemotherapy vs 9% with chemotherapy, P = 0.7).
    Early relapse is detected by symptoms in 90% of cases and associated with worse outcome. The achievement of CR2, not age, is crucial for survival. For patients with LR only, better survival was demonstrated, as compared with DM, and no improvement with chemotherapy after surgery was found.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

公众号