Supervised learning

监督学习
  • 文章类型: Journal Article
    最常见的精神疾病之一是重度抑郁症(MDD),这增加了自杀意念或过早死亡的可能性。在言语流畅性任务(VFT)期间通过功能近红外光谱(fNIRS)检测到的异常额叶血液动力学变化有可能用作评估临床症状的客观指标。然而,针对表现出抑郁症症状的个体的全面定量和客观评估工具仍未开发。从包含289名MDD患者和178名健康对照的大规模数据集中的467个样本中提取。在整个VFT中获得fNIRS测量值。为了识别独特的MDD生物标志物,本研究引入了一种从fNIRS信号中提取时空特征的数据表示方法,随后被用作潜在的预测因子。机器学习分类器(例如,实施梯度增强决策树(GBDT)和多层感知器)以评估预测所选特征的能力。交叉验证的平均值和标准偏差表明,GBDT模型,当与180特征图案组合时,以最有效的方式区分MDD患者与健康对照组。测试集的正确分类的准确性为0.829±0.053,AUC为0.895(95%CI:0.864-0.925),灵敏度为0.914±0.051。使用Shapley加法解释方法识别对MDD识别做出最重要贡献的渠道,位于额极区和背外侧前额叶皮层,以及三角形Broca区。在MDD中VFT期间异常前额叶活动的评估充当客观可测量的生物标志物,其可用于评估认知缺陷并促进MDD的早期筛查。本研究中建议的模型可应用于大规模病例对照fNIRS数据集,以检测MDD的独特特征,并为临床医生提供客观的基于生物标志物的分析仪器,以协助评估可疑病例。
    One of the most prevalent psychiatric disorders is major depressive disorder (MDD), which increases the probability of suicidal ideation or untimely demise. Abnormal frontal hemodynamic changes detected by functional near-infrared spectroscopy (fNIRS) during verbal fluency task (VFT) have the potential to be used as an objective indicator for assessing clinical symptoms. However, comprehensive quantitative and objective assessment instruments for individuals who exhibit symptoms suggestive of depression remain undeveloped. Drawing from a total of 467 samples in a large-scale dataset comprising 289 MDD patients and 178 healthy controls, fNIRS measurements were obtained throughout the VFT. To identify unique MDD biomarkers, this research introduced a data representation approach for extracting spatiotemporal features from fNIRS signals, which were subsequently utilized as potential predictors. Machine learning classifiers (e.g., Gradient Boosted Decision Trees (GBDT) and Multilayer Perceptron) were implemented to assess the ability to predict selected features. The mean and standard deviation of the cross-validation indicated that the GBDT model, when combined with the 180-feature pattern, distinguishes patients with MDD from healthy controls in the most effective manner. The accuracy of correct classification for the test set was 0.829 ± 0.053, with an AUC of 0.895 (95 % CI: 0.864-0.925) and a sensitivity of 0.914 ± 0.051. Channels that made the most important contribution to the identification of MDD were identified using Shapley Additive Explanations method, located in the frontopolar area and the dorsolateral prefrontal cortex, as well as pars triangularis Broca\'s area. Assessment of abnormal prefrontal activity during the VFT in MDD serves as an objectively measurable biomarker that could be utilized to evaluate cognitive deficits and facilitate early screening for MDD. The model suggested in this research could be applied to large-scale case-control fNIRS datasets to detect unique characteristics of MDD and offer clinicians an objective biomarker-based analytical instrument to assist in the evaluation of suspicious cases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    由于硬件配置的变化,多变量校准模型在外推校准仪器方面经常遇到挑战。信号处理算法,或环境条件。已经开发了校准传递技术来缓解这个问题。在这项研究中,我们介绍了一种称为监督因子分析转移(SFAT)的新方法,旨在实现稳健和可解释的校准转移。SFAT从概率框架运行,并将响应变量集成到其传输过程中,以有效地将目标仪器的数据与源仪器的数据对齐。在SFAT模型中,来自源仪器的数据,目标仪器,并且响应变量被共同投影到一组共享的潜在变量上。这些潜在变量作为三个不同领域之间信息传递的管道,从而促进有效的光谱转移。此外,SFAT明确建模与每个变量相关的噪声方差,从而最大限度地减少非信息噪声的传输。此外,我们提供了经验证据,展示了SFAT在三个真实世界数据集的有效性,在校准转移方案中展示其卓越的性能。
    Multivariate calibration models often encounter challenges in extrapolating beyond the calibration instruments due to variations in hardware configurations, signal processing algorithms, or environmental conditions. Calibration transfer techniques have been developed to mitigate this issue. In this study, we introduce a novel methodology known as Supervised Factor Analysis Transfer (SFAT) aimed at achieving robust and interpretable calibration transfer. SFAT operates from a probabilistic framework and integrates response variables into its transfer process to effectively align data from the target instrument to that of the source instrument. Within the SFAT model, the data from the source instrument, the target instrument, and the response variables are collectively projected onto a shared set of latent variables. These latent variables serve as the conduit for information transfer between the three distinct domains, thereby facilitating effective spectra transfer. Moreover, SFAT explicitly models the noise variances associated with each variable, thereby minimizing the transfer of non-informative noise. Furthermore, we provide empirical evidence showcasing the efficacy of SFAT across three real-world datasets, demonstrating its superior performance in calibration transfer scenarios.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    点云配准是计算机视觉和图形学中的一项基本任务,广泛应用于三维重建,对象跟踪,和图集重建。基于学习的优化和深度学习方法由于其自身独特的优势在成对配准中得到了广泛的发展。深度学习方法提供了更大的灵活性,可以注册未经训练的看不见的点云。基于学习的优化方法在处理各种扰动下的配准时表现出增强的鲁棒性和稳定性,比如噪音,异常值,和闭塞。为了利用这两种方法的优势来实现更短的耗时,健壮,和多个实例的稳定注册,在本文中,我们提出了一种新的计算框架,称为SGRTmreg,用于多个成对注册。SGRTmreg框架利用三个组件-搜索方案,一种基于学习的优化方法,称为基于图的加权判别优化(GRDO),传输模块实现多实例点云配准。给定要匹配的实例集合,作为目标点云的模板,和一个实例作为源点云,搜索方案从集合中选择一个与源非常相似的点云。然后,GRDO通过将源与目标对齐来学习一系列回归变量,而传输模块存储并应用学习的回归量以将所选择的点云与目标对齐并估计所选择的点云的变换。总之,SGRTmreg利用回归量的共享序列将多个点云注册到目标点云。我们对各种数据集进行了广泛的注册实验,以评估所提出的框架。实验结果表明,SGRTmreg以更高的精度实现了多个成对配准,鲁棒性,和稳定性比国家的最先进的深度学习和传统的注册方法。
    Point cloud registration is a fundamental task in computer vision and graphics, which is widely used in 3D reconstruction, object tracking, and atlas reconstruction. Learning-based optimization and deep learning methods have been widely developed in pairwise registration due to their own distinctive advantages. Deep learning methods offer greater flexibility and enable registering unseen point clouds that are not trained. Learning-based optimization methods exhibit enhanced robustness and stability when handling registration under various perturbations, such as noise, outliers, and occlusions. To leverage the strengths of both approaches to achieve a less time-consuming, robust, and stable registration for multiple instances, we propose a novel computational framework called SGRTmreg for multiple pairwise registrations in this paper. The SGRTmreg framework utilizes three components-a Searching scheme, a learning-based optimization method called Graph-based Reweighted discriminative optimization (GRDO), and a Transfer module to achieve multi-instance point cloud registration.Given a collection of instances to be matched, a template as a target point cloud, and an instance as a source point cloud, the searching scheme selects one point cloud from the collection that closely resembles the source. GRDO then learns a sequence of regressors by aligning the source to the target, while the transfer module stores and applies the learned regressors to align the selected point cloud to the target and estimate the transformation of the selected point cloud. In short, SGRTmreg harnesses a shared sequence of regressors to register multiple point clouds to a target point cloud. We conduct extensive registration experiments on various datasets to evaluate the proposed framework. The experimental results demonstrate that SGRTmreg achieves multiple pairwise registrations with higher accuracy, robustness, and stability than the state-of-the-art deep learning and traditional registration methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    随着人工智能技术的出现,机器学习算法已广泛应用于疾病预测领域。
    心血管疾病(CVD)严重危害全球人类健康,因此需要建立有效的CVD预测模型,对控制疾病风险、保障人群身心健康具有重要意义。
    以UCI心脏病数据集为例,最初,构建了单一的机器学习预测模型。随后,六种方法,如皮尔逊,卡方,RFE和LightGBM被综合用于特征筛选。在基本分类器的基础上,进行了软投票融合和堆叠融合,建立了心血管疾病的预测模型,从而实现对高危人群的早期预警和疾病干预。为了解决数据不平衡问题,采用SMOTE方法对数据集进行处理,并利用多维度、多指标对模型的预测效果进行了分析。
    在单分类器模型中,MLP算法对预处理后的心脏病数据集进行了优化。选择功能后,五个特点消除。ENSEM_SV算法结合了基分类器,通过对分类器结果的软投票来确定预测结果,在精度等五个指标上实现了最优值,Jaccard_Score,哈姆_损失,AUC,等。,AUC值达到0.951。RF,ET,GBDT,在由基本分类器组成的第一阶段子模型中采用了LGB算法。选择AB算法作为第二阶段模型,和集成算法ENSEM_ST,通过两个阶段的堆叠融合获得的,在精度等7个指标上表现出最佳性能,灵敏度,F1_Score,Mathew_Corcoef,等。,AUC达到0.952。此外,比较了基于不同训练集占用率的算法分类效果。结果表明,两种融合模型的预测性能均优于单一模型。ENSEM_ST融合的整体效果强于ENSEM_SV融合。
    本研究中建立的融合模型在很大程度上提高了模型的整体分类精度和稳定性。在CVD诊断的预测分析中具有很好的应用价值,为疾病诊断和干预策略提供有价值的参考。
    UNASSIGNED: With the advent of artificial intelligence technology, machine learning algorithms have been widely used in the area of disease prediction.
    UNASSIGNED: Cardiovascular disease (CVD) seriously jeopardizes human health worldwide, thereby needing the establishment of an effective CVD prediction model that can be of great significance for controlling the risk of the disease and safeguarding the physical and mental health of the population.
    UNASSIGNED: Considering the UCI heart disease dataset as an example, initially, a single machine learning prediction model was constructed. Subsequently, six methods such as Pearson, chi-squared, RFE and LightGBM were comprehensively used for the feature screening. On the basis of the base classifiers, Soft Voting fusion and Stacking fusion was carried out to build a prediction model for cardiovascular diseases, in order to realize an early warning and disease intervention for high-risk populations. To address the data imbalance problem, the SMOTE method was adopted to process the data set, and the prediction effect of the model was analyzed using multi-dimensional and multi-indicators.
    UNASSIGNED: In the single classifier model, the MLP algorithm performed optimally on the preprocessed heart disease dataset. After feature selection, five features eliminated. The ENSEM_SV algorithm that combines the base classifiers to determine the prediction results by soft voting on the results of the classifiers achieved the optimal value on five metrics such as Accuracy, Jaccard_Score, Hamm_Loss, AUC, etc., and the AUC value reached 0.951. The RF, ET, GBDT, and LGB algorithms were employed in the first stage sub-model composed of base classifiers. The AB algorithm was selected as the second stage model, and the ensemble algorithm ENSEM_ST, obtained by Stacking fusion of the two stages exhibited the best performance on 7 indicators such as Accuracy, Sensitivity, F1_Score, Mathew_Corrcoef, etc., and the AUC reached 0.952. Furthermore, a comparison of the algorithms\' classification effects based on different training set occupancy was carried out. The results indicated that the prediction performance of both the fusion models was better than the single models, and the overall effect of ENSEM_ST fusion was stronger than the ENSEM_SV fusion.
    UNASSIGNED: The fusion model established in this study improved the overall classification accuracy and stability of the model to a significant extent. It has a good application value in the predictive analysis of CVD diagnosis, and can provide a valuable reference in the disease diagnosis and intervention strategies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:已确定异柠檬酸脱氢酶(IDH)突变状态在神经胶质瘤分层和预后中的作用。虽然结构磁共振图像(MRI)是一种有前途的生物标志物,它可能不足以进行IDH突变状态的非侵入性表征.我们研究了基于卷积神经网络(CNN)和支持向量机(SVM)的深度影像组学方法增强的组合扩散张量成像(DTI)和结构MRI的诊断价值,确定中枢神经系统世界卫生组织(CNSWHO)2-4级胶质瘤中IDH突变状态。
    方法:这项回顾性研究分析了DTI衍生的分数各向异性(FA)和平均扩散率(MD)图像以及包括流体衰减反转恢复(FLAIR)的结构图像,非增强型T1-,和206例初治神经胶质瘤的T2加权图像,包括146个IDH突变体和60个IDH野生型突变体。由经验丰富的神经放射科医生手动分割病变,并将面罩应用于FA和MD图。通过应用预先训练的CNN和统计描述从每个受试者中提取深度影像组学特征。应用SVM分类器使用成像特征结合人口统计学数据来预测IDH状态。
    结果:我们比较评估了CNN-SVM分类器在预测IDH突变状态方面的性能,使用独立的和组合的结构和基于DTI的成像特征。联合成像特征超过了预测IDH突变状态的独立模式[曲线下面积(AUC)=0.846;灵敏度=0.925;和特异性=0.567]。重要的是,在结构和DTI成像特征中加入了人口统计学数据(患者年龄)[曲线下面积(AUC)=0.847;敏感性=0.911;特异性=0.617]后,模型表现最佳.
    结论:来自基于DTI的FA和MD图结合结构MRI的成像特征,具有优于独立结构或DTI序列提供的诊断价值。结合人口统计信息,该CNN-SVM模型为胶质瘤中IDH突变状态提供了进一步增强的非侵入性预测.
    BACKGROUND: The role of isocitrate dehydrogenase (IDH) mutation status for glioma stratification and prognosis is established. While structural magnetic resonance image (MRI) is a promising biomarker, it may not be sufficient for non-invasive characterisation of IDH mutation status. We investigated the diagnostic value of combined diffusion tensor imaging (DTI) and structural MRI enhanced by a deep radiomics approach based on convolutional neural networks (CNNs) and support vector machine (SVM), to determine the IDH mutation status in Central Nervous System World Health Organization (CNS WHO) grade 2-4 gliomas.
    METHODS: This retrospective study analyzed the DTI-derived fractional anisotropy (FA) and mean diffusivity (MD) images and structural images including fluid attenuated inversion recovery (FLAIR), non-enhanced T1-, and T2-weighted images of 206 treatment-naïve gliomas, including 146 IDH mutant and 60 IDH-wildtype ones. The lesions were manually segmented by experienced neuroradiologists and the masks were applied to the FA and MD maps. Deep radiomics features were extracted from each subject by applying a pre-trained CNN and statistical description. An SVM classifier was applied to predict IDH status using imaging features in combination with demographic data.
    RESULTS: We comparatively assessed the CNN-SVM classifier performance in predicting IDH mutation status using standalone and combined structural and DTI-based imaging features. Combined imaging features surpassed stand-alone modalities for the prediction of IDH mutation status [area under the curve (AUC) = 0.846; sensitivity = 0.925; and specificity = 0.567]. Importantly, optimal model performance was noted following the addition of demographic data (patients\' age) to structural and DTI imaging features [area under the curve (AUC) = 0.847; sensitivity = 0.911; and specificity = 0.617].
    CONCLUSIONS: Imaging features derived from DTI-based FA and MD maps combined with structural MRI, have superior diagnostic value to that provided by standalone structural or DTI sequences. In combination with demographic information, this CNN-SVM model offers a further enhanced non-invasive prediction of IDH mutation status in gliomas.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    森林火灾威胁着全球生态系统,社会经济结构,和公共安全。准确评估森林火灾敏感性对于有效的环境管理至关重要。监督学习方法主导了这一评估,依靠大量的森林火灾发生数据集进行模型训练。然而,获取精确的森林火灾位置数据仍然具有挑战性。为了解决这个问题,半监督学习成为一种可行的解决方案,利用一组有限的收集样本和包含环境因素的未标记数据进行训练。我们的研究采用了转换支持向量机(TSVM),一种关键的半监督学习方法,在样本有限的情况下评估森林火灾敏感性。我们进行了比较分析,根据广泛使用的监督学习方法评估其性能。森林火灾敏感性评估区位于大余县,江西省,中国,以其广阔的森林覆盖和频繁的火灾事件而闻名。我们分析并生成了描绘森林火灾敏感性的地图,评估各种小样本场景中监督和半监督学习方法的预测精度(例如,4、8、12、16、20、24、28和32个样品)。我们的发现表明,与有限样本的监督学习相比,TSVM表现出更高的预测准确性。产生更合理的森林火灾敏感性图。例如,在样本大小为4、16和28时,TSVM的预测精度分别约为0.8037、0.9257和0.9583。相比之下,随机森林,监督学习中表现最好的人,分别显示大约0.7424、0.8916和0.9431的精度,对于相同的小样本量。此外,我们讨论了三个关键方面:TSVM参数配置,未标记样本量的影响,和典型样本量内的性能。与森林火灾敏感性评估和制图的监督学习相比,我们的发现支持半监督学习作为一种有前途的方法。特别是在小样本量的情况下。
    Forest fires threaten global ecosystems, socio-economic structures, and public safety. Accurately assessing forest fire susceptibility is critical for effective environmental management. Supervised learning methods dominate this assessment, relying on a substantial dataset of forest fire occurrences for model training. However, obtaining precise forest fire location data remains challenging. To address this issue, semi-supervised learning emerges as a viable solution, leveraging both a limited set of collected samples and unlabeled data containing environmental factors for training. Our study employed the transductive support vector machine (TSVM), a key semi-supervised learning method, to assess forest fire susceptibility in scenarios with limited samples. We conducted a comparative analysis, evaluating its performance against widely used supervised learning methods. The assessment area for forest fire susceptibility lies in Dayu County, Jiangxi Province, China, renowned for its vast forest cover and frequent fire incidents. We analyzed and generated maps depicting forest fire susceptibility, evaluating prediction accuracies for both supervised and semi-supervised learning methods across various small sample scenarios (e.g., 4, 8, 12, 16, 20, 24, 28, and 32 samples). Our findings indicate that TSVM exhibits superior prediction accuracy compared to supervised learning with limited samples, yielding more plausible forest fire susceptibility maps. For instance, at sample sizes of 4, 16, and 28, TSVM achieves prediction accuracies of approximately 0.8037, 0.9257, and 0.9583, respectively. In contrast, random forests, the top performers in supervised learning, demonstrate accuracies of approximately 0.7424, 0.8916, and 0.9431, respectively, for the same small sample sizes. Additionally, we discussed three key aspects: TSVM parameter configuration, the impact of unlabeled sample size, and performance within typical sample sizes. Our findings support semi-supervised learning as a promising approach compared to supervised learning for forest fire susceptibility assessment and mapping, particularly in scenarios with small sample sizes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    车辆中挥发性有机化合物(VOC)的排放是一个重大问题,造成难闻的气味。为了减轻车辆中的挥发性有机化合物和气味,选择气味和VOC排放低的内饰零件至关重要。然而,流行的气味评价方法是主观的,昂贵的,并可能对评估人员的健康有害。在这项研究中,我们分析了139个汽车内饰件和92个车辆,建立一个具有成本效益的,气味评价的数据驱动方法。苯的含量,甲苯,乙苯,二甲苯,苯乙烯,甲醛,乙醛,丙烯醛,采用热脱附气相色谱-质谱(TD-GC/MS)和高效液相色谱-紫外检测器(HPLC-UV)检测总挥发性有机物(TVOC)。专业气味评估人员评估气味,确定内部零件的强度等级为2.0至4.5,整车的强度等级为2.5至3.5。利用这些数据,我们应用了四种监督学习算法来开发内部零件和整车气味强度的预测模型。在模型训练期间,我们为人工神经网络(ANN)和卷积神经网络-双向长短期记忆(CNN-BiLSTM)模型实现了早期停止技术,同时使用GridSearch算法优化支持向量机(SVM)和极限梯度提升(XGBoost)模型。评估结果表明,CNN-BiLSTM模型表现最好,在0.5的气味强度水平内,未知样品的平均准确度为89%。均方根误差(RMSE)为0.24,平均绝对误差(MAE)为0.08。该模型还经历了七倍交叉验证,达到83.43%的准确率。此外,我们采用沙普利加法扩张(SHAP)对模型进行了解释性分析,这证实了每种VOC的气味贡献与人类嗅觉规则的一致性。通过监督学习预测基于挥发性有机化合物的气味,这项研究降低了成本,提高了各种车辆内饰气味评估的效率和适用性。
    Emissions of volatile organic compounds (VOCs) in vehicles represent a significant problem, causing unpleasant odors. To mitigate VOCs and odors in vehicles, it is critical to choose interior parts with low odor and VOC emissions. However, prevailing odor evaluation methods are subjective, costly, and potentially harmful to the health of evaluators. In this study, we analyzed 139 automotive interior parts and 92 vehicles, establishing a cost-effective, data-driven method for odor evaluation. The contents of benzene, toluene, ethylbenzene, xylene, styrene, formaldehyde, acetaldehyde, acrolein, and total volatile organic compounds (TVOC) were detected by thermal desorption gas chromatography-mass spectrometry (TD-GC/MS) and high-performance liquid chromatography with an ultraviolet detector (HPLC-UV). Professional odor evaluators assessed the odors, identifying intensity levels from 2.0 to 4.5 in interior parts and 2.5 to 3.5 in whole vehicles. Leveraging this data, we applied four supervised learning algorithms to develop predictive models for the odor intensity of both interior parts and entire vehicles. During model training, we implemented early stopping techniques for the artificial neural network (ANN) and convolutional neural network-bidirectional long short-term memory (CNN-BiLSTM) models, while optimizing the support vector machine (SVM) and extreme gradient boosting (XGBoost) models using the GridSearch algorithm. The evaluation results reveal that the CNN-BiLSTM model performs the best, achieving an average accuracy of 89% for unknown samples within an odor intensity level of 0.5. The root mean square error (RMSE) is 0.24, and the mean absolute error (MAE) is 0.08. The model also underwent a sevenfold cross-validation, achieving an accuracy of 83.43%. Additionally, we employed SHapley Additive exPlanations (SHAP) for the interpretative analysis of the model, which confirmed the consistency of each VOC\'s odor contribution with human olfactory rules. By predicting odors based on VOCs through supervised learning, this study reduces the costs and enhances the efficiency and applicability of odor assessment across various vehicle interiors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    标记医学图像是一项艰巨且昂贵的任务,需要临床专业知识和大量合格图像。样本不足会导致训练过程中的欠拟合和监督学习模型的性能差。在这项研究中,我们旨在开发基于SimCLR的半监督学习框架,根据NICE分类对结直肠肿瘤进行分类.首先,所提出的框架是在使用大型未标记数据集的自监督学习下进行训练的;随后,根据NICE分类对有限的标记数据集进行了微调.该模型在独立的数据集上进行了评估,并与基于监督迁移学习和内窥镜医师使用准确性的模型进行了比较。马修相关系数(MCC),和科恩的卡帕。最后,应用Grad-CAM和t-SNE对模型解释进行可视化。ResNet支持的SimCLR模型(精度为0.908,MCC为0.862,Cohen的kappa为0.896)优于基于监督迁移学习的模型(均值:0.803、0.698和0.742)和初级内窥镜医师(0.816、0.724和0.863),而表现仅比高级内窥镜医师稍差(0.916、0.875和0.944)。此外,t-SNE在SimCLR中通过自监督学习比通过监督迁移学习显示出更好的三元样本聚类。与传统的监督学习相比,半监督学习使深度学习模型能够利用有限的标记内窥镜图像实现改进的性能。
    Labelling medical images is an arduous and costly task that necessitates clinical expertise and large numbers of qualified images. Insufficient samples can lead to underfitting during training and poor performance of supervised learning models. In this study, we aim to develop a SimCLR-based semi-supervised learning framework to classify colorectal neoplasia based on the NICE classification. First, the proposed framework was trained under self-supervised learning using a large unlabelled dataset; subsequently, it was fine-tuned on a limited labelled dataset based on the NICE classification. The model was evaluated on an independent dataset and compared with models based on supervised transfer learning and endoscopists using accuracy, Matthew\'s correlation coefficient (MCC), and Cohen\'s kappa. Finally, Grad-CAM and t-SNE were applied to visualize the models\' interpretations. A ResNet-backboned SimCLR model (accuracy of 0.908, MCC of 0.862, and Cohen\'s kappa of 0.896) outperformed supervised transfer learning-based models (means: 0.803, 0.698, and 0.742) and junior endoscopists (0.816, 0.724, and 0.863), while performing only slightly worse than senior endoscopists (0.916, 0.875, and 0.944). Moreover, t-SNE showed a better clustering of ternary samples through self-supervised learning in SimCLR than through supervised transfer learning. Compared with traditional supervised learning, semi-supervised learning enables deep learning models to achieve improved performance with limited labelled endoscopic images.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    深度学习算法在显微图像翻译中得到了广泛的应用。根据配对数据的可用性,可以通过有监督或无监督学习来训练相应的数据驱动模型。然而,一般情况是数据只大致配对,这样监督学习可能由于数据不对齐而无效,而无监督学习将不太理想,因为没有利用大致配对的信息。在这项工作中,我们提出了一个统一的框架(U-Frame),通过引入可以根据数据错位程度自动调整的公差大小来统一监督和无监督学习。与全局采样规则的实施一起,我们证明了U-Frame在无数图像翻译应用程序中,在所有级别的数据未对齐(即使是完美对齐的图像对)中,始终优于有监督和无监督学习。包括伪光学切片,虚拟组织学染色(与癌症诊断的临床评估),改善信噪比或分辨率,和荧光标记的预测,有可能成为图像翻译的新标准。
    Deep learning algorithms have been widely used in microscopic image translation. The corresponding data-driven models can be trained by supervised or unsupervised learning depending on the availability of paired data. However, general cases are where the data are only roughly paired such that supervised learning could be invalid due to data unalignment, and unsupervised learning would be less ideal as the roughly paired information is not utilized. In this work, we propose a unified framework (U-Frame) that unifies supervised and unsupervised learning by introducing a tolerance size that can be adjusted automatically according to the degree of data misalignment. Together with the implementation of a global sampling rule, we demonstrate that U-Frame consistently outperforms both supervised and unsupervised learning in all levels of data misalignments (even for perfectly aligned image pairs) in a myriad of image translation applications, including pseudo-optical sectioning, virtual histological staining (with clinical evaluations for cancer diagnosis), improvement of signal-to-noise ratio or resolution, and prediction of fluorescent labels, potentially serving as new standard for image translation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    道路是交通运输的基本要素,连接城市和农村地区,以及人们的生活和工作。它们在地图更新等各个领域发挥着重要作用,经济发展,旅游,和灾害管理。从高分辨率遥感图像中自动提取道路特征一直是遥感领域的热点和挑战性课题,近年来,深度学习网络模型被广泛应用于遥感图像中的道路提取。鉴于此,本文系统地回顾和总结了基于深度学习的高分辨率遥感图像道路自动提取技术。回顾了深度学习网络模型在道路提取任务中的应用,并将这些模型分类为完全监督学习,半监督学习,以及基于标签使用的弱监督学习。最后,对当前深度学习技术在道路提取中的发展进行了总结和展望。
    Roads are the fundamental elements of transportation, connecting cities and rural areas, as well as people\'s lives and work. They play a significant role in various areas such as map updates, economic development, tourism, and disaster management. The automatic extraction of road features from high-resolution remote sensing images has always been a hot and challenging topic in the field of remote sensing, and deep learning network models are widely used to extract roads from remote sensing images in recent years. In light of this, this paper systematically reviews and summarizes the deep-learning-based techniques for automatic road extraction from high-resolution remote sensing images. It reviews the application of deep learning network models in road extraction tasks and classifies these models into fully supervised learning, semi-supervised learning, and weakly supervised learning based on their use of labels. Finally, a summary and outlook of the current development of deep learning techniques in road extraction are provided.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号