Self-supervised learning

自监督学习
  • 文章类型: Journal Article
    背景:个人感应,利用从生态环境中患者的可穿戴设备被动和近乎连续地收集的数据,是监测情绪障碍(MD)的有希望的范例,全球疾病负担的主要决定因素。然而,收集和注释可穿戴数据是资源密集型的。因此,这种研究通常只能招募几十名患者。这构成了将现代监督机器学习技术应用于MD检测的主要障碍之一。
    目的:在本文中,我们克服了这一数据瓶颈,并在自监督学习(SSL)的最新进展的基础上,从可穿戴设备数据中推进了对急性MD发作的检测.这种方法利用未标记的数据在预训练期间学习表示,随后被用于监督任务。
    方法:我们收集了使用EmpaticaE4腕带记录的开放访问数据集,与MD监测无关,个人感知任务——从超级马里奥玩家的情感识别到本科生的压力检测——并设计了一个预处理管道,执行体内/体外检测,睡眠/唤醒检测,分割,和(可选地)特征提取。161例E4记录的受试者,我们引入了E4SelfLearning,迄今为止最大的开放访问集合,和它的预处理管道。我们开发了一种新颖的E4定制变压器(E4mer)架构,作为SSL和完全监督学习的蓝图;我们评估了自我监督预培训是否以及在何种条件下导致了对完全监督基线的改进(即,完全监督的E4mer和预深度学习算法)从64个记录片段中检测急性MD发作(n=32,50%,急性,n=32,50%,稳定)患者。
    结果:使用我们的新型E4mer或极端梯度增强(XGBoost),SSL的性能明显优于完全监督的管道:n=3353(81.23%)对n=3110(75.35%;E4mer)和n=2973(72.02%;XGBoost)从总共4128个片段中正确分类了记录片段。SSL性能与用于预训练的特定代理任务密切相关,以及无标签的数据可用性。
    结论:我们发现SSL,一种范式,其中模型在未标记的数据上进行预训练,在部署到感兴趣的有监督目标任务之前不需要人工注释,有助于克服注释瓶颈;预训练代理任务的选择和预训练的未标记数据的大小是SSL成功的关键决定因素。我们介绍了E4mer,可以用于SSL,并分享了E4SelfLearning系列,连同它的预处理管道,这可以促进和加快未来对个人感知SSL的研究。
    BACKGROUND: Personal sensing, leveraging data passively and near-continuously collected with wearables from patients in their ecological environment, is a promising paradigm to monitor mood disorders (MDs), a major determinant of the worldwide disease burden. However, collecting and annotating wearable data is resource intensive. Studies of this kind can thus typically afford to recruit only a few dozen patients. This constitutes one of the major obstacles to applying modern supervised machine learning techniques to MD detection.
    OBJECTIVE: In this paper, we overcame this data bottleneck and advanced the detection of acute MD episodes from wearables\' data on the back of recent advances in self-supervised learning (SSL). This approach leverages unlabeled data to learn representations during pretraining, subsequently exploited for a supervised task.
    METHODS: We collected open access data sets recording with the Empatica E4 wristband spanning different, unrelated to MD monitoring, personal sensing tasks-from emotion recognition in Super Mario players to stress detection in undergraduates-and devised a preprocessing pipeline performing on-/off-body detection, sleep/wake detection, segmentation, and (optionally) feature extraction. With 161 E4-recorded subjects, we introduced E4SelfLearning, the largest-to-date open access collection, and its preprocessing pipeline. We developed a novel E4-tailored transformer (E4mer) architecture, serving as the blueprint for both SSL and fully supervised learning; we assessed whether and under which conditions self-supervised pretraining led to an improvement over fully supervised baselines (ie, the fully supervised E4mer and pre-deep learning algorithms) in detecting acute MD episodes from recording segments taken in 64 (n=32, 50%, acute, n=32, 50%, stable) patients.
    RESULTS: SSL significantly outperformed fully supervised pipelines using either our novel E4mer or extreme gradient boosting (XGBoost): n=3353 (81.23%) against n=3110 (75.35%; E4mer) and n=2973 (72.02%; XGBoost) correctly classified recording segments from a total of 4128 segments. SSL performance was strongly associated with the specific surrogate task used for pretraining, as well as with unlabeled data availability.
    CONCLUSIONS: We showed that SSL, a paradigm where a model is pretrained on unlabeled data with no need for human annotations before deployment on the supervised target task of interest, helps overcome the annotation bottleneck; the choice of the pretraining surrogate task and the size of unlabeled data for pretraining are key determinants of SSL success. We introduced E4mer, which can be used for SSL, and shared the E4SelfLearning collection, along with its preprocessing pipeline, which can foster and expedite future research into SSL for personal sensing.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:开发和验证一种深度学习模型,用于从根尖周射线照片自动评估牙髓病例困难。
    方法:从两个临床地点编制了1,386个根尖周X线片的数据集。两名牙医和两名牙髓医师在Endocase申请中使用美国牙髓医师协会的“简单评估”标准对X射线照片进行了困难注释。分类任务将案例标记为“简单”或“困难”,而回归预测总体难度得分。使用了卷积神经网络(即VGG16、ResNet18、ResNet50、ResNext50和Inceptionv2),使用通过从ImageNet权重的迁移学习训练的基线模型。其他模型使用自监督对比学习进行预训练(即BYOL,SimCLR,MoCo,和DINO)在20,295个未标记的牙科射线照片上学习没有手动标签的表示。这两个模型都使用10倍交叉验证进行了评估,与保持测试装置中的七名人类检查者(三名普通牙医和四名牙髓医生)相比。
    结果:基线VGG16模型在分类难度方面达到了87.62%的准确率。自我监督的预训练并没有提高性能。回归预测得分,得分误差为±3.21。所有模型的性能都优于人类评估者,具有较差的考试者间可靠性。
    结论:这项初步研究证明了通过深度学习模型进行自动牙髓困难评估的可行性。
    BACKGROUND: To develop and validate a deep learning model for automated assessment of endodontic case difficulty from periapical radiographs.
    METHODS: A dataset of 1,386 periapical radiographs was compiled from two clinical sites. Two dentists and two endodontists annotated the radiographs for difficulty using the \"simple assessment\" criteria from the American Association of Endodontists\' case difficulty assessment form in the Endocase application. A classification task labeled cases as \"easy\" or \"hard\", while regression predicted overall difficulty scores. Convolutional neural networks (i.e. VGG16, ResNet18, ResNet50, ResNext50, and Inception v2) were used, with a baseline model trained via transfer learning from ImageNet weights. Other models was pre-trained using self-supervised contrastive learning (i.e. BYOL, SimCLR, MoCo, and DINO) on 20,295 unlabeled dental radiographs to learn representation without manual labels. Both models were evaluated using 10-fold cross-validation, with performance compared to seven human examiners (three general dentists and four endodontists) on a hold-out test set.
    RESULTS: The baseline VGG16 model attained 87.62% accuracy in classifying difficulty. Self-supervised pretraining did not improve performance. Regression predicted scores with ± 3.21 score error. All models outperformed human raters, with poor inter-examiner reliability.
    CONCLUSIONS: This pilot study demonstrated the feasibility of automated endodontic difficulty assessment via deep learning models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    智能手表能够连续和非侵入性的时间序列监测心血管生物标志物,如心率(来自光电容积描记图),步数计数器,皮肤温度,等等;因此,他们有希望协助早期发现和预防心血管疾病。虽然这些生物标志物可能对医生没有直接的帮助,机器学习(ML)模型可以发现临床相关模式.不幸的是,ML模型通常需要监督(即,注释)数据,大量连续数据的标签是非常劳动密集型的。因此,数据高效的ML方法,ie,需要少量的标签,需要检测可穿戴数据中发现的模式的潜在临床价值。
    ME-TIME(医学中启用机器学习的时间序列分析)研究的主要研究目标是设计一种ML模型,该模型可以以数据高效的方式从可穿戴数据中检测心房颤动(AF)和心力衰竭(HF)。为了实现这一点,使用了自监督和弱监督学习技术。
    两百个科目(100个参考,50AF,和50HF)被邀请参加佩戴Fitbit健身追踪器3个月。有兴趣的志愿者会收到一份调查问卷,以确定他们的健康状况。尤其是心血管健康。没有任何严重疾病(病史)的志愿者被分配到参照组。AF和HF的参与者在海牙的Haga教学医院招募,荷兰。
    报名于2022年5月1日开始,截至本报告时,62名受试者被纳入研究。对数据的初步分析揭示了显著的受试者间差异。值得注意的是,我们将心率恢复曲线和心率与步数之间的时间延迟相关性确定为心脏病的潜在强指标.
    使用自监督和多实例学习技术,我们假设在从智能手表获得的连续数据中可以找到特定于AF和HF的模式。
    UNASSIGNED: Smartwatches enable continuous and noninvasive time series monitoring of cardiovascular biomarkers like heart rate (from photoplethysmograms), step counter, skin temperature, et cetera; as such, they have promise in assisting in early detection and prevention of cardiovascular disease. Although these biomarkers may not be directly useful to physicians, a machine learning (ML) model could find clinically relevant patterns. Unfortunately, ML models typically need supervised (ie, annotated) data, and labeling of large amounts of continuous data is very labor intensive. Therefore, ML methods that are data efficient, ie, needing a low number of labels, are required to detect potential clinical value in patterns found in wearable data.
    UNASSIGNED: The primary study objective of the ME-TIME (Machine Learning Enabled Time Series Analysis in Medicine) study is to design an ML model that can detect atrial fibrillation (AF) and heart failure (HF) from wearable data in a data-efficient manner. To achieve this, self-supervised and weakly supervised learning techniques are used.
    UNASSIGNED: Two hundred subjects (100 reference, 50 AF, and 50 HF) are being invited to participate in wearing a Fitbit fitness tracker for 3 months. Interested volunteers are sent a questionnaire to determine their health, in particular cardiovascular health. Volunteers without any (history of) serious illness are assigned to the reference group. Participants with AF and HF are recruited in the Haga teaching hospital in The Hague, The Netherlands.
    UNASSIGNED: Enrollment commenced on May 1, 2022, and as of the time of this report, 62 subjects have been included in the study. Preliminary analysis of the data reveals significant inter-subject variability. Notably, we identified heart rate recovery curves and time-delayed correlations between heart rate and step count as potential strong indicators for heart disease.
    UNASSIGNED: Using self-supervised and multiple-instance learning techniques, we hypothesize that patterns specific to AF and HF can be found in continuous data obtained from smartwatches.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    深度学习模型通过从精心注释的脑电图(EEG)数据中学习,在自动化睡眠医学任务方面取得了巨大成功。然而,有效地使用大量的原始脑电图数据仍然是一个挑战。
    在这项研究中,我们的目标是从大量未标记的EEG信号中学习鲁棒的向量表示,使得学习到的矢量化特征(1)具有足够的表现力来替换睡眠分期任务中的原始信号,(2)在涉及较少标签和噪声样本的场景中,提供比监督模型更好的预测性能。
    我们提出了一个自监督模型,与世界表现(ContraWR)的对比用于脑电信号表示学习。与以前使用一组阴性样本的模型不同,我们的模型使用全球统计数据(即,平均表示)来自数据集,以区分与不同睡眠阶段相关的信号。ContraWR模型在3个现实世界EEG数据集上进行评估,这些数据集包括以下两种设置:在家和实验室EEG记录。
    ContraWR在3个大型EEG数据集上的睡眠分期任务上优于最近报道的4种自我监督学习方法。当可用的训练标签较少时,ContraWR也取代监督学习(例如,当SleepEDF数据集上标记的数据少于2%时,精度提高了4%)。此外,该模型提供了丰富的信息,2D投影中的代表性特征结构。
    我们表明ContraWR对噪声具有鲁棒性,可以为下游预测任务提供高质量的EEG表示。所提出的模型可以推广到其他无监督的生理信号学习任务。未来的方向包括探索特定任务的数据增强,并将自监督方法与监督方法相结合,建立在本研究报告的自我监督学习的初步成功的基础上。
    UNASSIGNED: Deep learning models have shown great success in automating tasks in sleep medicine by learning from carefully annotated electroencephalogram (EEG) data. However, effectively using a large amount of raw EEG data remains a challenge.
    UNASSIGNED: In this study, we aim to learn robust vector representations from massive unlabeled EEG signals, such that the learned vectorized features (1) are expressive enough to replace the raw signals in the sleep staging task, and (2) provide better predictive performance than supervised models in scenarios involving fewer labels and noisy samples.
    UNASSIGNED: We propose a self-supervised model, Contrast with the World Representation (ContraWR), for EEG signal representation learning. Unlike previous models that use a set of negative samples, our model uses global statistics (ie, the average representation) from the data set to distinguish signals associated with different sleep stages. The ContraWR model is evaluated on 3 real-world EEG data sets that include both settings: at-home and in-laboratory EEG recording.
    UNASSIGNED: ContraWR outperforms 4 recently reported self-supervised learning methods on the sleep staging task across 3 large EEG data sets. ContraWR also supersedes supervised learning when fewer training labels are available (eg, 4% accuracy improvement when less than 2% of data are labeled on the Sleep EDF data set). Moreover, the model provides informative, representative feature structures in 2D projection.
    UNASSIGNED: We show that ContraWR is robust to noise and can provide high-quality EEG representations for downstream prediction tasks. The proposed model can be generalized to other unsupervised physiological signal learning tasks. Future directions include exploring task-specific data augmentations and combining self-supervised methods with supervised methods, building upon the initial success of self-supervised learning reported in this study.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    深度学习(DL)可以从常规的结直肠癌(CRC)组织病理学切片中预测微卫星不稳定性(MSI)。然而,目前尚不清楚DL是否也能预测其他具有高性能的生物标志物,以及DL预测是否可推广到外部患者人群.这里,我们从两项大型多中心研究中获取CRC组织样本.我们系统地比较了六种不同的最先进的DL架构,以预测病理切片中的生物标志物。包括MSI和BRAF的突变,KRAS,NRAS,PIK3CA使用大型外部验证队列提供真实的评估设置,我们展示了使用自监督的模型,基于注意力的多实例学习始终优于以前的方法,同时提供指示性区域和形态的可解释可视化。虽然MSI和BRAF突变的预测达到临床级表现,PIK3CA的突变预测,KRAS,NRAS在临床上不足。
    Deep learning (DL) can predict microsatellite instability (MSI) from routine histopathology slides of colorectal cancer (CRC). However, it is unclear whether DL can also predict other biomarkers with high performance and whether DL predictions generalize to external patient populations. Here, we acquire CRC tissue samples from two large multi-centric studies. We systematically compare six different state-of-the-art DL architectures to predict biomarkers from pathology slides, including MSI and mutations in BRAF, KRAS, NRAS, and PIK3CA. Using a large external validation cohort to provide a realistic evaluation setting, we show that models using self-supervised, attention-based multiple-instance learning consistently outperform previous approaches while offering explainable visualizations of the indicative regions and morphologies. While the prediction of MSI and BRAF mutations reaches a clinical-grade performance, mutation prediction of PIK3CA, KRAS, and NRAS was clinically insufficient.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    最近,深度学习在故障诊断领域的应用越来越广泛。然而,大多数深度学习方法依赖于大量标记数据来训练模型,这导致它们在不同场景的应用中的泛化能力较差。为了克服这一缺陷,提出了一种基于自监督学习和稀疏滤波的广义模型(GSLSF)。所提出的方法包括两个阶段。首先(1),考虑到样本对故障和工作状态信息的表示,设计自监督学习借口任务和伪标签,建立基于稀疏滤波的预训练模型。第二(2)建立了从训练前模型到目标任务的知识转移机制,基于稀疏滤波模型提取深度表示的故障特征,并应用softmax回归来区分故障类型。该方法可以在有限的训练数据下显著提高模型的诊断性能和泛化能力。通过两个轴承数据集的故障诊断结果证明了该方法的有效性。
    Recently, deep learning has become more and more extensive in the field of fault diagnosis. However, most deep learning methods rely on large amounts of labeled data to train the model, which leads to their poor generalized ability in the application of different scenarios. To overcome this deficiency, this paper proposes a novel generalized model based on self-supervised learning and sparse filtering (GSLSF). The proposed method includes two stages. Firstly (1), considering the representation of samples on fault and working condition information, designing self-supervised learning pretext tasks and pseudo-labels, and establishing a pre-trained model based on sparse filtering. Secondly (2), a knowledge transfer mechanism from the pre-training model to the target task is established, the fault features of the deep representation are extracted based on the sparse filtering model, and softmax regression is applied to distinguish the type of failure. This method can observably enhance the model\'s diagnostic performance and generalization ability with limited training data. The validity of the method is proved by the fault diagnosis results of two bearing datasets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    量化动物行为的情感方面(例如,焦虑,社交互动,奖励,和应激反应)是神经科学研究的主要焦点。因为情绪相关行为的人工评分是耗时且主观的,经典方法依赖于容易量化的措施,例如杠杆按压或在设备的不同区域花费的时间(例如,开放式vs.高架加迷宫的闭合臂)。最近的进步使得从视频中提取姿势信息变得更加容易,并且已经提出了从姿态估计数据中提取有关行为状态的细微信息的多种方法。这些包括监督,无人监督,和自我监督的方法,采用各种不同的模型类型。从这些方法导出的行为状态的表示可以与神经活动的记录相关,以增加可以在大脑和行为之间绘制的连接的范围。在这个迷你评论中,我们将讨论如何在行为实验中使用深度学习技术,以及不同的模型架构和训练范例如何影响可以获得的表示类型。
    Quantifying emotional aspects of animal behavior (e.g., anxiety, social interactions, reward, and stress responses) is a major focus of neuroscience research. Because manual scoring of emotion-related behaviors is time-consuming and subjective, classical methods rely on easily quantified measures such as lever pressing or time spent in different zones of an apparatus (e.g., open vs. closed arms of an elevated plus maze). Recent advancements have made it easier to extract pose information from videos, and multiple approaches for extracting nuanced information about behavioral states from pose estimation data have been proposed. These include supervised, unsupervised, and self-supervised approaches, employing a variety of different model types. Representations of behavioral states derived from these methods can be correlated with recordings of neural activity to increase the scope of connections that can be drawn between the brain and behavior. In this mini review, we will discuss how deep learning techniques can be used in behavioral experiments and how different model architectures and training paradigms influence the type of representation that can be obtained.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    成像在制药行业中通常用作表征方法,包括用于量化固体和液体制剂中的亚可见颗粒。提取颗粒大小以外的信息,例如对形态亚群进行分类,需要某种类型的图像分析方法。对粒子进行分类的建议方法基于预先确定的形态特征或使用卷积神经网络的监督训练来学习与地面实况标签相关的图像表示。由高度复杂的形态引起的并发症,不可预见的课程,以及耗时的地面真相标签准备工作,是这些方法面临的一些挑战。在这项工作中,我们评估了自监督对比学习方法在研究治疗解决方案中的粒子图像中的应用。与有监督的培训不同,这种方法不需要地面实况标签,并且通过比较粒子图像及其增强来学习表示。该方法为形态学属性评估提供了一种快速且易于实现的粗筛选工具。此外,我们的分析表明,在数据集相对平衡的情况下,图像数据集的小子集足以训练能够提取有用图像表示的卷积神经网络编码器。这也表明,通常观察到的粒子类蛋白质溶液中的预填充注射器出现在编码器的嵌入空间的分离簇,促进执行任务,如训练弱监督分类器或识别新亚群的存在。
    Imaging is commonly used as a characterization method in the pharmaceuticals industry, including for quantifying subvisible particles in solid and liquid formulations. Extracting information beyond particle size, such as classifying morphological subpopulations, requires some type of image analysis method. Suggested methods to classify particles have been based on pre-determined morphological features or use supervised training of convolutional neural networks to learn image representations in relation to ground truth labels. Complications arising from highly complex morphologies, unforeseen classes, and time-consuming preparation of ground truth labels, are some of the challenges faced by these methods. In this work, we evaluate the application of a self-supervised contrastive learning method in studying particle images from therapeutic solutions. Unlike with supervised training, this approach does not require ground truth labels and representations are learned by comparing particle images and their augmentations. This method provides a fast and easily implementable tool of coarse screening for morphological attribute assessment. Furthermore, our analysis shows that in cases with relatively balanced datasets, a small subset of an image dataset is sufficient to train a convolutional neural network encoder capable of extracting useful image representations. It is also demonstrated that particle classes typically observed in protein solutions administered by pre-filled syringes emerge as separated clusters in the encoder\'s embedding space, facilitating performing tasks such as training weakly-supervised classifiers or identifying the presence of new subpopulations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    现有的用于计算机断层结肠成像(CTC)的电子清洗(EC)方法通常基于图像分割,这将它们的准确性限制在潜在体素的准确性。由于可用于培训的CTC数据集的局限性,传统的深度学习在EC中的应用有限。这项研究的目的是评估使用新颖的自监督对抗学习方案以有限的训练数据集进行具有子体素精度的EC的技术可行性。对三维(3D)生成对抗网络(3DGAN)进行了预训练,以对拟人化体模的CTC数据集执行EC。然后通过使用自监督方案将3DGAN微调到每个输入情况。3DGAN的体系结构通过使用幻影研究进行了优化。在18例临床CTC的虚拟3D穿透检查中,所得3DGAN的虚拟清洗的视觉感知质量与商业EC软件的视觉感知质量相比具有优势。因此,提出的自监督3DGAN,它可以被训练为在没有图像注释的小数据集上执行EC,是解决反恐委员会中欧共体剩余技术问题的潜在有效方法。
    Existing electronic cleansing (EC) methods for computed tomographic colonography (CTC) are generally based on image segmentation, which limits their accuracy to that of the underlying voxels. Because of the limitations of the available CTC datasets for training, traditional deep learning is of limited use in EC. The purpose of this study was to evaluate the technical feasibility of using a novel self-supervised adversarial learning scheme to perform EC with a limited training dataset with subvoxel accuracy. A three-dimensional (3D) generative adversarial network (3D GAN) was pre-trained to perform EC on CTC datasets of an anthropomorphic phantom. The 3D GAN was then fine-tuned to each input case by use of the self-supervised scheme. The architecture of the 3D GAN was optimized by use of a phantom study. The visually perceived quality of the virtual cleansing by the resulting 3D GAN compared favorably to that of commercial EC software on the virtual 3D fly-through examinations of 18 clinical CTC cases. Thus, the proposed self-supervised 3D GAN, which can be trained to perform EC on a small dataset without image annotations with subvoxel accuracy, is a potentially effective approach for addressing the remaining technical problems of EC in CTC.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号