few shot

  • 文章类型: Journal Article
    在预测关键部件的剩余使用寿命(RUL)的领域,如飞机发动机,一个普遍的挑战仍然存在,可用的历史生活数据往往被证明是不够的。这种不足会产生障碍,如性能退化特征提取的障碍,全面捕捉时间关系的不足,并降低了预测准确性。为了解决这个问题,本文提出了少射条件下的一维CNN-GRU预测模型。为了追求更全面的数据特征提取和增强的RUL预测精度,选择卷积神经网络(CNN)是因为其在复杂的数据动态中辨别高维特征的能力。同时,门控递归单元(GRU)网络在提取数据固有的时间特征方面具有强大的能力。我们将两者结合起来构建CNN-GRU混合网络。此外,将数据分布与相关性和单调性指数相结合,将多传感器监测参数输入CNN-GRU网络。最后,发动机RUL由训练的模型预测。在本文中,在美国国家航空航天局(NASA)C-MAPSS多约束数据集的子数据集上进行了实验,验证了该方法的有效性。实验结果表明,该方法在RUL预测任务中具有较高的准确性,可以有力地证明其有效性。
    In the realm of prognosticating the remaining useful life (RUL) of pivotal components, such as aircraft engines, a prevalent challenge persists where the available historical life data often proves insufficient. This insufficiency engenders obstacles such as impediments in performance degradation feature extraction, inadequacies in capturing temporal relationships comprehensively, and diminished predictive accuracy. To address this issue, a 1D CNN-GRU prediction model for few-shot conditions is proposed in this paper. In pursuit of more comprehensive data feature extraction and enhanced RUL prognostication precision, the Convolutional Neural Network (CNN) is selected for its capacity to discern high-dimensional features amid the intricate dynamics of the data. Concurrently, the Gated Recurrent Unit (GRU) network is leveraged for its robust capability in extracting temporal features inherent within the data. We combine the two to construct a CNN-GRU hybrid network. Moreover, the integration of data distribution alongside correlation and monotonicity indices is employed to winnow the input of multi-sensor monitoring parameters into the CNN-GRU network. Finally, the engine RULs are predicted by the trained model. In this paper, experiments are conducted on a sub-dataset of the National Aeronautics and Space Administration (NASA) C-MAPSS multi-constraint dataset to validate the effectiveness of the method. Experimental results have demonstrated that this method has high accuracy in RUL prediction tasks, which can powerfully demonstrate its effectiveness.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    磁共振成像(MRI)中的自动运动伪影检测(MAD)是一个研究领域,旨在自动标记运动伪影,以防止重复扫描的要求。在本文中,我们确定并解决了自动化MAD领域当前的三个挑战;(1)依赖完全监督的培训,这意味着它们需要运动伪影(MA)的具体示例,(2)在不同工作中使用基准数据集的不一致,以及使用私有数据集来测试和训练新提出的MAD技术,以及(3)缺乏足够大的MRIMAD数据集。为了应对这些挑战,我们演示了如何通过将问题表述为无监督的异常检测(AD)任务来识别MA。我们比较了三种最先进的AD算法DeepSVDD的性能,两个开源脑MRI数据集上的插值高斯描述符和FewSOME,用于MAD和MA严重性分类,FewSOME在两个数据集上的MADAUC>90%,在MA严重性分类任务上的Spearman秩相关系数为0.8。这些模型在少数镜头设置中训练,这意味着构建强大的MAD算法不需要大型脑MRI数据集。这项工作还为在开源基准测试数据集上测试MAD算法设置了标准协议。除了应对这些挑战,我们演示了我们提出的“异常感知”评分函数如何在一个和两个异常类可用于训练的设置中提高了FewSOME的MAD性能。Codeavailableathttps://github.com/niamhbelton/Unsupervised-Brain-MRI-Motion-Artefact-Detection/.
    Automated Motion Artefact Detection (MAD) in Magnetic Resonance Imaging (MRI) is a field of study that aims to automatically flag motion artefacts in order to prevent the requirement for a repeat scan. In this paper, we identify and tackle the three current challenges in the field of automated MAD; (1) reliance on fully-supervised training, meaning they require specific examples of Motion Artefacts (MA), (2) inconsistent use of benchmark datasets across different works and use of private datasets for testing and training of newly proposed MAD techniques and (3) a lack of sufficiently large datasets for MRI MAD. To address these challenges, we demonstrate how MAs can be identified by formulating the problem as an unsupervised Anomaly Detection (AD) task. We compare the performance of three State-of-the-Art AD algorithms DeepSVDD, Interpolated Gaussian Descriptor and FewSOME on two open-source Brain MRI datasets on the task of MAD and MA severity classification, with FewSOME achieving a MAD AUC >90% on both datasets and a Spearman Rank Correlation Coefficient of 0.8 on the task of MA severity classification. These models are trained in the few shot setting, meaning large Brain MRI datasets are not required to build robust MAD algorithms. This work also sets a standard protocol for testing MAD algorithms on open-source benchmark datasets. In addition to addressing these challenges, we demonstrate how our proposed \'anomaly-aware\' scoring function improves FewSOME\'s MAD performance in the setting where one and two shots of the anomalous class are available for training. Code available at https://github.com/niamhbelton/Unsupervised-Brain-MRI-Motion-Artefact-Detection/.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:大型语言模型(LLM)在自然语言处理(NLP)中显示出非凡的能力,特别是在标记数据稀缺或昂贵的领域,例如临床领域。然而,为了解开隐藏在这些LLM中的临床知识,我们需要设计有效的提示,引导他们在没有任何任务特定训练数据的情况下执行特定的临床NLP任务.这被称为上下文学习,这是一门艺术和科学,需要了解不同LLM的优势和劣势,并迅速采用工程方法。
    目的:本研究的目的是评估各种即时工程技术的有效性,包括2个新引入的类型-启发式和合奏提示,使用预训练的语言模型进行零射和少射临床信息提取。
    方法:这项全面的实验研究评估了不同的提示类型(简单的前缀,简单的完形填空,思想链,预期,启发式,和合奏)跨越5个临床NLP任务:临床意义消歧,生物医学证据提取,共同参照决议,药物状态提取,和药物属性提取。使用3种最先进的语言模型评估了这些提示的性能:GPT-3.5(OpenAI),双子座(谷歌),和LLaMA-2(Meta)。该研究将零射与少射提示进行了对比,并探讨了合奏方法的有效性。
    结果:研究表明,针对特定任务的提示定制对于LLM在零射临床NLP中的高性能至关重要。在临床意义上的消歧,GPT-3.5在启发式提示下达到0.96的准确性,在生物医学证据提取中达到0.94的准确性。启发式提示,伴随着一连串的思想提示,跨任务非常有效。在复杂的场景中,很少有机会提示提高性能,和集合方法利用了多种即时优势。GPT-3.5在任务和提示类型上的表现始终优于Gemini和LLaMA-2。
    结论:本研究对即时工程方法进行了严格的评估,并介绍了临床信息提取的创新技术,证明了临床领域上下文学习的潜力。这些发现为未来基于提示的临床NLP研究提供了明确的指导方针。促进非NLP专家参与临床NLP进步。据我们所知,这是在这个生成人工智能时代,对临床NLP的不同提示工程方法进行实证评估的首批作品之一,我们希望它能激励和指导未来在这一领域的研究。
    BACKGROUND: Large language models (LLMs) have shown remarkable capabilities in natural language processing (NLP), especially in domains where labeled data are scarce or expensive, such as the clinical domain. However, to unlock the clinical knowledge hidden in these LLMs, we need to design effective prompts that can guide them to perform specific clinical NLP tasks without any task-specific training data. This is known as in-context learning, which is an art and science that requires understanding the strengths and weaknesses of different LLMs and prompt engineering approaches.
    OBJECTIVE: The objective of this study is to assess the effectiveness of various prompt engineering techniques, including 2 newly introduced types-heuristic and ensemble prompts, for zero-shot and few-shot clinical information extraction using pretrained language models.
    METHODS: This comprehensive experimental study evaluated different prompt types (simple prefix, simple cloze, chain of thought, anticipatory, heuristic, and ensemble) across 5 clinical NLP tasks: clinical sense disambiguation, biomedical evidence extraction, coreference resolution, medication status extraction, and medication attribute extraction. The performance of these prompts was assessed using 3 state-of-the-art language models: GPT-3.5 (OpenAI), Gemini (Google), and LLaMA-2 (Meta). The study contrasted zero-shot with few-shot prompting and explored the effectiveness of ensemble approaches.
    RESULTS: The study revealed that task-specific prompt tailoring is vital for the high performance of LLMs for zero-shot clinical NLP. In clinical sense disambiguation, GPT-3.5 achieved an accuracy of 0.96 with heuristic prompts and 0.94 in biomedical evidence extraction. Heuristic prompts, alongside chain of thought prompts, were highly effective across tasks. Few-shot prompting improved performance in complex scenarios, and ensemble approaches capitalized on multiple prompt strengths. GPT-3.5 consistently outperformed Gemini and LLaMA-2 across tasks and prompt types.
    CONCLUSIONS: This study provides a rigorous evaluation of prompt engineering methodologies and introduces innovative techniques for clinical information extraction, demonstrating the potential of in-context learning in the clinical domain. These findings offer clear guidelines for future prompt-based clinical NLP research, facilitating engagement by non-NLP experts in clinical NLP advancements. To the best of our knowledge, this is one of the first works on the empirical evaluation of different prompt engineering approaches for clinical NLP in this era of generative artificial intelligence, and we hope that it will inspire and inform future research in this area.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    深度学习对象检测网络需要大量的盒子标注数据进行训练,这在医学图像领域是很难获得的。少数镜头目标检测算法对于一个看不见的类别是重要的,可以用一些标记数据来识别和本地化。对于医学图像数据集,图像样式和目标特征与在原始数据集上训练获得的知识非常不同。我们针对这种跨域情况提出了背景抑制注意(BSA)和特征空间微调模块(FSF),在这种情况下,源域和目标域之间存在很大的差距。背景抑制注意力减少了训练过程中背景信息的影响。特征空间微调模块调整兴趣特征的特征分布,这有助于做出更好的预测。我们的方法通过仅使用从模型中提取的信息而无需维护其他信息来提高检测性能,这是方便的,可以很容易地插入到其他网络。我们评估了域内情况和跨域情况下的检测性能。在VOC和COCO数据集上的域内实验以及在VOC到医学图像数据集UriSed2K上的跨域实验表明,我们提出的方法有效地提高了少镜头检测性能。
    Deep learning object detection networks require a large amount of box annotation data for training, which is difficult to obtain in the medical image field. The few-shot object detection algorithm is significant for an unseen category, which can be identified and localized with a few labeled data. For medical image datasets, the image style and target features are incredibly different from the knowledge obtained from training on the original dataset. We propose a background suppression attention(BSA) and feature space fine-tuning module (FSF) for this cross-domain situation where there is a large gap between the source and target domains. The background suppression attention reduces the influence of background information in the training process. The feature space fine-tuning module adjusts the feature distribution of the interest features, which helps to make better predictions. Our approach improves detection performance by using only the information extracted from the model without maintaining additional information, which is convenient and can be easily plugged into other networks. We evaluate the detection performance in the in-domain situation and cross-domain situation. In-domain experiments on the VOC and COCO datasets and the cross-domain experiments on the VOC to medical image dataset UriSed2K show that our proposed method effectively improves the few-shot detection performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Editorial
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    管理压力钻井(MPD)是保证钻井安全的最有效手段,MPD能够通过精确控制井口背压来避免复杂工况的进一步恶化。MPD成功的关键是井控策略,目前严重依赖人工经验,阻碍了井控的自动化和智能化进程。针对这个问题,本文构建了MPD知识图谱,从发表的论文和钻井报告中提取知识,指导井控。为了提高知识图谱中实体提取的性能,从EntLM模型扩展了一些中国实体识别模型CEntLM-KL,其中建立了KL熵以提高实体识别的准确性。通过对基准数据集的实验,已经表明,与最先进的方法相比,所提出的模型有了显着的改进。在少量钻井数据集上,实体识别的F-1得分达到33%。最后,知识图谱存储在Neo4J中,并应用于知识推理。
    Managed pressure drilling (MPD) is the most effective means to ensure drilling safety, and MPD is able to avoid further deterioration of complex working conditions through precise control of the wellhead back pressure. The key to the success of MPD is the well control strategy, which currently relies heavily on manual experience, hindering the automation and intelligence process of well control. In response to this issue, an MPD knowledge graph is constructed in this paper that extracts knowledge from published papers and drilling reports to guide well control. In order to improve the performance of entity extraction in the knowledge graph, a few-shot Chinese entity recognition model CEntLM-KL is extended from the EntLM model, in which the KL entropy is built to improve the accuracy of entity recognition. Through experiments on benchmark datasets, it has been shown that the proposed model has a significant improvement compared to the state-of-the-art methods. On the few-shot drilling datasets, the F-1 score of entity recognition reaches 33%. Finally, the knowledge graph is stored in Neo4J and applied for knowledge inference.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号