XAI

XAI
  • 文章类型: Journal Article
    转移性乳腺癌(MBC)仍然是女性癌症相关死亡的主要原因。这项工作介绍了一种创新的非侵入性乳腺癌分类模型,旨在改善癌症转移的识别。虽然这项研究标志着预测MBC的初步探索,额外的调查对于验证MBC的发生至关重要.我们的方法结合了大型语言模型(LLM)的优势,特别是来自变压器(BERT)模型的双向编码器表示,图神经网络(GNN)的强大功能,可根据组织病理学报告预测MBC患者。本文介绍了一种用于转移性乳腺癌预测(BG-MBC)的BERT-GNN方法,该方法集成了从BERT模型得出的图形信息。在这个模型中,节点是根据病人的医疗记录构建的,虽然BERT嵌入被用来对组织病理学报告中的单词进行矢量化表示,从而通过采用三种不同的方法(即单变量选择,用于特征重要性的额外树分类器,和Shapley值,以确定影响最显著的特征)。确定在模型训练期间作为嵌入生成的676个中最关键的30个特征,我们的模型进一步增强了其预测能力。BG-MBC模型具有出色的准确性,在识别MBC患者时,检出率为0.98,曲线下面积(AUC)为0.98。这种显著的表现归功于模型对LLM从组织病理学报告中产生的注意力得分的利用,有效地捕获相关特征进行分类。
    Metastatic breast cancer (MBC) continues to be a leading cause of cancer-related deaths among women. This work introduces an innovative non-invasive breast cancer classification model designed to improve the identification of cancer metastases. While this study marks the initial exploration into predicting MBC, additional investigations are essential to validate the occurrence of MBC. Our approach combines the strengths of large language models (LLMs), specifically the bidirectional encoder representations from transformers (BERT) model, with the powerful capabilities of graph neural networks (GNNs) to predict MBC patients based on their histopathology reports. This paper introduces a BERT-GNN approach for metastatic breast cancer prediction (BG-MBC) that integrates graph information derived from the BERT model. In this model, nodes are constructed from patient medical records, while BERT embeddings are employed to vectorise representations of the words in histopathology reports, thereby capturing semantic information crucial for classification by employing three distinct approaches (namely univariate selection, extra trees classifier for feature importance, and Shapley values to identify the features that have the most significant impact). Identifying the most crucial 30 features out of 676 generated as embeddings during model training, our model further enhances its predictive capabilities. The BG-MBC model achieves outstanding accuracy, with a detection rate of 0.98 and an area under curve (AUC) of 0.98, in identifying MBC patients. This remarkable performance is credited to the model\'s utilisation of attention scores generated by the LLM from histopathology reports, effectively capturing pertinent features for classification.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    识别面部表情一直是科学界的目标。自从人工智能兴起以来,卷积神经网络(CNN)已经成为识别面部表情的流行,作为图像可以直接用作输入。当前的CNN模型可以实现高识别率,但是他们不知道他们的推理过程。可解释人工智能(XAI)已被开发为一种帮助解释机器学习模型获得的结果的手段。处理图像时,最常用的XAI技术之一是LIME。LIME突出显示图像中有助于分类的区域。作为LIME的替代品,CEM方法出现了,以自然的方式提供对人类分类的解释:除了强调什么足以证明分类是合理的,它还确定了维护它应该缺少什么,并将其与另一种分类区分开来。这项研究提供了将LIME和CEM应用于复杂图像(例如面部表情图像)的比较结果。虽然CEM可以用来解释图像描述的结果,减少了数量的特征,当处理具有大量特征的图像时,LIME将是选择的方法。
    Recognizing facial expressions has been a persistent goal in the scientific community. Since the rise of artificial intelligence, convolutional neural networks (CNN) have become popular to recognize facial expressions, as images can be directly used as input. Current CNN models can achieve high recognition rates, but they give no clue about their reasoning process. Explainable artificial intelligence (XAI) has been developed as a means to help to interpret the results obtained by machine learning models. When dealing with images, one of the most-used XAI techniques is LIME. LIME highlights the areas of the image that contribute to a classification. As an alternative to LIME, the CEM method appeared, providing explanations in a way that is natural for human classification: besides highlighting what is sufficient to justify a classification, it also identifies what should be absent to maintain it and to distinguish it from another classification. This study presents the results of comparing LIME and CEM applied over complex images such as facial expression images. While CEM could be used to explain the results on images described with a reduced number of features, LIME would be the method of choice when dealing with images described with a huge number of features.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在本文中,讨论了识别不良事件的问题。这些事件在历史数据中可能表现不佳,从过去的例子中学习主要是不可能的。所讨论的问题是在两个用例的背景下考虑的,其中分析了无线传感器收集的振动和温度测量。这些用例包括燃煤电厂的破碎机和钢铁厂转炉的龙门架。的意识,由于与工业界的合作,需要一个系统,在冷启动条件下工作,不会淹没机器操作员的警报是提出一种新的预测性维护方法的动机。所提出的解决方案基于离群值识别方法。这些方法被应用于被转换成多维特征向量的所收集的数据。所提出的解决方案的新颖性源于创建减少假阳性警报的方法,它被应用于识别不良事件的系统。这种方法是基于系统对分析数据的适应性,与调度员的互动,以及XAI(eXplainable人工智能)方法的使用。在多个数据集上进行的实验表明,与独立的离群值检测方法的性能相比,所提出的方法平均将误报减少了90.25%。获得的结果允许将开发的方法实施到在实际工业设施中运行的系统。所进行的研究对于具有冷启动问题的系统可能是有价值的,其中频繁的警报可能导致用户对系统的劝阻和忽视。
    In this paper, the problem of the identification of undesirable events is discussed. Such events can be poorly represented in the historical data, and it is predominantly impossible to learn from past examples. The discussed issue is considered in the work in the context of two use cases in which vibration and temperature measurements collected by wireless sensors are analysed. These use cases include crushers at a coal-fired power plant and gantries in a steelworks converter. The awareness, resulting from the cooperation with industry, of the need for a system that works in cold start conditions and does not flood the machine operator with alarms was the motivation for proposing a new predictive maintenance method. The proposed solution is based on the methods of outlier identification. These methods are applied to the collected data that was transformed into a multidimensional feature vector. The novelty of the proposed solution stems from the creation of a methodology for the reduction of false positive alarms, which was applied to a system identifying undesirable events. This methodology is based on the adaptation of the system to the analysed data, the interaction with the dispatcher, and the use of the XAI (eXplainable Artificial Intelligence) method. The experiments performed on several data sets showed that the proposed method reduced false alarms by 90.25% on average in relation to the performance of the stand-alone outlier detection method. The obtained results allowed for the implementation of the developed method to a system operating in a real industrial facility. The conducted research may be valuable for systems with a cold start problem where frequent alarms can lead to discouragement and disregard for the system by the user.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号