Explainability

可解释性
  • 文章类型: Journal Article
    中风是死亡和发病的重要原因,需要早期预测策略以最大限度地降低风险。传统的评估患者的方法,如急性生理学和慢性健康评估(APACHEII,IV)和简化急性生理学评分III(SAPSIII),具有有限的准确性和可解释性。本文提出了一种新颖的方法:一种可解释的方法,基于注意力的早期卒中死亡率预测变压器模型。该模型旨在解决以前预测模型的局限性,同时提供可解释性(提供明确的,模型的可理解解释)和保真度(给出模型从输入到输出的动态的真实解释)。此外,本研究使用Shapley值和基于注意力的评分来探索和比较保真度和可解释性评分,以提高模型的可解释性.研究目标包括设计一个可解释的基于注意力的变压器模型,与现有模型相比,评估其性能,并提供从模型得出的特征重要性。
    Stroke is a significant cause of mortality and morbidity, necessitating early predictive strategies to minimize risks. Traditional methods for evaluating patients, such as Acute Physiology and Chronic Health Evaluation (APACHE II, IV) and Simplified Acute Physiology Score III (SAPS III), have limited accuracy and interpretability. This paper proposes a novel approach: an interpretable, attention-based transformer model for early stroke mortality prediction. This model seeks to address the limitations of previous predictive models, providing both interpretability (providing clear, understandable explanations of the model) and fidelity (giving a truthful explanation of the model\'s dynamics from input to output). Furthermore, the study explores and compares fidelity and interpretability scores using Shapley values and attention-based scores to improve model explainability. The research objectives include designing an interpretable attention-based transformer model, evaluating its performance compared to existing models, and providing feature importance derived from the model.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    原发性免疫性血小板减少症(ITP)是一种罕见的自身免疫性疾病,其特征是患者外周血血小板的免疫介导破坏,导致血小板计数降低和出血。ITP的诊断和有效管理具有挑战性,因为没有既定的测试来确认疾病,也没有生物标志物可以预测对治疗和结果的反应。在这项工作中,我们进行了一项可行性研究,在非急性门诊患者中,利用血常规和人口统计学数据,检查机器学习是否能有效地用于ITP的诊断.各种ML模型,包括Logistic回归,支持向量机,k-最近邻居,决策树和随机森林,应用于英国成人ITP登记处和普通血液科诊所的数据。研究了两种不同的方法:一种是人口感知的方法,另一种是人口感知的方法。我们进行了广泛的实验来评估这些模型和方法的预测性能,以及他们的偏见。结果表明,决策树模型和随机森林模型既优越又公平,实现近乎完美的预测性和公平性得分,血小板计数被确定为最显著的变量。未提供人口统计信息的模型在预测准确性方面表现更好,但公平性得分较低,说明了预测性能和公平性之间的权衡。
    Primary Immune Thrombocytopenia (ITP) is a rare autoimmune disease characterised by the immune-mediated destruction of peripheral blood platelets in patients leading to low platelet counts and bleeding. The diagnosis and effective management of ITP are challenging because there is no established test to confirm the disease and no biomarker with which one can predict the response to treatment and outcome. In this work, we conduct a feasibility study to check if machine learning can be applied effectively for the diagnosis of ITP using routine blood tests and demographic data in a non-acute outpatient setting. Various ML models, including Logistic Regression, Support Vector Machine, k-Nearest Neighbor, Decision Tree and Random Forest, were applied to data from the UK Adult ITP Registry and a general haematology clinic. Two different approaches were investigated: a demographic-unaware and a demographic-aware one. We conduct extensive experiments to evaluate the predictive performance of these models and approaches, as well as their bias. The results revealed that Decision Tree and Random Forest models were both superior and fair, achieving nearly perfect predictive and fairness scores, with platelet count identified as the most significant variable. Models not provided with demographic information performed better in terms of predictive accuracy but showed lower fairness scores, illustrating a trade-off between predictive performance and fairness.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    可解释性是提高人工智能在医学中可信度的关键。然而,医生对模型可解释性的期望与这些模型的实际行为之间存在显著差距。这种差距是由于缺乏以医生为中心的评估框架的共识。需要定量评估有效的可解释性应为从业者提供的实际利益。这里,我们假设优越的注意力映射,作为一种模型解释的机制,应该与医生关注的信息保持一致,潜在地降低预测不确定性并提高模型可靠性。我们使用多模式变压器使用临床数据和磁共振成像来预测直肠癌的淋巴结转移。我们探索了注意力地图有多好,通过最先进的技术可视化,可以与医生的理解达成一致。随后,我们比较了两种不同的不确定性估计方法:仅使用预测概率方差的独立估计,以及考虑预测概率方差和量化一致性的人在环估计。我们的发现表明,与独立方法相比,人在环方法没有显着优势。总之,本案例研究未证实该解释在增强模型可靠性方面的预期益处.肤浅的解释可能弊大于利,误导医生依赖不确定的预测,这表明,在模型可解释性的背景下,不应高估注意力机制的当前状态。
    Explainability is key to enhancing the trustworthiness of artificial intelligence in medicine. However, there exists a significant gap between physicians\' expectations for model explainability and the actual behavior of these models. This gap arises from the absence of a consensus on a physician-centered evaluation framework, which is needed to quantitatively assess the practical benefits that effective explainability should offer practitioners. Here, we hypothesize that superior attention maps, as a mechanism of model explanation, should align with the information that physicians focus on, potentially reducing prediction uncertainty and increasing model reliability. We employed a multimodal transformer to predict lymph node metastasis of rectal cancer using clinical data and magnetic resonance imaging. We explored how well attention maps, visualized through a state-of-the-art technique, can achieve agreement with physician understanding. Subsequently, we compared two distinct approaches for estimating uncertainty: a standalone estimation using only the variance of prediction probability, and a human-in-the-loop estimation that considers both the variance of prediction probability and the quantified agreement. Our findings revealed no significant advantage of the human-in-the-loop approach over the standalone one. In conclusion, this case study did not confirm the anticipated benefit of the explanation in enhancing model reliability. Superficial explanations could do more harm than good by misleading physicians into relying on uncertain predictions, suggesting that the current state of attention mechanisms should not be overestimated in the context of model explainability.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    早期识别有医院获得性尿路感染(HA-UTI)风险的患者,可以及时启动针对性的预防和治疗策略。机器学习(ML)模型在这方面显示出巨大的潜力。然而,感染控制中现有的ML模型显示出支持可解释性的能力差,这挑战了临床实践中对结果的解释,限制ML模型适应日常临床常规。在这项研究中,我们开发了贝叶斯网络(BN)模型,以便在入院后24小时内对HA-UTI风险进行可解释的评估.我们的数据集包含138,250个独特的住院患者。我们包括了入院细节的数据,人口统计,生活方式因素,合并症,重要参数,实验室结果,和导尿管.与从完整的50个特征集开发的模型相比,从减少的五个特征集开发的模型具有透明度。与朴素和树增强的朴素BN模型相比,在减少的特征空间上基于专家的临床BN模型显示出最高的性能(曲线下面积=0.746)。此外,从基于专家的知识开发的模型具有增强的可解释性的特点。
    Early identification of patients at risk of hospital-acquired urinary tract infections (HA-UTI) enables the initiation of timely targeted preventive and therapeutic strategies. Machine learning (ML) models have shown great potential for this purpose. However, existing ML models in infection control have demonstrated poor ability to support explainability, which challenges the interpretation of the result in clinical practice, limiting the adaption of the ML models into a daily clinical routine. In this study, we developed Bayesian Network (BN) models to enable explainable assessment within 24 h of admission for risk of HA-UTI. Our dataset contained 138,250 unique hospital admissions. We included data on admission details, demographics, lifestyle factors, comorbidities, vital parameters, laboratory results, and urinary catheter. Models developed from a reduced set of five features were characterized by transparency compared to models developed from a full set of 50 features. The expert-based clinical BN model over the reduced feature space showed the highest performance (area under the curve = 0.746) compared to the naïve- and tree-augmented-naïve BN models. Moreover, models developed from expert-based knowledge were characterized by enhanced explainability.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:不良事件是指在医院对患者有潜在或实际伤害的事件。这些事件通常通过患者安全事件(PSE)报告进行记录。其中包括详细的叙述,提供有关事件的上下文信息。PSE报告的准确分类对于患者安全监测至关重要。然而,由于分类不一致和报告数量庞大,这一过程面临挑战。文本表示的最新进展,特别是从基于转换器的语言模型派生的上下文文本表示,为更精确的PSE报告分类提供了一个有前途的解决方案。集成机器学习(ML)分类器需要在人类专业知识和人工智能(AI)之间取得平衡。这种整合的核心是可解释性的概念,这对于建立信任和确保有效的人与人工智能协作至关重要。
    目的:本研究旨在研究使用上下文文本表示训练的ML分类器在自动分类PSE报告中的功效。此外,该研究提出了一个界面,该界面将ML分类器与可解释性技术集成在一起,以促进PSE报告分类的人与人工智能协作。
    方法:本研究使用了来自美国东南部一家大型学术医院产科的861份PSE报告的数据集。使用PSE报告的静态和上下文文本表示来训练各种ML分类器。使用多类分类度量和混淆矩阵评估训练的ML分类器。使用本地可解释模型不可知解释(LIME)技术来提供ML分类器预测的基本原理。为事件报告系统设计了将ML分类器与LIME技术集成的接口。
    结果:使用上下文表示的最佳分类器能够获得75.4%(95/126)的准确性,而使用静态文本表示训练的最佳分类器的准确性为66.7%(84/126)。已设计了PSE报告界面,以促进PSE报告分类中的人类与AI协作。在这个设计中,ML分类器推荐前2个最可能的事件类型,以及对预测的解释,使PSE记者和患者安全分析师选择最合适的一个。LIME技术表明,分类器偶尔依赖于任意单词进行分类,强调人类监督的必要性。
    结论:这项研究表明,使用上下文文本表示训练ML分类器可以显着提高PSE报告分类的准确性。本研究设计的界面为PSE报告分类中的人与人协作奠定了基础。从这项研究中获得的见解增强了PSE报告分类中的决策过程,使医院能够更有效地识别潜在的风险和危害,并使患者安全分析师能够及时采取行动,防止患者受到伤害。
    BACKGROUND: Adverse events refer to incidents with potential or actual harm to patients in hospitals. These events are typically documented through patient safety event (PSE) reports, which consist of detailed narratives providing contextual information on the occurrences. Accurate classification of PSE reports is crucial for patient safety monitoring. However, this process faces challenges due to inconsistencies in classifications and the sheer volume of reports. Recent advancements in text representation, particularly contextual text representation derived from transformer-based language models, offer a promising solution for more precise PSE report classification. Integrating the machine learning (ML) classifier necessitates a balance between human expertise and artificial intelligence (AI). Central to this integration is the concept of explainability, which is crucial for building trust and ensuring effective human-AI collaboration.
    OBJECTIVE: This study aims to investigate the efficacy of ML classifiers trained using contextual text representation in automatically classifying PSE reports. Furthermore, the study presents an interface that integrates the ML classifier with the explainability technique to facilitate human-AI collaboration for PSE report classification.
    METHODS: This study used a data set of 861 PSE reports from a large academic hospital\'s maternity units in the Southeastern United States. Various ML classifiers were trained with both static and contextual text representations of PSE reports. The trained ML classifiers were evaluated with multiclass classification metrics and the confusion matrix. The local interpretable model-agnostic explanations (LIME) technique was used to provide the rationale for the ML classifier\'s predictions. An interface that integrates the ML classifier with the LIME technique was designed for incident reporting systems.
    RESULTS: The top-performing classifier using contextual representation was able to obtain an accuracy of 75.4% (95/126) compared to an accuracy of 66.7% (84/126) by the top-performing classifier trained using static text representation. A PSE reporting interface has been designed to facilitate human-AI collaboration in PSE report classification. In this design, the ML classifier recommends the top 2 most probable event types, along with the explanations for the prediction, enabling PSE reporters and patient safety analysts to choose the most suitable one. The LIME technique showed that the classifier occasionally relies on arbitrary words for classification, emphasizing the necessity of human oversight.
    CONCLUSIONS: This study demonstrates that training ML classifiers with contextual text representations can significantly enhance the accuracy of PSE report classification. The interface designed in this study lays the foundation for human-AI collaboration in the classification of PSE reports. The insights gained from this research enhance the decision-making process in PSE report classification, enabling hospitals to more efficiently identify potential risks and hazards and enabling patient safety analysts to take timely actions to prevent patient harm.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Outlier detection is a fundamental data analytics technique often used for many security applications. Numerous outlier detection techniques exist, and in most cases are used to directly identify outliers without any interaction. Typically the underlying data used is often high dimensional and complex. Even though outliers may be identified, since humans can easily grasp low dimensional spaces, it is difficult for a security expert to understand/visualize why a particular event or record has been identified as an outlier. In this paper we study the extent to which outlier detection techniques work in smaller dimensions and how well dimensional reduction techniques still enable accurate detection of outliers. This can help us to understand the extent to which data can be visualized while still retaining the intrinsic outlyingness of the outliers.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Observational Study
    目的:由于COVID-19感染的癌症患者发生急性心脏损伤(ACI)的风险增加和预后不良,我们的目的是开发一种新的可解释的模型,用于预测COVID-19感染的癌症患者的ACI发生.
    方法:这项回顾性观察性研究从2022年12月至2023年4月筛选了740名感染COVID-19的癌症患者。最小绝对收缩和选择算子(LASSO)回归用于指标的初步筛选。为了提高模型的准确性,我们引入了alpha指数,以根据其重要性对指数进行进一步筛选和排名。采用随机森林(RF)构建预测模型。Shapley加法解释(SHAP)和局部可解释模型-不可知解释(LIME)方法用于解释模型。
    结果:根据纳入标准,201例COVID-19癌症患者,包括36个变量指标,包括在分析中。前八个指数(白蛋白,乳酸脱氢酶,胱抑素C,中性粒细胞计数,肌酸激酶同工酶,红细胞分布宽度,D-二聚体和胸部计算机断层扫描)用于预测患有COVID-19感染的癌症患者发生ACI。该模型实现0.940的曲线下面积(AUC)、0.866的准确度、0.750的灵敏度和0.900的特异性。校准曲线和决策曲线分析显示了良好的校准和临床实用性。SHAP结果表明,白蛋白是预测ACI发生的最重要指标。LIME结果表明,该模型可以预测每位感染COVID-19的癌症患者的ACI概率。
    结论:我们开发了一种新颖的机器学习模型,该模型在预测COVID-19感染的癌症患者中ACI的发生方面具有很高的可解释性和准确性,使用实验室和成像指数。
    OBJECTIVE: Due to the increased risk of acute cardiac injury (ACI) and poor prognosis in cancer patients with COVID-19 infection, our aim was to develop a novel and interpretable model for predicting ACI occurrence in cancer patients with COVID-19 infection.
    METHODS: This retrospective observational study screened 740 cancer patients with COVID-19 infection from December 2022 to April 2023. The least absolute shrinkage and selection operator (LASSO) regression was used for the preliminary screening of the indices. To enhance the model accuracy, we introduced an alpha index to further screen and rank the indices based on their significance. Random forest (RF) was used to construct the prediction model. The Shapley Additive Explanation (SHAP) and Local Interpretable Model-Agnostic Explanation (LIME) methods were utilized to explain the model.
    RESULTS: According to the inclusion criteria, 201 cancer patients with COVID-19, including 36 variables indices, were included in the analysis. The top eight indices (albumin, lactate dehydrogenase, cystatin C, neutrophil count, creatine kinase isoenzyme, red blood cell distribution width, D-dimer and chest computed tomography) for predicting the occurrence of ACI in cancer patients with COVID-19 infection were included in the RF model. The model achieved an area under curve (AUC) of 0.940, an accuracy of 0.866, a sensitivity of 0.750 and a specificity of 0.900. The calibration curve and decision curve analysis showed good calibration and clinical practicability. SHAP results demonstrated that albumin was the most important index for predicting the occurrence of ACI. LIME results showed that the model could predict the probability of ACI in each cancer patient infected with COVID-19 individually.
    CONCLUSIONS: We developed a novel machine-learning model that demonstrates high explainability and accuracy in predicting the occurrence of ACI in cancer patients with COVID-19 infection, using laboratory and imaging indices.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    物理化学和药代动力学化合物概况对化合物成为未来药物的效力具有至关重要的影响。如果具有所需活性特征的配体具有不利的物理化学或ADMET特性,则它们不能用于治疗。在研究中,我们考虑代谢稳定性,并专注于细胞色素P450-蛋白的选定亚型,参与生物体复合转化的第一阶段。我们开发了一种产生选定细胞色素同工型的新潜在抑制剂的方案。其后续阶段包括已知细胞色素抑制剂的新衍生物的生成和评估,在获得的配体-蛋白质复合物的基础上,对接和评估化合物可能的抑制作用。除了抑制特定细胞色素亚型的新潜在药物文库,我们还准备了一个图形神经网络,它可以预测起始分子所有修饰的活性变化。此外,我们对特定取代对生成化合物的潜在抑制性质的影响进行了系统的统计研究(单取代和双取代都被考虑),提供抑制性预测的解释,并准备一个在线可视化平台,以便手动检查结果。所开发的方法可以极大地支持新的细胞色素P450抑制剂的设计,其总体目标是产生新的代谢稳定的化合物。它能够即时评估可能的化合物-细胞色素相互作用和选择具有具有所需生物活性的最高潜力的配体。
    Physicochemical and pharmacokinetic compound profile has crucial impact on compound potency to become a future drug. Ligands with desired activity profile cannot be used for treatment if they are characterized by unfavourable physicochemical or ADMET properties. In the study, we consider metabolic stability and focus on selected subtypes of cytochrome P450 - proteins, which take part in the first phase of compound transformations in the organism. We develop a protocol for generation of new potential inhibitors of selected cytochrome isoforms. Its subsequent stages are composed of generation and assessment of new derivatives of known cytochrome inhibitors, docking and evaluation of the compound possible inhibition on the basis of the obtained ligand-protein complexes. Besides the library of new potential agents inhibiting particular cytochrome subtypes, we also prepare a graph neural network that predicts the change in activity for all modifications of the starting molecule. In addition, we perform a systematic statistical study on the influence of particular substitutions on the potential inhibition properties of generated compounds (both mono- and di-substitutions are considered), provide explanations of the inhibitory predictions and prepare an on-line visualization platform enabling manual inspection of the results. The developed methodology can greatly support the design of new cytochrome P450 inhibitors with the overarching goal of generation of new metabolically stable compounds. It enables instant evaluation of possible compound-cytochrome interactions and selection of ligands with the highest potential of possessing desired biological activity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    慢性眼部疾病(COD),如近视,糖尿病视网膜病变,年龄相关性黄斑变性,青光眼,白内障会影响眼睛,甚至可能导致严重的视力障碍或失明。根据世界卫生组织(WHO)最近关于愿景的报告,全世界至少有22亿人患有视力障碍。通常,直到疾病发展到晚期,表明COD的明显迹象才会出现。然而,如果早期检测到COD,通过早期干预和经济有效的治疗可以避免视力障碍.眼科医生通过检查视网膜的某些微小变化来检测COD,比如微动脉瘤,黄斑水肿,出血,血管的改变.眼睛状况的范围是多种多样的,这些情况中的每一种都需要独特的针对患者的治疗。卷积神经网络(CNN)在多学科领域显示出巨大的潜力,包括各种眼部疾病的检测。在这项研究中,我们将几种预处理方法与卷积神经网络相结合,以准确检测眼底图像中的COD。据我们所知,这是第一项使用CNN模型对COD分类的预处理方法进行定性分析的工作。实验结果表明,在感兴趣区域分割图像上训练的CNN在很大程度上优于在原始输入图像上训练的模型。此外,三种预处理技术的集合比其他最先进的方法高出30%和3%,就Kappa和F1得分而言,分别。开发的原型已经过广泛的测试,可以在更全面的COD数据集上进行评估,以便在临床设置中部署。
    Chronic Ocular Diseases (COD) such as myopia, diabetic retinopathy, age-related macular degeneration, glaucoma, and cataract can affect the eye and may even lead to severe vision impairment or blindness. According to a recent World Health Organization (WHO) report on vision, at least 2.2 billion individuals worldwide suffer from vision impairment. Often, overt signs indicative of COD do not manifest until the disease has progressed to an advanced stage. However, if COD is detected early, vision impairment can be avoided by early intervention and cost-effective treatment. Ophthalmologists are trained to detect COD by examining certain minute changes in the retina, such as microaneurysms, macular edema, hemorrhages, and alterations in the blood vessels. The range of eye conditions is diverse, and each of these conditions requires a unique patient-specific treatment. Convolutional neural networks (CNNs) have demonstrated significant potential in multi-disciplinary fields, including the detection of a variety of eye diseases. In this study, we combined several preprocessing approaches with convolutional neural networks to accurately detect COD in eye fundus images. To the best of our knowledge, this is the first work that provides a qualitative analysis of preprocessing approaches for COD classification using CNN models. Experimental results demonstrate that CNNs trained on the region of interest segmented images outperform the models trained on the original input images by a substantial margin. Additionally, an ensemble of three preprocessing techniques outperformed other state-of-the-art approaches by 30% and 3%, in terms of Kappa and F 1 scores, respectively. The developed prototype has been extensively tested and can be evaluated on more comprehensive COD datasets for deployment in the clinical setup.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    患者对治疗反应的异质性在精神疾病中普遍存在。个性化的医学方法-涉及将患者分为更适合特定治疗的亚组-因此可以改善患者的预后,并作为临床试验中患者选择的有力工具。机器学习方法可以识别患者亚群,但由于使用复杂的算法而无法反映临床医生的自然决策过程,因此通常无法“解释”。
    在这里,我们结合了两种分析方法-个性化优势指数和贝叶斯规则列表-以强调模型可解释性的方式识别帕潘立酮指示的精神分裂症患者。我们将这些方法回顾性地应用于随机,安慰剂对照临床试验数据,以确定帕利哌酮指示的精神分裂症患者亚组,这些患者表现出比Cohen'sd评估的完整随机样本更大的治疗效果(治疗结果优于安慰剂)。对于这项研究,结果对应于测量阳性和阴性综合征量表(PANSS)总分的降低(例如,幻觉,妄想),负(例如,钝的影响,情绪退缩),和一般精神病理学(例如,意志的干扰,不合作)精神分裂症的症状。
    使用我们的联合可解释的AI方法来确定一个对帕潘立酮比安慰剂更敏感的亚组,与全样本相比,治疗效果显著增加(单样本t检验p<0.0001,比较全样本Cohen'sd=0.82和产生的亚组Cohen'sd's分布,平均d=1.22,stddd=0.09)。此外,我们的建模方法产生简单的逻辑语句(if-then-else),称为“规则列表”,便于临床医生的解释性。交叉验证生成的大多数规则列表发现了两个一般的精神病理学症状,意志和不合作的干扰,预测帕利哌酮指示的亚组中的成员资格。
    这些结果有助于通过确定具有改善治疗效果的亚组,从技术上验证我们的可解释的AI方法来选择患者进行临床试验。有了这些数据,可解释的规则列表还表明,帕利哌酮可能为精神分裂症患者的治疗提供改善的治疗益处,精神分裂症患者的症状是意志高度紊乱或高度不合作。
    clincialtrials.gov标识符:NCT00,083,668;预期注册于2004年5月28日。
    Heterogeneity among patients\' responses to treatment is prevalent in psychiatric disorders. Personalized medicine approaches-which involve parsing patients into subgroups better indicated for a particular treatment-could therefore improve patient outcomes and serve as a powerful tool in patient selection within clinical trials. Machine learning approaches can identify patient subgroups but are often not \"explainable\" due to the use of complex algorithms that do not mirror clinicians\' natural decision-making processes.
    Here we combine two analytical approaches-Personalized Advantage Index and Bayesian Rule Lists-to identify paliperidone-indicated schizophrenia patients in a way that emphasizes model explainability. We apply these approaches retrospectively to randomized, placebo-controlled clinical trial data to identify a paliperidone-indicated subgroup of schizophrenia patients who demonstrate a larger treatment effect (outcome on treatment superior than on placebo) than that of the full randomized sample as assessed with Cohen\'s d. For this study, the outcome corresponded to a reduction in the Positive and Negative Syndrome Scale (PANSS) total score which measures positive (e.g., hallucinations, delusions), negative (e.g., blunted affect, emotional withdrawal), and general psychopathological (e.g., disturbance of volition, uncooperativeness) symptoms in schizophrenia.
    Using our combined explainable AI approach to identify a subgroup more responsive to paliperidone than placebo, the treatment effect increased significantly over that of the full sample (p < 0.0001 for a one-sample t-test comparing the full sample Cohen\'s d = 0.82 and a generated distribution of subgroup Cohen\'s d\'s with mean d = 1.22, std d = 0.09). In addition, our modeling approach produces simple logical statements (if-then-else), termed a \"rule list\", to ease interpretability for clinicians. A majority of the rule lists generated from cross-validation found two general psychopathology symptoms, disturbance of volition and uncooperativeness, to predict membership in the paliperidone-indicated subgroup.
    These results help to technically validate our explainable AI approach to patient selection for a clinical trial by identifying a subgroup with an improved treatment effect. With these data, the explainable rule lists also suggest that paliperidone may provide an improved therapeutic benefit for the treatment of schizophrenia patients with either of the symptoms of high disturbance of volition or high uncooperativeness.
    clincialtrials.gov identifier: NCT 00,083,668; prospectively registered May 28, 2004.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Sci-hub)

       PDF(Pubmed)

公众号