Multilayer perceptron

多层感知器
  • 文章类型: Journal Article
    目的:开发并验证一种影像组学模型,用于在实施治疗前预测隐匿性局部晚期食管鳞状细胞癌(LA-ESCC)的计算机断层扫描(CT)影像特征。
    方法:该研究回顾性收集了来自两个医疗中心的574例食管鳞状细胞癌(ESCC)患者,分为三组进行培训,内部和外部验证。在描绘感兴趣的体积(VOI)之后,使用三种稳健方法提取影像组学特征并进行特征选择。随后,构建了10个机器学习模型,其中,利用最佳模型建立了影像组学签名。此外,我们开发了一个结合了临床和影像组学特征的预测性列线图.通过接收器工作特性曲线评估了这些模型的性能,校正曲线,决策曲线分析以及包括准确性在内的措施,灵敏度,和特异性。
    结果:总共选择了19个影像组学特征。多层感知器(MLP),被发现是最优的,在训练中达到0.919、0.864和0.882的AUC,内部和外部验证队列,分别。同样,MLP在区分cT1-2N0M0亚组中隐匿性LA-ESCC方面显示出良好的准确性,在两个验证队列中分别为0.803和0.789。通过将影像组学签名与临床签名相结合,在外部验证队列中,预测列线图显示出优异的预测性能,AUC为0.877,准确度为0.85.
    结论:影像组学和机器学习模型可以提高隐匿性LA-ESCC预测的准确性,为临床医生选择治疗方案提供有价值的帮助。
    OBJECTIVE: Development and validation of a radiomics model for predicting occult locally advanced esophageal squamous cell carcinoma (LA-ESCC) on computed tomography (CT) radiomic features before implementation of treatment.
    METHODS: The study retrospectively collected 574 patients with esophageal squamous cell carcinoma (ESCC) from two medical centers, which were divided into three cohorts for training, internal and external validation. After delineating volume of interest (VOI), radiomics features were extracted and subjected to feature selection using three robust methods. Subsequently, 10 machine learning models were constructed, among which the optimal model was utilized to establish a radiomics signature. Furthermore, a predictive nomogram incorporating both clinical and radiomics signatures was developed. The performance of these models was evaluated through receiver operating characteristic curves, calibration curves, decision curve analysis as well as measures including accuracy, sensitivity, and specificity.
    RESULTS: A total of 19 radiomics features were selected. The multilayer perceptron (MLP), which was found to be optimal, achieved an AUC of 0.919, 0.864 and 0.882 in the training, internal and external validation cohorts, respectively. Similarly, MLP showed good accuracy in distinguish occult LA-ESCC in subgroup of cT1-2N0M0 diagnosed by clinicians with 0.803 and 0.789 in two validation cohorts respectively. By incorporating the radiomics signature with clinical signature, a predictive nomogram demonstrated superior prediction performance with an AUC of 0.877 and accuracy of 0.85 in external validation cohort.
    CONCLUSIONS: The radiomics and machine learning model can offers improved accuracy in prediction of occult LA-ESCC, providing valuable assistance to clinicians when choosing treatment plans.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    手机和可穿戴设备等设备的广泛使用允许自动监控人类日常活动,生成大量数据集,提供对人类长期行为的洞察。结构化和受控的数据收集过程对于释放这些信息的全部潜力至关重要。虽然用于身体活动监测的可穿戴传感器在医疗保健领域获得了巨大的吸引力,体育科学,和健身应用,为研究和算法开发确保多样化和全面的数据集是一个显著的挑战。在这个概念验证研究中,我们强调了语义表示在增强数据互操作性和促进身体活动传感器观测的高级分析方面的重要性。我们的方法侧重于通过采用医疗级(CE认证)传感器来生成合成数据集,从而增强身体活动数据集的可用性。此外,我们提供与合成数据集相关的伦理考虑因素的见解。该研究对真实和合成活性数据集进行了比较分析,评估其在减轻模型偏差和促进预测分析公平性方面的有效性。我们已经创建了一个本体,用于在语义上表示来自物理活动传感器的观察结果,并对使用MOX2-5活动传感器收集的数据进行了预测分析。直到现在,一直以来,缺乏使用MOX2-5活动监测医疗级(CE认证)设备收集的身体活动公开数据集.MOX2-5捕获和传输高分辨率数据,包括活动强度,承重,久坐,站立,低,中度,和剧烈的体力活动,以及每分钟的步数。我们的数据集包括在30-45天(约1.5个月)的时间内从16名成年人(男性:12;女性:4)收集的身体活动数据。产生相对较小的539条记录。为了解决这个限制,我们采用各种合成数据生成方法,如高斯Capula(GC),条件表格一般对抗网络(CTGAN),和表格一般对抗网络(TABGAN),用合成数据扩充数据集。对于真实的和合成的数据集,我们开发了多层感知器(MLP)分类模型,用于对日常身体活动水平进行准确分类。研究结果强调了语义本体在语义搜索中的有效性,知识表示,数据集成,推理,并捕获数据之间有意义的关系。该分析支持以下假设:预测模型的效率随着额外合成训练数据量的增加而提高。本体和生成AI具有加速行为监测研究进步的潜力。所提供的数据,包括真正的MOX2-5及其合成对应物,作为开发活动类型分类中的健壮方法的宝贵资源。此外,它为探索与合成数据相关的研究方向开辟了道路,包括模型效率,检测生成的数据,以及有关数据隐私的考虑。
    The widespread use of devices like mobile phones and wearables allows for automatic monitoring of human daily activities, generating vast datasets that offer insights into long-term human behavior. A structured and controlled data collection process is essential to unlock the full potential of this information. While wearable sensors for physical activity monitoring have gained significant traction in healthcare, sports science, and fitness applications, securing diverse and comprehensive datasets for research and algorithm development poses a notable challenge. In this proof-of-concept study, we underscore the significance of semantic representation in enhancing data interoperability and facilitating advanced analytics for physical activity sensor observations. Our approach focuses on enhancing the usability of physical activity datasets by employing a medical-grade (CE certified) sensor to generate synthetic datasets. Additionally, we provide insights into ethical considerations related to synthetic datasets. The study conducts a comparative analysis between real and synthetic activity datasets, assessing their effectiveness in mitigating model bias and promoting fairness in predictive analysis. We have created an ontology for semantically representing observations from physical activity sensors and conducted predictive analysis on data collected using MOX2-5 activity sensors. Until now, there has been a lack of publicly available datasets for physical activity collected with MOX2-5 activity monitoring medical grade (CE certified) device. The MOX2-5 captures and transmits high-resolution data, including activity intensity, weight-bearing, sedentary, standing, low, moderate, and vigorous physical activity, as well as steps per minute. Our dataset consists of physical activity data collected from 16 adults (Male: 12; Female: 4) over a period of 30-45 days (approximately 1.5 months), yielding a relatively small volume of 539 records. To address this limitation, we employ various synthetic data generation methods, such as Gaussian Capula (GC), Conditional Tabular General Adversarial Network (CTGAN), and Tabular General Adversarial Network (TABGAN), to augment the dataset with synthetic data. For both the authentic and synthetic datasets, we have developed a Multilayer Perceptron (MLP) classification model for accurately classifying daily physical activity levels. The findings underscore the effectiveness of semantic ontology in semantic search, knowledge representation, data integration, reasoning, and capturing meaningful relationships between data. The analysis supports the hypothesis that the efficiency of predictive models improves as the volume of additional synthetic training data increases. Ontology and Generative AI hold the potential to expedite advancements in behavioral monitoring research. The data presented, encompassing both real MOX2-5 and its synthetic counterpart, serves as a valuable resource for developing robust methods in activity type classification. Furthermore, it opens avenues for exploration into research directions related to synthetic data, including model efficiency, detection of generated data, and considerations regarding data privacy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:为了系统的运动训练和有针对性的人才培养,详细了解顶尖运动员的身体素质和运动能力至关重要。然而,很难识别性能要求的差异,从而提供足够的支持,特别是对于乍一看似乎有类似需求的运动,例如田径投掷学科。因此,这项研究的目的是检查来自不同投掷学科的顶尖运动员的身体素质和运动能力,并检查运动员的性能参数是否符合各自运动的具体要求。方法:该研究涉及289名男性青年运动员(14-18岁),涉及四个不同的投掷学科:铅球(n=101),锤击(n=16),铁饼投掷(n=63),和标枪投掷(n=109)。性能评估包括适用于投掷学科的三个人体测量和十二个运动性能先决条件。实施了判别分析和神经网络(多层感知器),以确定将运动员与四项运动区分开的可能性。结果:研究结果表明,在男性投掷运动员中,一般身体素质和运动能力评估的差异根据其特定运动识别出大多数有才华的年轻运动员(判别分析:68.2%;多层感知器分析:72.2%)。无论采用何种分类方法,这仍然适用。铁饼投掷者拥有身高优势,而推杆和锤击手表现出优越的手臂力量。标枪投掷者表现出更好的爆发力和冲刺速度。除了投锤者,在药球向前或向后投掷测试中,所有事件都显示出高水平的爆发力,这对铅球和铁饼运动员尤其重要。结论:肯定了身体素质和运动能力测试在识别和转移田径投掷学科有才华的运动员中的意义。使用线性和非线性分类方法,大多数运动员可以被分配到他们正确的运动。然而,这也表明,每种运动都需要略有不同的培训和人才识别。此外,非线性分析方法可以为青少年竞技体育的发展过程提供有用的支持。
    Purpose: For systematic athletic training and targeted talent development, it is essential to know the physical fitness and motor competencies of top athletes in detail. However, it can be difficult to identify differences in performance requirements and thus to provide adequate support, especially for sports that at first glance appear to have similar demands-such as track and field throwing disciplines. Therefore, the aim of the study was to examine the physical fitness and motor competence of top athletes from different throwing disciplines and to check whether the athletes\' performance parameters match the specific requirements of the respective sport. Methods: The study involved 289 male youth athletes (aged 14-18 years) across four distinct throwing disciplines: shot put (n = 101), hammer throw (n = 16), discus throw (n = 63), and javelin throw (n = 109). The performance evaluation comprised three anthropometric measurements and twelve motor performance prerequisites applicable to the throwing disciplines. Discriminant analysis and neural networks (Multilayer Perceptron) were implemented to determine the possibility of distinguishing among athletes from the four sports. Results: The study\'s findings indicate that in male throwing athletes, disparities in general physical fitness and motor proficiency assessments discern the majority of talented young athletes based on their specific sport (discriminant analysis: 68.2%; multilayer perceptron analysis: 72.2%). This remains applicable irrespective of the classification method employed. Discus throwers possessed a height advantage, while shot putters and hammer throwers exhibited superior arm strength. Javelin throwers displayed better explosive strength and sprinting speed. Except for the hammer throwers, all events demonstrated a high level of explosive power in the medicine ball forward or backward throw test, which was especially crucial for shot put and discus athletes. Conclusion: The significance of physical fitness and motor competence tests in identifying and transferring talented athletes in track and field throwing disciplines has been affirmed. Using linear and non-linear classification methods, most athletes could be assigned to their correct sport. However, this also shows that slightly different training and talent identification is required for each of these sports. Furthermore, non-linear analysis methods can provide useful support for the development processes in junior competitive sports.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    无监督缺陷检测方法由于能够规避复杂的故障样本收集而在工业缺陷检测中引起了极大的关注。然而,这些模型努力在复杂的场景中在正常和异常情况之间建立一个稳健的边界,导致假阳性预测的频率增加。虚假警报加剧了重新确认的工作,并阻碍了无监督异常检测模型在工业应用中的广泛采用。为此,我们深入研究无监督缺陷检测模型中唯一可用的数据源,无监督训练数据集,引入一种称为错误警报识别(FAI)方法的解决方案,旨在使用无异常图像学习潜在错误警报的分布。它利用多层感知器从对象级别的无异常训练图像上训练的检测器捕获潜在错误警报的语义信息。在测试阶段,FAI模型作为在基线检测算法之后应用的后处理模块操作。FAI算法通过其语义特征来确定归一化流算法预测的每个正片是否是虚警。当肯定预测被识别为错误警报时,相应的像素级预测设置为负。在广泛的工业应用中,两种最先进的归一化流算法证明了FAI方法的有效性。
    Unsupervised defect detection methods have garnered substantial attention in industrial defect detection owing to their capacity to circumvent complex fault sample collection. However, these models grapple with establishing a robust boundary between normal and abnormal conditions in intricate scenarios, leading to a heightened frequency of false-positive predictions. Spurious alerts exacerbate the work of reconfirmation and impede the widespread adoption of unsupervised anomaly detection models in industrial applications. To this end, we delve into the sole available data source in unsupervised defect detection models, the unsupervised training dataset, to introduce a solution called the False Alarm Identification (FAI) method aimed at learning the distribution of potential false alarms using anomaly-free images. It exploits a multi-layer perceptron to capture the semantic information of potential false alarms from a detector trained on anomaly-free training images at the object level. During the testing phase, the FAI model operates as a post-processing module applied after the baseline detection algorithm. The FAI algorithm determines whether each positive patch predicted by the normalizing flow algorithm is a false alarm by its semantic features. When a positive prediction is identified as a false alarm, the corresponding pixel-wise predictions are set to negative. The effectiveness of the FAI method is demonstrated by two state-of-the-art normalizing flow algorithms on extensive industrial applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    确定肝细胞癌(HCC)患者肝切除术后复发的高风险,有助于及时实施介入治疗。本研究旨在开发一种机器学习(ML)模型来预测肝癌患者肝切除术后的复发风险。
    我们回顾性收集了2013年4月至2017年10月在中山大学附属第三医院接受根治性肝切除术的315例HCC患者,并以7:3的比例随机分为训练集和验证集。根据HCC患者术后复发情况,将患者分为复发组和未复发组,并对两组进行单因素和多因素logistic回归。我们应用了六种机器学习算法来构建预测模型,并通过10倍交叉验证进行了内部验证。Shapley加性解释(SHAP)方法用于解释机器学习模型。我们还建立了一个基于最佳机器学习模型的网络计算器,以个性化评估肝癌患者肝切除术后的复发风险。
    机器学习模型中包含了总共13个变量。多层感知器(MLP)机器学习模型在测试集(AUC=0.680)中具有最佳预测值。SHAP方法显示γ-谷氨酰转肽酶(GGT),纤维蛋白原,中性粒细胞,谷草转氨酶(AST)和总胆红素(TB)是肝癌患者肝切除术后复发风险的前5个重要因素。此外,我们通过分析两名患者进一步证明了模型的可靠性.最后,我们成功构建了基于MLP机器学习模型的在线网络预测计算器。
    MLP是预测肝癌患者肝切除术后复发风险的最佳机器学习模型。该预测模型可以帮助识别肝切除术后高复发风险的HCC患者,以提供早期和个性化的治疗。
    UNASSIGNED: Identifying patients with hepatocellular carcinoma (HCC) at high risk of recurrence after hepatectomy can help to implement timely interventional treatment. This study aimed to develop a machine learning (ML) model to predict the recurrence risk of HCC patients after hepatectomy.
    UNASSIGNED: We retrospectively collected 315 HCC patients who underwent radical hepatectomy at the Third Affiliated Hospital of Sun Yat-sen University from April 2013 to October 2017, and randomly divided them into the training and validation sets at a ratio of 7:3. According to the postoperative recurrence of HCC patients, the patients were divided into recurrence group and non-recurrence group, and univariate and multivariate logistic regression were performed for the two groups. We applied six machine learning algorithms to construct the prediction models and performed internal validation by 10-fold cross-validation. Shapley additive explanations (SHAP) method was applied to interpret the machine learning model. We also built a web calculator based on the best machine learning model to personalize the assessment of the recurrence risk of HCC patients after hepatectomy.
    UNASSIGNED: A total of 13 variables were included in the machine learning models. The multilayer perceptron (MLP) machine learning model was proved to achieve optimal predictive value in test set (AUC = 0.680). The SHAP method displayed that γ-glutamyl transpeptidase (GGT), fibrinogen, neutrophil, aspartate aminotransferase (AST) and total bilirubin (TB) were the top 5 important factors for recurrence risk of HCC patients after hepatectomy. In addition, we further demonstrated the reliability of the model by analyzing two patients. Finally, we successfully constructed an online web prediction calculator based on the MLP machine learning model.
    UNASSIGNED: MLP was an optimal machine learning model for predicting the recurrence risk of HCC patients after hepatectomy. This predictive model can help identify HCC patients at high recurrence risk after hepatectomy to provide early and personalized treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    人工神经网络可以解决计算机视觉中的各种任务,例如图像分类,物体检测,和普遍认可。我们的比较研究涉及四种类型的人工神经网络-多层感知器,概率神经网络,径向基函数神经网络,和卷积神经网络-并研究它们对2D矩阵码(数据矩阵码,QR码,和阿兹特克代码)以及它们的旋转。本文介绍了这些人工神经网络的基本构建块及其体系结构,并比较了这些神经网络在不同配置下的2D矩阵代码的分类精度。使用3000个合成代码样本的数据集来训练和测试神经网络。当神经网络在完整的数据集上训练时,卷积神经网络显示了它的优越性,其次是RBF神经网络和多层感知器。
    Artificial neural networks can solve various tasks in computer vision, such as image classification, object detection, and general recognition. Our comparative study deals with four types of artificial neural networks-multilayer perceptrons, probabilistic neural networks, radial basis function neural networks, and convolutional neural networks-and investigates their ability to classify 2D matrix codes (Data Matrix codes, QR codes, and Aztec codes) as well as their rotation. The paper presents the basic building blocks of these artificial neural networks and their architecture and compares the classification accuracy of 2D matrix codes under different configurations of these neural networks. A dataset of 3000 synthetic code samples was used to train and test the neural networks. When the neural networks were trained on the full dataset, the convolutional neural network showed its superiority, followed by the RBF neural network and the multilayer perceptron.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项研究涉及存储在具有不同属性集的独立本地表中的分散数据。本文提出了一种训练单个神经网络的新方法-基于分散数据的多层感知器。这个想法是根据本地表训练具有相同结构的本地模型;但是,由于本地表中存在不同的条件属性集,需要生成一些人工对象来训练局部模型。本文介绍了在所提出的创建人造对象以训练局部模型的方法中使用变化的参数值的研究。本文对基于单个原始对象生成的人造对象的数量进行了详尽的比较,数据分散的程度,数据平衡,和不同的网络结构-隐藏层中神经元的数量。结果发现,对于具有大量对象的数据集,较少数量的人造物体是最佳的。对于较小的数据集,更多的人造物体(三个或四个)产生更好的结果。对于大型数据集,数据平衡和离散程度对分类质量没有显著影响。相反,隐藏层中更多的神经元会产生更好的结果(范围是输入层中神经元数量的3到5倍)。
    This study concerns dispersed data stored in independent local tables with different sets of attributes. The paper proposes a new method for training a single neural network-a multilayer perceptron based on dispersed data. The idea is to train local models that have identical structures based on local tables; however, due to different sets of conditional attributes present in local tables, it is necessary to generate some artificial objects to train local models. The paper presents a study on the use of varying parameter values in the proposed method of creating artificial objects to train local models. The paper presents an exhaustive comparison in terms of the number of artificial objects generated based on a single original object, the degree of data dispersion, data balancing, and different network structures-the number of neurons in the hidden layer. It was found that for data sets with a large number of objects, a smaller number of artificial objects is optimal. For smaller data sets, a greater number of artificial objects (three or four) produces better results. For large data sets, data balancing and the degree of dispersion have no significant impact on quality of classification. Rather, a greater number of neurons in the hidden layer produces better results (ranging from three to five times the number of neurons in the input layer).
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    使用能够预测光伏(PV)能源生产的模型对于确保该能源与传统配电网的最佳集成至关重要。长短期记忆网络(LSTM)通常用于此目的,但它们的使用可能不是更好的选择,因为它们的计算复杂性很大,推理和训练时间较慢。因此,在这项工作中,我们寻求评估神经网络MLP(多层感知器)的使用,循环神经网络(RNN),和LSTMs,用于预测5min的光伏能源产量。预测的每次迭代都使用从光伏系统收集的最后120分钟的数据(功率,辐照,和PV电池温度),从2019年到2022年年中在Maceió(巴西)测量。此外,使用贝叶斯超参数优化来获得每个模型的最佳结果,并在平等的基础上进行比较。结果表明,MLP表现令人满意,需要更少的时间来训练和预测,表明在特定情况下处理非常短期的预测时,它们可能是一个更好的选择,例如,在计算资源很少的系统中。
    The use of models capable of forecasting the production of photovoltaic (PV) energy is essential to guarantee the best possible integration of this energy source into traditional distribution grids. Long Short-Term Memory networks (LSTMs) are commonly used for this purpose, but their use may not be the better option due to their great computational complexity and slower inference and training time. Thus, in this work, we seek to evaluate the use of neural networks MLPs (Multilayer Perceptron), Recurrent Neural Networks (RNNs), and LSTMs, for the forecast of 5 min of photovoltaic energy production. Each iteration of the predictions uses the last 120 min of data collected from the PV system (power, irradiation, and PV cell temperature), measured from 2019 to mid-2022 in Maceió (Brazil). In addition, Bayesian hyperparameters optimization was used to obtain the best of each model and compare them on an equal footing. Results showed that the MLP performs satisfactorily, requiring much less time to train and forecast, indicating that they can be a better option when dealing with a very short-term forecast in specific contexts, for example, in systems with little computational resources.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    微阵列是电气工程和技术在生物学中的应用,可以同时测量许多基因的表达。它们可以用来分析特定的疾病。本研究对各种微阵列进行分类分析,以比较分类算法在不同数据特征上的性能。根据五种使用的机器学习方法,将数据集分为测试组和对照组,包括多层感知器(MLP),支持向量机(SVM)决策树(DT)随机森林(RF),和k-最近邻居(KNN),并比较了所得的准确性。k折交叉验证用于评估性能,并通过比较五种机器学习方法的性能来分析结果。通过实验,据观察,这两种基于树的方法,DT和RF,结果和其余三种方法显示出相似的趋势,MLP,SVM,和DT,表现出类似的趋势。除了一个数据集之外,DT和RF通常表现出比其他方法更差的性能。这表明,为了对微阵列数据进行有效分类,选择适合数据特点的分类算法对于确保最佳性能至关重要。
    Microarrays are applications of electrical engineering and technology in biology that allow simultaneous measurement of expression of numerous genes, and they can be used to analyze specific diseases. This study undertakes classification analyses of various microarrays to compare the performances of classification algorithms over different data traits. The datasets were classified into test and control groups based on five utilized machine learning methods, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), Decision Tree (DT), Random Forest (RF), and k-Nearest Neighbors (KNN), and the resulting accuracies were compared. k-fold cross-validation was used in evaluating the performance and the result was analyzed by comparing the performances of the five machine learning methods. Through the experiments, it was observed that the two tree-based methods, DT and RF, showed similar trends in results and the remaining three methods, MLP, SVM, and DT, showed similar trends. DT and RF generally showed worse performance than other methods except for one dataset. This suggests that, for the effective classification of microarray data, selecting a classification algorithm that is suitable for data traits is crucial to ensure optimum performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    揭示大脑各种结构和功能模式之间的关联可以产生有关健康和无序大脑的高度信息的结果。使用神经成像数据的研究最近开始利用各种功能和解剖领域内的信息(即,大脑网络组)。然而,大多数全脑方法都假设整个大脑中相互作用的复杂性相似。在这里,我们研究了大脑网络之间的相互作用捕获不同数量的复杂性的假设,并且我们可以通过基于可用的训练数据改变模型子空间结构的复杂性来更好地捕获这些信息。要做到这一点,我们采用基于贝叶斯优化的框架,称为树parzen估计器(TPE)来识别,利用和分析从大脑的功能磁共振成像(fMRI)子域中提取的时间信息编码的信息的变化模式。在精神分裂症分类任务上使用重复的交叉验证程序,我们证明有证据表明,特定功能子域之间的相互作用通过更复杂的模型架构更好地表征,而其他功能子域需要的较不复杂的模型架构对分类和理解大脑的功能相互作用做出最佳贡献.我们表明,已知与精神分裂症有关的功能子域需要更复杂的体系结构,以最佳地解开有关该疾病的歧视性信息。我们的研究指出了适应性的必要性,分层学习框架,以不同的方式满足不同子域的特征,不仅为了更好的预测,而且还能够识别预测感兴趣结果的特征。
    Revealing associations among various structural and functional patterns of the brain can yield highly informative results about the healthy and disordered brain. Studies using neuroimaging data have more recently begun to utilize the information within as well as across various functional and anatomical domains (i.e., groups of brain networks). However, most whole-brain approaches assume similar complexity of interactions throughout the brain. Here we investigate the hypothesis that interactions between brain networks capture varying amounts of complexity, and that we can better capture this information by varying the complexity of the model subspace structure based on available training data. To do this, we employ a Bayesian optimization-based framework known as the Tree Parzen Estimator (TPE) to identify, exploit and analyze patterns of variation in the information encoded by temporal information extracted from functional magnetic resonance imaging (fMRI) subdomains of the brain. Using a repeated cross-validation procedure on a schizophrenia classification task, we demonstrate evidence that interactions between specific functional subdomains are better characterized by more sophisticated model architectures compared to less complicated ones required by the others for optimally contributing towards classification and understanding the brain\'s functional interactions. We show that functional subdomains known to be involved in schizophrenia require more complex architectures to optimally unravel discriminatory information about the disorder. Our study points to the need for adaptive, hierarchical learning frameworks that cater differently to the features from different subdomains, not only for a better prediction but also for enabling the identification of features predicting the outcome of interest.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号