reliefF

ReliefF
  • 文章类型: Journal Article
    街头可卡因通常与各种物质混合,加剧其有害影响。本文提出了一种框架,用于识别衰减全反射傅里叶变换红外光谱(ATR-FTIR)间隔,该间隔可以最好地预测可卡因样品中掺假物的浓度。波长通过ReliefF和mRMR特征选择方法根据其相关性进行排名,并且迭代过程基于每种方法建议的排序来移除不太相关的波长。高斯过程(GP)回归模型是在每个波长去除后构建的,并使用RMSE评估预测性能。选择平衡低RMSE值和小百分比的保留波长的子集。使用由345个可卡因样品和不同量的左旋咪唑组成的数据集验证了所提出的框架,咖啡因,非那西丁,还有利多卡因.平均四个掺假者,GP回归加上mRMR保留了662个原始波长的1.07%,在预测性能方面优于PLS和SVR。
    Street cocaine is often mixed with various substances that intensify its harmful effects. This paper proposes a framework to identify attenuated total reflection Fourier transform infrared spectroscopy (ATR-FTIR) intervals that best predict the concentration of adulterants in cocaine samples. Wavelengths are ranked according to their relevance through ReliefF and mRMR feature selection approaches, and an iterative process removes less relevant wavelengths based on the ranking suggested by each approach. Gaussian Process (GP) regression models are constructed after each wavelength removal and the prediction performance is evaluated using RMSE. The subset balancing a low RMSE value and a small percentage of retained wavelengths is chosen. The proposed framework was validated using a dataset consisting of 345 samples of cocaine with different amounts of levamisole, caffeine, phenacetin, and lidocaine. Averaged over the four adulterants, the GP regression coupled with the mRMR retained 1.07 % of the 662 original wavelengths, outperforming PLS and SVR regarding prediction performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    开发一种先进的确定技术,通过深度学习和机器学习方法的不同应用,从胸部X射线和CT扫描胶片中检测COVID-19模式。
    新增强的混合分类网络(SVM-RLF-DNN)包括三个阶段:特征提取,选择和分类。从一系列3×3卷积中提取深度特征,2×2最大轮询操作,然后是深度神经网络(DNN)的扁平化和完全连接层。模型中使用了ReLU激活函数和Adam优化器。ReliefF是Relief的一种改进的特征选择算法,它使用曼哈顿距离代替欧几里德距离。基于特征的意义,ReliefF将权重分配给从全连接层接收的每个提取特征。每个特征的权重是多类问题中相邻实例对的每个类中k个最接近命中和未命中的平均值。ReliefF通过将节点值设置为零来消除较低权重的功能。保持特征的较高权重以获得特征选择。在神经网络的最后一层,多类支持向量机(SVM)用于对COVID-19、病毒性肺炎和健康病例的模式进行分类。具有三个二进制SVM分类器的三个类按照一对全方法对每个二进制SVM使用线性核函数。选择铰链损失函数和L2范数正则化以获得更稳定的结果。所提出的方法是在Kaggle和GitHub的公开可用的胸部X射线和CT扫描图像数据库上进行评估的。所提出的分类模型的性能具有可比的训练,验证,和测试精度,除了敏感性,特异性,和混淆矩阵,用于五重交叉验证的定量评估。
    我们提出的网络在2级X射线和CT上实现了98.48%和95.34%的测试精度。更重要的是,所提出的模型的测试精度,灵敏度,特异性为87.9%,86.32%,3类分类为90.25%(COVID-19,肺炎,正常)胸部X光片。所提出的模型提供了测试精度,灵敏度,特异性为95.34%,94.12%,胸部CT2类分类(COVID-19,非COVID)占96.15%。
    我们提出的分类网络实验结果表明与现有神经网络的竞争力。所提出的神经网络帮助临床医生确定和监测疾病。
    UNASSIGNED: To develop an advanced determination technology for detecting COVID-19 patterns from chest X-ray and CT-scan films with distinct applications of deep learning and machine learning methods.
    UNASSIGNED: The newly enhanced proposed hybrid classification network (SVM-RLF-DNN) comprises of three phases: feature extraction, selection and classification. The in-depth features are extracted from a series of 3×3 convolution, 2×2 max polling operations followed by a flattened and fully connected layer of the deep neural network (DNN). ReLU activation function and Adam optimizer are used in the model. The ReliefF is an improved feature selection algorithm of Relief that uses Manhattan distance instead of Euclidean distance. Based on the significance of the feature, the ReliefF assigns weight to each extracted feature received from a fully connected layer. The weight to each feature is the average of k closest hits and misses in each class for a neighbouring instance pair in multiclass problems. The ReliefF eliminates lower-weight features by setting the node value to zero. The higher weights of the features are kept to obtain the feature selection. At the last layer of the neural network, the multiclass Support Vector Machine (SVM) is used to classify the patterns of COVID-19, viral pneumonia and healthy cases. The three classes with three binary SVM classifiers use linear kernel function for each binary SVM following a one-versus-all approach. The hinge loss function and L2-norm regularization are selected for more stable results. The proposed method is assessed on publicly available chest X-ray and CT-scan image databases from Kaggle and GitHub. The performance of the proposed classification model has comparable training, validation, and test accuracy, as well as sensitivity, specificity, and confusion matrix for quantitative evaluation on five-fold cross-validation.
    UNASSIGNED: Our proposed network has achieved test accuracy of 98.48% and 95.34% on 2-class X-rays and CT. More importantly, the proposed model\'s test accuracy, sensitivity, and specificity are 87.9%, 86.32%, and 90.25% for 3-class classification (COVID-19, Pneumonia, Normal) on chest X-rays. The proposed model provides the test accuracy, sensitivity, and specificity of 95.34%, 94.12%, and 96.15% for 2-class classification (COVID-19, Non-COVID) on chest CT.
    UNASSIGNED: Our proposed classification network experimental results indicate competitiveness with existing neural networks. The proposed neural network assists clinicians in determining and surveilling the disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:头颈部鳞状细胞癌(HNSCC)是一种恶性程度高的恶性肿瘤,侵入性,和转移率。放射治疗,作为HNSCC的重要辅助治疗,可以降低术后复发率,提高生存率。鉴定与HNSCC放疗抵抗(HNSCC-RR)相关的基因有助于寻找潜在的治疗靶点。然而,从数以万计的基因中鉴定出放疗抵抗相关基因是一项具有挑战性的任务。虽然基因之间的相互作用对于阐明复杂的生物过程很重要,大量的基因使得基因相互作用的计算不可行。
    方法:我们提出了一种基因选择算法,RGIE,基于ReliefF,树木集合的基因网络推断(GENIE3)和特征消除。ReliefF用于选择对HNSCC-RR具有区别性的特征子集,GENIE3基于该子集构建了基因调控网络,分析了基因间的调控关系,特征消除用于消除冗余和嘈杂的特征。
    结果:9个基因(SPAG1,FIGN,NUBPL,CHMP5,TCF7L2,COQ10B,BSDC1,ZFPM1,GRPEL1)被鉴定并用于鉴定HNSCC-RR,在精度方面达到0.9730、0.9679、0.9767和0.9885的性能,精度,召回,AUC,分别。最后,qRT-PCR验证了9个标记基因在细胞系(SCC9、SCC9-RR)中的差异表达。
    结论:RGIE可有效筛选HNSCC-RR相关基因。这种方法可能有助于指导患者的临床治疗方式并开发潜在的治疗方法。
    BACKGROUND: Head and Neck Squamous Cell Carcinoma (HNSCC) is a malignant tumor with a high degree of malignancy, invasiveness, and metastasis rate. Radiotherapy, as an important adjuvant therapy for HNSCC, can reduce the postoperative recurrence rate and improve the survival rate. Identifying the genes related to HNSCC radiotherapy resistance (HNSCC-RR) is helpful in the search for potential therapeutic targets. However, identifying radiotherapy resistance-related genes from tens of thousands of genes is a challenging task. While interactions between genes are important for elucidating complex biological processes, the large number of genes makes the computation of gene interactions infeasible.
    METHODS: We propose a gene selection algorithm, RGIE, which is based on ReliefF, Gene Network Inference with Ensemble of Trees (GENIE3) and Feature Elimination. ReliefF was used to select a feature subset that is discriminative for HNSCC-RR, GENIE3 constructed a gene regulatory network based on this subset to analyze the regulatory relationship among genes, and feature elimination was used to remove redundant and noisy features.
    RESULTS: Nine genes (SPAG1, FIGN, NUBPL, CHMP5, TCF7L2, COQ10B, BSDC1, ZFPM1, GRPEL1) were identified and used to identify HNSCC-RR, which achieved performances of 0.9730, 0.9679, 0.9767, and 0.9885 in terms of accuracy, precision, recall, and AUC, respectively. Finally, qRT-PCR validated the differential expression of the nine signature genes in cell lines (SCC9, SCC9-RR).
    CONCLUSIONS: RGIE is effective in screening genes related to HNSCC-RR. This approach may help guide clinical treatment modalities for patients and develop potential treatments.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在快速发展的生物医学工程领域,假肢的有效实时控制是一个紧迫的研究关注。解决这个问题,当前的研究引入了一种开创性的方法来增强假肢控制系统中的任务识别,结合基于ReliefF的深度神经网络(DNN)方法。本文利用了MILimbEEG数据集,全面丰富的EEG信号来源收集,计算算术平均值(AM)的统计特征,标准偏差(SD),以及各种运动活动的偏度(S)。最高特征选择(SFS),在采用的时域特征中,是使用ReliefF算法执行的。得分最高的DNN-ReliefF开发的模型表现出卓越的性能,实现精度,精度,召回率为97.4%,97.3%,和97.4%,分别。相比之下,传统的DNN模型产生了准确性,精度,召回率为50.8%,51.1%,50.8%,强调通过纳入SFS可能实现的重大改进。这种鲜明的对比凸显了整合ReliefF的变革潜力,将DNN-ReliefF模型定位为实时假肢控制系统即将取得进展的强大平台。
    In the rapidly advancing field of biomedical engineering, effective real-time control of artificial limbs is a pressing research concern. Addressing this, the current study introduces a pioneering method for augmenting task recognition in prosthetic control systems, combining a ReliefF-based Deep Neural Networks (DNNs) approach. This paper has leveraged the MILimbEEG dataset, a comprehensive rich source collection of EEG signals, to calculate statistical features of Arithmetic Mean (AM), Standard Deviation (SD), and Skewness (S) across various motor activities. Supreme Feature Selection (SFS), of the adopted time-domain features, was performed using the ReliefF algorithm. The highest scored DNN-ReliefF developed model demonstrated remarkable performance, achieving accuracy, precision, and recall rates of 97.4 %, 97.3 %, and 97.4 %, respectively. In contrast, a traditional DNN model yielded accuracy, precision, and recall rates of 50.8 %, 51.1 %, and 50.8 %, highlighting the significant improvements made possible by incorporating SFS. This stark contrast underscores the transformative potential of incorporating ReliefF, situating the DNN-ReliefF model as a robust platform for forthcoming advancements in real-time prosthetic control systems.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    牙源性囊肿与其他囊肿的微观诊断差异很复杂,可能会使临床医生和病理学家感到困惑。特别感兴趣的是牙源性角化囊肿(OKC),具有独特组织病理学和临床特征的发育性囊肿。然而,这种囊肿的区别在于其侵袭性和高复发倾向。临床医生在处理这种经常遇到的颌骨病变时遇到挑战,因为在手术治疗上没有共识。因此,这种囊肿的准确和早期诊断将使临床医生在治疗管理方面受益,并使受试者免于遭受侵袭性OKC的精神痛苦,影响他们的生活质量。这项研究的目的是开发一个自动化的OKC诊断系统,可以作为病理学家的决策支持工具,无论他们是在本地工作还是远程工作。该系统将为他们提供额外的数据和见解,以增强他们的决策能力。这项研究旨在提供一个自动化管道来分类OKC和非角化囊肿(非KC:牙科和根性囊肿)的整个幻灯片图像。使用全切片图像(WSI)和深度学习方法对组织进行组织病理学分析的OKC诊断和预后是一个新兴的研究领域。WSI具有以高分辨率放大组织而不丢失信息的独特优势。这项研究的贡献是一部小说,基于深度学习,和有效的算法,减少可训练参数,反过来,内存占用。这是使用名为P-C-ReliefF的卷积神经网络(CNN)中的主成分分析(PCA)和ReliefF特征选择算法(ReliefF)来实现的。与标准CNN相比,该模型减少了可训练参数,达到97%的分类准确率。
    The microscopic diagnostic differentiation of odontogenic cysts from other cysts is intricate and may cause perplexity for both clinicians and pathologists. Of particular interest is the odontogenic keratocyst (OKC), a developmental cyst with unique histopathological and clinical characteristics. Nevertheless, what distinguishes this cyst is its aggressive nature and high tendency for recurrence. Clinicians encounter challenges in dealing with this frequently encountered jaw lesion, as there is no consensus on surgical treatment. Therefore, the accurate and early diagnosis of such cysts will benefit clinicians in terms of treatment management and spare subjects from the mental agony of suffering from aggressive OKCs, which impact their quality of life. The objective of this research is to develop an automated OKC diagnostic system that can function as a decision support tool for pathologists, whether they are working locally or remotely. This system will provide them with additional data and insights to enhance their decision-making abilities. This research aims to provide an automation pipeline to classify whole-slide images of OKCs and non-keratocysts (non-KCs: dentigerous and radicular cysts). OKC diagnosis and prognosis using the histopathological analysis of tissues using whole-slide images (WSIs) with a deep-learning approach is an emerging research area. WSIs have the unique advantage of magnifying tissues with high resolution without losing information. The contribution of this research is a novel, deep-learning-based, and efficient algorithm that reduces the trainable parameters and, in turn, the memory footprint. This is achieved using principal component analysis (PCA) and the ReliefF feature selection algorithm (ReliefF) in a convolutional neural network (CNN) named P-C-ReliefF. The proposed model reduces the trainable parameters compared to standard CNN, achieving 97% classification accuracy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    血清miRNA是可用于癌症筛查的临床样品。肺癌早期血清标志物的鉴定对患者的早期诊断和临床治疗至关重要。从基因表达综合(GEO)下载肺腺癌(LUAD)患者和健康个体的血清miRNA的表达数据。将这些数据标准化并进行差异表达分析以获得差异表达的miRNA(DEmiRNA)。DEmiRNA随后进行了ReliefF特征选择,筛选与癌症密切相关的亚群作为候选特征miRNA。此后,高斯朴素贝叶斯(NB),支持向量机(SVM)基于这些候选特征miRNA构建随机森林(RF)分类器。然后通过NB结合增量特征选择(IFS)构建最佳诊断特征。此后,基于具有最佳预测性能的miRNA,对这些样本进行主成分分析(PCA).最后,我们提取了64例LUAD患者和59例正常个体的外周血miRNA进行qRT-PCR分析,以验证诊断模型在临床检测方面的性能.最后,根据曲线下面积(AUC)和精度值,由miR-5100和miR-663a组成的NB分类器表现出最出色的诊断性能.PCA结果还表明,2-miRNA诊断特征可以有效区分癌症患者和健康个体。最后,临床血清样本的qRT-PCR结果显示,miR-5100和miR-663a在肿瘤样本中的表达明显高于正常样本。2-miRNA诊断特征的AUC为0.968。总之,我们鉴定了血清中的标志物(miR-5100和miR-663a)用于早期LUAD筛查,为开发早期LUAD诊断模型提供思路。
    Serum miRNAs are available clinical samples for cancer screening. Identifying early serum markers in lung cancer (LC) is essential for patients\' early diagnosis and clinical treatment. Expression data of serum miRNAs of lung adenocarcinoma (LUAD) patients and healthy individuals were downloaded from the Gene Expression Omnibus (GEO). These data were normalized and subjected to differential expression analysis to obtain differentially expressed miRNAs (DEmiRNAs). The DEmiRNAs were subsequently subjected to ReliefF feature selection, and subsets closely related to cancer were screened as candidate feature miRNAs. Thereafter, a Gaussian Naive Bayes (NB), Support Vector Machine (SVM), and Random Forest (RF) classifier were constructed based on these candidate feature miRNAs. Then the best diagnostic signature was constructed through NB combined with incremental feature selection (IFS). Thereafter, these samples were subjected to principal component analysis (PCA) based on miRNAs with optimal predictive performance. Finally, the peripheral serum miRNAs of 64 LUAD patients and 59 normal individuals were extracted for qRT-PCR analysis to validate the performance of the diagnostic model in respect of clinical detection. Finally, according to area under the curve (AUC) and accuracy values, the NB classifier composed of miR-5100 and miR-663a manifested the most outstanding diagnostic performance. The PCA results also revealed that the 2-miRNA diagnostic signature could effectively distinguish cancer patients from healthy individuals. Finally, qRT-PCR results of clinical serum samples revealed that miR-5100 and miR-663a expression in tumor samples was remarkably higher than that in normal samples. The AUC of the 2-miRNA diagnostic signature was 0.968. In summary, we identified markers (miR-5100 and miR-663a) in serum for early LUAD screening, providing ideas for developing early LUAD diagnostic models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    基因选择(GS)是特征选择领域的一个重要分支,这在癌症分类中被广泛使用。它提供了对癌症发病机理的重要见解,并使人们能够更深入地了解癌症数据。在癌症分类中,GS本质上是一个多目标优化问题,旨在同时优化分类精度和基因子集大小这两个目标。海洋捕食者算法(MPA)已成功应用于实际应用中,然而,它的随机初始化会导致失明,这可能会对算法的收敛性产生不利影响。此外,指导进化的精英个体是从帕累托解决方案中随机选择的,这可能会降低种群的良好勘探性能。为了克服这些限制,提出了一种具有连续映射初始化和领导者选择策略的多目标改进MPA。在这项工作中,具有ReliefF的新的连续映射初始化克服了后期进化中信息较少的缺陷。此外,具有高斯分布的改进的精英选择机制引导种群朝着更好的帕累托前沿发展。最后,采用了一种有效的突变方法来防止进化停滞。为了评估其有效性,将该算法与9种著名算法进行了比较。在16个数据集上的实验结果表明,该算法能够显著降低数据维数,在大多数高维癌症微阵列数据集上获得最高的分类准确率。
    Gene selection (GS) is an important branch of interest within the field of feature selection, which is widely used in cancer classification. It provides essential insights into the pathogenesis of cancer and enables a deeper understanding of cancer data. In cancer classification, GS is essentially a multi-objective optimization problem, which aims to simultaneously optimize the two objectives of classification accuracy and the size of the gene subset. The marine predator algorithm (MPA) has been successfully employed in practical applications, however, its random initialization can lead to blindness, which may adversely affect the convergence of the algorithm. Furthermore, the elite individuals in guiding evolution are randomly chosen from the Pareto solutions, which may degrade the good exploration performance of the population. To overcome these limitations, a multi-objective improved MPA with continuous mapping initialization and leader selection strategies is proposed. In this work, a new continuous mapping initialization with ReliefF overwhelms the defects with less information in late evolution. Moreover, an improved elite selection mechanism with Gaussian distribution guides the population to evolve towards a better Pareto front. Finally, an efficient mutation method is adopted to prevent evolutionary stagnation. To evaluate its effectiveness, the proposed algorithm was compared with 9 famous algorithms. The experimental results on 16 datasets demonstrate that the proposed algorithm can significantly reduce the data dimension and obtain the highest classification accuracy on most of high-dimension cancer microarray datasets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    萎缩性胃炎(AG)通常由幽门螺杆菌(H。幽门螺杆菌)细菌。如果未经治疗,AG可能会发展成导致胃癌的慢性病,这被认为是全球癌症相关死亡的第三大主要原因。AG的前兆检测对于避免这种情况至关重要。这项工作的重点是位于胃窦的幽门螺杆菌相关感染,其中分类是正常和萎缩性胃炎的二元类。现有工作开发了GoogLeNet的深度卷积神经网络(DCNN),其中包含22层的预训练模型。另一项研究采用了基于Inception模块的GoogLeNet,快速稳健的模糊C均值(FRFCM),和简单的线性迭代聚类(SLIC)超像素算法来识别胃病。带Caffe框架和ResNet-50的GoogLeNet是检测幽门螺杆菌感染的机器学习者。尽管如此,精度可能会随着网络深度的增加而变得丰富。高度预期对当前标准方法的升级以避免可能导致慢性AG的未治疗和不准确的诊断。拟议的工作结合了在DCNN中旋转的改进技术,将池化作为预训练模型和频道混洗,以帮助跨特征频道的信息流,从而简化网络的训练,以实现更深入的CNN。此外,典型相关分析(CCA)特征融合方法和ReliefF特征选择方法旨在改进组合技术。CCA对由预先训练的ShuffleNet生成的显著特征的两个数据集之间的关系进行建模。ReliefF从CCA中减少并选择基本特征,并使用广义加性模型(GAM)进行分类。相信扩展的工作是合理的,测试准确率为98.2%,从而提供了正常和萎缩性胃炎的准确诊断。
    Atrophic gastritis (AG) is commonly caused by the infection of the Helicobacter pylori (H. pylori) bacteria. If untreated, AG may develop into a chronic condition leading to gastric cancer, which is deemed to be the third primary cause of cancer-related deaths worldwide. Precursory detection of AG is crucial to avoid such cases. This work focuses on H. pylori-associated infection located at the gastric antrum, where the classification is of binary classes of normal versus atrophic gastritis. Existing work developed the Deep Convolution Neural Network (DCNN) of GoogLeNet with 22 layers of the pre-trained model. Another study employed GoogLeNet based on the Inception Module, fast and robust fuzzy C-means (FRFCM), and simple linear iterative clustering (SLIC) superpixel algorithms to identify gastric disease. GoogLeNet with Caffe framework and ResNet-50 are machine learners that detect H. pylori infection. Nonetheless, the accuracy may become abundant as the network depth increases. An upgrade to the current standards method is highly anticipated to avoid untreated and inaccurate diagnoses that may lead to chronic AG. The proposed work incorporates improved techniques revolving within DCNN with pooling as pre-trained models and channel shuffle to assist streams of information across feature channels to ease the training of networks for deeper CNN. In addition, Canonical Correlation Analysis (CCA) feature fusion method and ReliefF feature selection approaches are intended to revamp the combined techniques. CCA models the relationship between the two data sets of significant features generated by pre-trained ShuffleNet. ReliefF reduces and selects essential features from CCA and is classified using the Generalized Additive Model (GAM). It is believed the extended work is justified with a 98.2% testing accuracy reading, thus providing an accurate diagnosis of normal versus atrophic gastritis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    镉(Cd)是一种有毒元素,可以在可食用植物组织中积累并对人体健康产生负面影响。传统的Cd量化方法耗时,贵,产生大量有毒废物,减缓减少吸收的方法的发展。这项研究的目的是确定是否可以使用高光谱成像(HSI)和机器学习(ML)来预测使用羽衣甘蓝(甘蓝)和罗勒(罗勒)作为模型作物的植物中的Cd浓度。实验是在自动表型设备中进行的,其中除土壤Cd浓度外,所有环境条件均保持恒定。使用传统方法在收获时确定Cd浓度,并使用从成像传感器收集的数据训练ML模型。还在收获时收集可见/近红外(VNIR)图像,并进行处理以计算在400至998nm之间的473波段的反射率。所有反射光谱都经过特征选择算法ReliefF和主成分分析(PCA)来生成数据并提供输入以评估三种ML分类模型:人工神经网络(ANN),集成学习(EL),和支持向量机(SVM)。根据Cd浓度高于或低于0.2mgkg-1Cd的安全阈值对植物进行分类。Cd检测最高等级的波长在519和574之间,以及692和732nm之间,表明Cd含量可能改变植物的叶绿素含量和叶片内部结构。所有的模型都能把植物分类,尽管F1评分最好的模型是利用所有波长反射率的验证子集的ANN。这项研究表明,HSI和ML是快速,精确诊断绿叶植物中Cd的有前途的技术,虽然需要更多的研究来适应这种方法更复杂的现场环境。
    Cadmium (Cd) is a toxic element that can accumulate in edible plant tissues and negatively impact human health. Traditional Cd quantification methods are time-consuming, expensive, and generate a lot of toxic waste, slowing development of methods to reduce uptake. The objective of this study was to determine whether hyperspectral imaging (HSI) and machine learning (ML) can be used to predict Cd concentrations in plants using kale (Brassica oleracea) and basil (Ocimum basilicum) as model crops. The experiments were conducted in an automated phenotyping facility where all environmental conditions except soil Cd concentration were kept constant. Cd concentrations were determined at harvest using traditional methods and used to train the ML models with data collected from the imaging sensor. Visible/near infrared (VNIR) images were also collected at harvest and processed to calculate reflectance at 473 bands between 400 to 998 nm. All reflectance spectra were subject to the feature selection algorithm ReliefF and Principal Component Analysis (PCA) to generate data and provide input to evaluate three ML classification models: artificial neural network (ANN), ensemble learning (EL), and support vector machine (SVM). Plants were categorized according to Cd concentrations higher or lower than the safety threshold of 0.2 mg kg-1 Cd. Wavelengths with the highest ranks for Cd detection were between 519 and 574, and 692 and 732 nm, indicating that Cd content likely altered the plants\' chlorophyll content and altered leaf internal structure. All models were able to sort the plants into groups, though the model with the best F1 score was the ANN for the validation subset that utilized reflectance from all wavelengths. This study demonstrates that HSI and ML are promising technologies for the fast and precise diagnosis of Cd in leafy green plants, though additional studies are needed to adapt this approach for more complex field environments.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    脑肿瘤是现代世界上最致命的疾病之一。这项研究提出了一种优化的机器学习方法,用于检测和识别脑肿瘤的类型(神经胶质瘤,脑膜瘤,或垂体肿瘤)在使用磁共振成像(MRI)记录的大脑图像中。利用加速鲁棒特征(SURF)提取图像的高斯特征,而其非线性特征是使用KAZE获得的,由于它们对旋转的高性能,缩放,和噪音问题。要检索本地级别的信息,所有脑MRI图像被分割成8×8像素网格。为了提高精度并减少计算时间,采用基于方差的k-means聚类和PSO-ReliefF算法来消除脑MRI图像的冗余特征。最后,使用各种机器学习分类器评估所提出的混合优化特征向量的性能。使用支持向量机(SVM)通过169个特征获得96.30%的精度。此外,与用于训练SVM的非优化特征相比,计算时间也减少到1分钟。这些发现也与以前的研究进行了比较,证明建议的方法可能有助于医生和医生及时发现脑肿瘤。
    Brain tumors are among the deadliest diseases in the modern world. This study proposes an optimized machine-learning approach for the detection and identification of the type of brain tumor (glioma, meningioma, or pituitary tumor) in brain images recorded using magnetic resonance imaging (MRI). The Gaussian features of the image are extracted using speed-up robust features (SURF), whereas its non-linear features are obtained using KAZE, owing to their high performance against rotation, scaling, and noise problems. To retrieve local-level information, all brain MRI images are segmented into an 8 × 8 pixel grid. To enhance the accuracy and reduce the computational time, the variance-based k-means clustering and PSO-ReliefF algorithms are employed to eliminate the redundant features of the brain MRI images. Finally, the performance of the proposed hybrid optimized feature vector is evaluated using various machine learning classifiers. An accuracy of 96.30% is obtained with 169 features using a support vector machine (SVM). Furthermore, the computational time is also reduced to 1 min compared to the non-optimized features used for training of the SVM. The findings are also compared with previous research, demonstrating that the suggested approach might assist physicians and doctors in the timely detection of brain tumors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号