SVM

SVM
  • 文章类型: Journal Article
    淡水资源近年来逐渐盐化,极大地影响了生态系统和人类健康。因此,有必要检测淡水资源的盐度。然而,传统的检测方法难以快速、准确地检测溶液中盐的种类和浓度。本文利用便携式近红外光谱仪对溶液中的盐进行定性判别和定量预测。这项研究是通过添加10种NaCl盐进行的,KCl,MgCl2、CaCl2、Na2CO3、K2CO3、CaCO3、Na2SO4、K2SO4、MgSO4至2mL去离子水中制备单一盐溶液(0.02%-1.00%)总计100套。发现支持向量机(SVM)模型仅在区分溶液中的盐阴离子类别方面有效。偏最小二乘判别分析(PLS-DA)模型,另一方面,可以有效区分溶液中的盐类别,最优模型预测集和交互验证集的准确率分别为98.86%和99.66%,分别。此外,偏最小二乘回归(PLSR)模型可以准确预测NaCl浓度,KCl,MgCl2、CaCl2、Na2CO3、K2CO3、CaCO3、Na2SO4、K2SO4、MgSO4盐溶液。他们的模型交互验证集的确定系数R2分别为0.99、0.99、0.99、0.97、0.99、0.99、0.98、0.99、0.98和0.98。本研究表明,NIRS能够实现溶液中盐类的快速、准确的定性和定量检测,为水资源的安全利用提供了技术支持。
    Freshwater resources have been gradually salinized in recent years, dramatically impacting the ecosystem and human health. Therefore, it is necessary to detect the salinity of freshwater resources. However, traditional detection methods make it difficult to check the type and concentration of salt quickly and accurately in solution. This paper uses a portable near-infrared spectrometer to qualitatively discriminate and quantitatively predict the salt in the solution. The study was carried out by adding ten salts of NaCl, KCl, MgCl2, CaCl2, Na2CO3, K2CO3, CaCO3, Na2SO4, K2SO4, MgSO4 to 2 mL of deionized water to prepare a single salt solution (0.02 %-1.00 %) totaling 100 sets. It was found that the Support vector machine (SVM) model was only effective in discriminating the class of salt anions in the solution. The Partial least squares-discriminant analysis (PLS-DA) model, on the other hand, can effectively discriminate the classes of salt in solution, and the accuracies of the optimal model prediction set and the interactive validation set are 98.86 % and 99.66 %, respectively. Furthermore, the Partial least squares regression (PLSR) models can accurately predict the concentration of NaCl, KCl, MgCl2, CaCl2, Na2CO3, K2CO3, CaCO3, Na2SO4, K2SO4, MgSO4 salt solutions. The coefficients of determination R2 of their model interactive validation sets were 0.99, 0.99, 0.99, 0.97, 0.99, 0.99, 0.98, 0.99, 0.98, and 0.98, respectively. This study shows that NIRS can achieve rapid and accurate qualitative and quantitative detection of salts in solution, which provides technical support for the utilization of safe water resources.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在过去的几年里,COVID-19疫情已经在全球蔓延。人们已经习惯了小说的标准,这包括在家工作,网上聊天,保持自己的清洁,阻止COVID-19的传播。由于这个原因,许多公共场所努力确保他们的访客佩戴适当的口罩,并保持彼此的安全距离。监控人员不可能确保每个人都戴着口罩;自动化解决方案是识别和监控口罩的更好选择,以帮助控制公众行为并减少COVID-19的流行。开发这项技术的动机是需要识别那些露出面孔的人。以前发表的大多数研究出版物都集中在各种方法上。这项研究建立了新的方法,即K-medoids,K-means,和模糊K均值(FKM)来使用图像预处理来获得更好的人脸质量并减少噪声数据。此外,这项研究调查了各种机器学习模型卷积神经网络(CNN)与预训练(DenseNet201,VGG-16和VGG-19)模型,和支持向量机(SVM)的人脸检测。所提出的方法K-medoids与预训练模型DenseNet201的实验结果取得了97.7%的准确率最佳结果。我们的研究结果表明,图像的分割可以提高识别的准确性。更重要的是,当面罩识别工具能够以侧面方式识别面罩时,该面罩识别工具是更有益的。
    Over the last several years, the COVID-19 epidemic has spread over the globe. People have become used to the novel standard, which involves working from home, chatting online, and keeping oneself clean, to stop the spread of COVID-19. Due to this, many public spaces make an effort to make sure that their visitors wear proper face masks and maintain a safe distance from one another. It is impossible for monitoring workers to ensure that everyone is wearing a face mask; automated solutions are a far better option for face mask identification and monitoring to assist control public conduct and reduce the COVID-19 epidemic. The motivation for developing this technology was the need to identify those individuals who uncover their faces. Most of the previously published research publications focused on various methodologies. This study built new methods namely K-medoids, K-means, and Fuzzy K-Means(FKM) to use image pre-processing to get the better quality of the face and reduce the noise data. In addition, this study investigates various machine learning models Convolutional neural networks (CNN) with pre-trained (DenseNet201, VGG-16, and VGG-19) models, and Support Vector Machine (SVM) for the detection of face masks. The experimental results of the proposed method K-medoids with pre-trained model DenseNet201 achieved the 97.7 % accuracy best results for face mask identification. Our research results indicate that the segmentation of images may improve the identification of accuracy. More importantly, the face mask identification tool is more beneficial when it can identify the face mask in a side-on approach.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    飞行员的行为对航空安全至关重要。本研究旨在探讨飞行员的脑电图特征,完善培训评估方法,加强飞行安全措施。对采集到的脑电信号进行初步预处理。在左、右转弯时进行EEG特征分析。涉及β波能量比和香农熵的计算。还量化了飞行员在不同飞行阶段的心理工作量。根据脑电图特征,飞行员的心理工作量通过使用支持向量机(SVM)进行分类。研究结果表明,与巡航阶段相比,左转弯和右转弯期间β波的能量比和香农熵发生了显着变化。此外,发现飞行员的心理工作量在这些转弯阶段有所增加。使用支持向量机检测飞行员的心理工作量,训练集的分类准确率为98.92%,而对于测试集,为93.67%。这项研究对于理解飞行员的心理工作量具有重要意义。
    Pilot behavior is crucial for aviation safety. This study aims to investigate the EEG characteristics of pilots, refine training assessment methodologies, and bolster flight safety measures. The collected EEG signals underwent initial preprocessing. The EEG characteristic analysis was performed during left and right turns, involving the calculation of the energy ratio of beta waves and Shannon entropy. The psychological workload of pilots during different flight phases was quantified as well. Based on the EEG characteristics, the pilots\' psychological workload was classified through the use of a support vector machine (SVM). The study results showed significant changes in the energy ratio of beta waves and Shannon entropy during left and right turns compared to the cruising phase. Additionally, the pilots\' psychological workload was found to have increased during these turning phases. Using support vector machines to detect the pilots\' psychological workload, the classification accuracy for the training set was 98.92%, while for the test set, it was 93.67%. This research holds significant importance in understanding pilots\' psychological workload.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    本研究利用了θ的显著差异,α,和β波段功率谱在脑电图(EEG)中观察到的注意力分散和集中驾驶。三个子任务,视觉分心,听觉分心,和认知分心,被设计为在驾驶模拟过程中随机出现。θ,α,提取了四种驾驶注意状态的脑电信号的β波段功率谱,和SVM,EEGNet,和GRU-EEGNet模型用于检测驾驶注意力状态,分别。进行了在线实验。θ的提取,α,发现EEG信号的β波段功率谱特征比提取整个EEG信号的功率谱特征更有效地检测驾驶注意力状态。提出的GRU-EEGNet模型的驾驶注意力状态检测精度比EEGNet模型和PSD_SVM方法提高了6.3%和12.8%,分别。结合脑电特征和改进的深度学习算法的脑电解码方法,有效提高了驾驶注意力状态检测的准确性,是根据现有研究的结果手动初步选择的。
    The present study utilizes the significant differences in θ, α, and β band power spectra observed in electroencephalograms (EEGs) during distracted versus focused driving. Three subtasks, visual distraction, auditory distraction, and cognitive distraction, were designed to appear randomly during driving simulations. The θ, α, and β band power spectra of the EEG signals of the four driving attention states were extracted, and SVM, EEGNet, and GRU-EEGNet models were employed for the detection of the driving attention states, respectively. Online experiments were conducted. The extraction of the θ, α, and β band power spectrum features of the EEG signals was found to be a more effective method than the extraction of the power spectrum features of the whole EEG signals for the detection of driving attention states. The driving attention state detection accuracy of the proposed GRU-EEGNet model is improved by 6.3% and 12.8% over the EEGNet model and PSD_SVM method, respectively. The EEG decoding method combining EEG features and an improved deep learning algorithm, which effectively improves the driving attention state detection accuracy, was manually and preliminarily selected based on the results of existing studies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    大麻被培养用于治疗和娱乐目的,其中δ-9四氢大麻酚(THC)是其治疗效果的主要目标。随着全球大麻产业和大麻素研究的扩大,用于确定大麻素浓度的更有效和更具成本效益的分析方法将有利于提高效率和最大限度地提高生产率。利用机器学习工具开发基于近红外(NIR)光谱的预测模型,这已经通过准确和灵敏的化学分析得到了验证,如气相色谱(GC)或液相色谱质谱(LCMS),是必不可少的。以往针对脱羧大麻素的大麻素预测模型研究,如THC,而不是天然存在的前体,四氢大麻酚酸(THCA),并利用细磨的大麻花序。目前的研究重点是在收获前建立整个大麻花序中THCA浓度的预测模型,通过采用非破坏性筛选技术,因此中耕者可以实时快速表征高性能品种的化学型,从而有利于有针对性地优化杂交育种工作。使用近红外光谱和LCMS创建预测模型,我们可以区分高THCA和甚至比率类别,预测精度为100%。我们还开发了THCA浓度的预测模型,R2=0.78,预测误差平均值为13%。这项研究证明了便携式手持NIR设备在收获前预测整个大麻样品的THCA浓度的可行性。允许更早地评估大麻素的概况,因此增加了高通量和快速的能力。
    Cannabis is cultivated for therapeutic and recreational purposes where delta-9 tetrahydrocannabinol (THC) is a main target for its therapeutic effects. As the global cannabis industry and research into cannabinoids expands, more efficient and cost-effective analysis methods for determining cannabinoid concentrations will be beneficial to increase efficiencies and maximize productivity. The utilization of machine learning tools to develop near-infrared (NIR) spectroscopy-based prediction models, which have been validated from accurate and sensitive chemical analysis, such as gas chromatography (GC) or liquid chromatography mass spectroscopy (LCMS), is essential. Previous research on cannabinoid prediction models targeted decarboxylated cannabinoids, such as THC, rather than the naturally occurring precursor, tetrahydrocannabinolic acid (THCA), and utilize finely ground cannabis inflorescence. The current study focuses on building prediction models for THCA concentrations in whole cannabis inflorescences prior to harvest, by employing non-destructive screening techniques so cultivators may rapidly characterize high-performing cultivars for chemotype in real time, thus facilitating targeted optimization of crossbreeding efforts. Using NIR spectroscopy and LCMS to create prediction models we can differentiate between high-THCA and even ratio classes with 100% prediction accuracy. We have also developed prediction models for THCA concentration with a R2 = 0.78 with a prediction error average of 13%. This study demonstrates the viability of a portable handheld NIR device to predict THCA concentrations on whole cannabis samples before harvest, allowing the evaluation of cannabinoid profiles to be made earlier, therefore increasing high-throughput and rapid capabilities.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    乳腺癌是全球每年新癌症病例的主要原因之一。它是在乳腺细胞中发展的恶性肿瘤。这种疾病的早期筛查对于防止其转移至关重要。当怀疑这种疾病时,乳房X线照片是目前最常见的筛查工具;所有确定的乳腺病变都不是恶性的。乳腺肿块样本的侵入性细针抽吸(FNA)是临床检查癌性病变的二级筛查工具。染色的抽吸样品的视觉图像分析对细胞学家提出了准确识别恶性细胞的挑战。在内省评估之上制定基于人工智能的客观技术对于避免误诊至关重要。本文介绍了几种基于人工智能(AI)的技术,可从FNA样本的核特征诊断乳腺癌。来自UCI机器学习存储库的威斯康星州乳腺癌数据集(WBCD)适用于此调查。测量重要的统计参数以评估所提出的技术的性能。使用两层前馈神经网络(FFNN)可实现98.10%的最佳检测精度。最后,将开发的算法的性能与文献中的一些最新作品进行了比较。
    Breast cancer is one of the paramount causes of new cancer cases worldwide annually. It is a malignant neoplasm that develops in the breast cells. The early screening of this disease is essential to prevent its metastasis. A mammogram X-ray image is the most common screening tool practiced currently when this disease is suspected; all the breast lesions identified are not malignant. The invasive fine needle aspiration (FNA) of a breast mass sample is the secondary screening tool to clinically examine cancerous lesions. The visual image analysis of the stained aspirated sample imposes a challenge for the cytologist to identify the malignant cells accurately. The formulation of an artificial intelligence-based objective technique on top of the introspective assessment is essential to avoid misdiagnosis. This paper addresses several artificial intelligence (AI)-based techniques to diagnose breast cancer from the nuclear features of FNA samples. The Wisconsin Breast Cancer dataset (WBCD) from the UCI machine learning repository is applied for this investigation. Significant statistical parameters are measured to evaluate the performance of the proposed techniques. The best detection accuracy of 98.10% is achieved with a two-layer feed-forward neural network (FFNN). Finally, the developed algorithm\'s performance is compared with some state-of-the-art works in the literature.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基因选择是微阵列癌症数据分类的重要步骤。基因表达癌症数据(脱氧核糖核酸微阵列)有助于计算各种基因的稳健和同时表达。粒子群优化(PSO)需要简单的运算符和较少的参数来调整基因选择中的模型。选择具有小冗余的预后基因对于研究人员来说是一个巨大的挑战,因为在基于PSO的选择方法中存在一些并发症。在这项研究中,提出了一种新的PSO(自惯性权重自适应PSO)变体。在提出的算法中,探索SIW-APSO-ELM以实现基因选择预测的准确性。该新算法在改进的惯性权重自适应粒子群算法的开发和探索能力之间建立了平衡。采用自惯性权重自适应粒子群优化(SIW-APSO)算法进行求解探索。SIW-APSO中的每个粒子通过进化过程迭代地增加其位置和速度。极限学习机(ELM)是为选择程序而设计的。所提出的方法已用于鉴定癌症数据集中的几个基因。分类算法包含ELM,K-质心最近邻,与微阵列癌症数据集的最新方法相比,支持向量机可获得较高的预测精度,这些方法显示了所提出方法的有效性。
    Gene selection is an essential step for the classification of microarray cancer data. Gene expression cancer data (deoxyribonucleic acid microarray] facilitates in computing the robust and concurrent expression of various genes. Particle swarm optimization (PSO) requires simple operators and less number of parameters for tuning the model in gene selection. The selection of a prognostic gene with small redundancy is a great challenge for the researcher as there are a few complications in PSO based selection method. In this research, a new variant of PSO (Self-inertia weight adaptive PSO) has been proposed. In the proposed algorithm, SIW-APSO-ELM is explored to achieve gene selection prediction accuracies. This novel algorithm establishes a balance between the exploitation and exploration capabilities of the improved inertia weight adaptive particle swarm optimization. The self-inertia weight adaptive particle swarm optimization (SIW-APSO) algorithm is employed for solution explorations. Each particle in the SIW-APSO increases its position and velocity iteratively through an evolutionary process. The extreme learning machine (ELM) has been designed for the selection procedure. The proposed method has been employed to identify several genes in the cancer dataset. The classification algorithm contains ELM, K-centroid nearest neighbor, and support vector machine to attain high forecast accuracy as compared to the start-of-the-art methods on microarray cancer datasets that show the effectiveness of the proposed method.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:之前的研究人员已经确定了MDD患者在功能连接神经影像学特征上的明显差异。然而,VMHC值在MDD患者中的辅助诊断和亚型分化作用尚未完全了解.我们旨在探讨焦虑性MDD或非焦虑性MDD和HCs患者VMHC值的分离能力。
    方法:我们招募了90名焦虑性MDD患者,69例非焦虑性MDD和84例HCs。我们收集了一组临床变量,包括HAMD-17评分,HAMA评分和rs-fMRI数据。数据分析结合差异分析,SVM,相关分析和ROC分析。
    结果:相对于HC,非焦虑性MDD患者在脑岛和PCG中显示出显著较低的VMHC值,和焦虑的MDD患者在小脑_crus2,STG,postCG,MFG和IFG。与非焦虑性MDD患者相比,焦虑MDD显示PCG中VMHC值显著增强.脑岛和小脑_crus2区域的VMHC值显示出更好的区分HCs与非焦虑MDD或焦虑MDD患者的能力。PCG中的VMHC值显示出更好的区分焦虑MDD患者和非焦虑MDD患者的能力。
    结论:脑岛和小脑_crus2区域的VMHC值可以作为影像学标志物,分别区分非焦虑性MDD或焦虑性MDD患者的HC。PCG中的VMHC值可用于区分焦虑MDD患者和非焦虑MDD患者。
    OBJECTIVE: Prior researchers have identified distinct differences in functional connectivity neuroimaging characteristics among MDD patients. However, the auxiliary diagnosis and subtype differentiation roles of VMHC values in MDD patients have yet to be fully understood. We aim to explore the separating ability of VMHC values in patients with anxious MDD or with non-anxious MDD and HCs.
    METHODS: We recruited 90 patients with anxious MDD, 69 patients with non-anxious MDD and 84 HCs. We collected a set of clinical variables included HAMD-17 scores, HAMA scores and rs-fMRI data. The data were analyzed combining difference analysis, SVM, correlation analysis and ROC analysis.
    RESULTS: Relative to HCs, non-anxious MDD patients displayed significant lower VMHC values in the insula and PCG, and anxious MDD patients displayed a significant decrease in VMHC values in the cerebellum_crus2, STG, postCG, MFG and IFG. Compared with non-anxious MDD patients, the anxious MDD showed significant enhanced VMHC values in the PCG. The VMHC values in the insula and cerebellum_crus2 regions showed a better ability to discriminate HCs from patients with non-anxious MDD or with anxious MDD. The VMHC values in PCG showed a better ability to discriminate patients with anxious MDD and non-anxious MDD patients.
    CONCLUSIONS: The VMHC values in the insula and cerebellum_crus2 regions could be served as imaging markers to differentiate HCs from patients with non-anxious MDD or with anxious MDD respectively. And the VMHC values in the PCG could be used to discriminate patients with anxious MDD from the non-anxious MDD patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项初步研究应用了计算机辅助的定量语言分析,以检查基于语言的分类模型在有和没有抑郁症治疗史的母亲(n=140)之间进行区分的有效性(51%和49%,分别)。母亲在与青春期孩子解决问题的互动中被记录下来。成绩单使用基于字典的手动注释和分析,自然语言程序方法(语言查询和字数统计)。评估语言特征对正确分类抑郁症史的重要性,我们使用了具有可解释特征的支持向量机(SVM)。使用经验文献中确定的语言特征,最初的SVM达到了近63%的准确率。仅使用排名最高的前5个SHAP特征的第二个SVM将准确度提高到67.15%。这些发现扩展了现有文献的基础上理解抑郁情绪状态的语言行为,重点关注有和没有抑郁症治疗史的母亲的语言风格及其对儿童发育和抑郁症跨代传播的潜在影响。
    This preliminary study applied a computer-assisted quantitative linguistic analysis to examine the effectiveness of language-based classification models to discriminate between mothers (n = 140) with and without history of treatment for depression (51% and 49%, respectively). Mothers were recorded during a problem-solving interaction with their adolescent child. Transcripts were manually annotated and analyzed using a dictionary-based, natural-language program approach (Linguistic Inquiry and Word Count). To assess the importance of linguistic features to correctly classify history of depression, we used Support Vector Machines (SVM) with interpretable features. Using linguistic features identified in the empirical literature, an initial SVM achieved nearly 63% accuracy. A second SVM using only the top 5 highest ranked SHAP features improved accuracy to 67.15%. The findings extend the existing literature base on understanding language behavior of depressed mood states, with a focus on the linguistic style of mothers with and without a history of treatment for depression and its potential impact on child development and trans-generational transmission of depression.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在过去的25年里,快速的城市化导致喀布尔省的土地利用和土地覆盖(LULC)发生了重大变化,阿富汗。为了评估LULC变化对地表温度(LST)的影响,喀布尔省使用1998年至2022年的Landsat卫星图像应用支持向量机(SVM)算法分为四个LULC类。使用来自热带的Landsat数据评估LST。应用细胞自动机-逻辑回归(CA-LR)模型预测了2034年和2046年LULC和LST的未来模式。结果显示LULC类的显著变化,随着建成区面积增加约9.37%,而裸露的土壤和植被覆盖率下降了7.20%和2.35%,分别,从1998年到2022年。对年度LST的分析表明,建成区的平均LST最高,其次是裸露的土壤和植被。未来的模拟结果表明,预计到2034年和2046年,建成区面积将分别增加到17.08%和23.10%,比2022年的11.23%。同样,LST的模拟结果表明,到2034年和2046年,经历最高LST等级(≥32°C)的区域预计将分别增加到27.01%和43.05%,比2022年的11.21%。结果表明,随着建成区面积的增加和植被覆盖的减少,LST显著增加,揭示了城市化和气温上升之间的直接联系。
    Over the past two and a half decades, rapid urbanization has led to significant land use and land cover (LULC) changes in Kabul province, Afghanistan. To assess the impact of LULC changes on land surface temperature (LST), Kabul province was divided into four LULC classes applying the Support Vector Machine (SVM) algorithm using the Landsat satellite images from 1998 to 2022. The LST was assessed using Landsat data from the thermal band. The Cellular Automata-Logistic Regression (CA-LR) model was applied to predict the future patterns of LULC and LST for 2034 and 2046. Results showed significant changes in LULC classes, as the built-up areas increased about 9.37%, while the bare soil and vegetation cover decreased 7.20% and 2.35%, respectively, from 1998 to 2022. The analysis of annual LST revealed that built-up areas showed the highest mean LST, followed by bare soil and vegetation. The future simulation results indicate an expected increase in built-up areas to 17.08% and 23.10% by 2034 and 2046, respectively, compared to 11.23% in 2022. Similarly, the simulation results for LST indicated that the area experiencing the highest LST class (≥ 32 °C) is expected to increase to 27.01% and 43.05% by 2034 and 2046, respectively, compared to 11.21% in 2022. The results indicate that LST increases considerably as built-up areas increase and vegetation cover decreases, revealing a direct link between urbanization and rising temperatures.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号