dimension reduction

降维
  • 文章类型: Journal Article
    由于高维,冗余,和近红外(NIR)光谱数据的非线性,以及样品的生产面积和品位等属性的影响,这些都会影响样本之间的相似性度量。结合多属性数据信息,提出了一种基于Sinkhorn距离的t分布随机近邻嵌入算法(St-SNE)。首先,引入了Sinkhorn距离,可以解决高维空间中KL发散不对称和数据分布稀疏等问题,从而构造使低维空间类似于高维空间的概率分布。此外,为了解决样本的多属性特征对相似性度量的影响,利用信息熵构造了多属性距离矩阵,然后结合光谱数据的数值矩阵得到混合数据矩阵。为了验证St-SNE算法的有效性,对近红外光谱数据进行降维投影,并与PCA进行比较,LPP,和t-SNE算法。结果表明,St-SNE算法能有效区分具有不同属性信息的样本,并在低维空间中产生了更清晰的样本类别投影边界。然后,我们使用烟草和芒果数据集测试了St-SNE对不同属性的分类性能,并将其与LPP进行比较,t-SNE,UMAP,和Fishert-SNE算法。结果表明,St-SNE算法对不同属性的分类准确率最高。最后,我们将搜索最相似样本的结果与卷烟配方的目标烟草进行了比较,实验表明,与其他算法相比,St-SNE与专家推荐的一致性最高。它可以为产品配方的维护和设计提供强有力的支持。
    Due to the high-dimensionality, redundancy, and non-linearity of the near-infrared (NIR) spectra data, as well as the influence of attributes such as producing area and grade of the sample, which can all affect the similarity measure between samples. This paper proposed a t-distributed stochastic neighbor embedding algorithm based on Sinkhorn distance (St-SNE) combined with multi-attribute data information. Firstly, the Sinkhorn distance was introduced which can solve problems such as KL divergence asymmetry and sparse data distribution in high-dimensional space, thereby constructing probability distributions that make low-dimensional space similar to high-dimensional space. In addition, to address the impact of multi-attribute features of samples on similarity measure, a multi-attribute distance matrix was constructed using information entropy, and then combined with the numerical matrix of spectral data to obtain a mixed data matrix. In order to validate the effectiveness of the St-SNE algorithm, dimensionality reduction projection was performed on NIR spectral data and compared with PCA, LPP, and t-SNE algorithms. The results demonstrated that the St-SNE algorithm effectively distinguishes samples with different attribute information, and produced more distinct projection boundaries of sample category in low-dimensional space. Then we tested the classification performance of St-SNE for different attributes by using the tobacco and mango datasets, and compared it with LPP, t-SNE, UMAP, and Fisher t-SNE algorithms. The results showed that St-SNE algorithm had the highest classification accuracy for different attributes. Finally, we compared the results of searching the most similar sample with the target tobacco for cigarette formulas, and experiments showed that the St-SNE had the highest consistency with the recommendation of the experts than that of the other algorithms. It can provide strong support for the maintenance and design of the product formula.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:在诸如全基因组关联研究之类的生物系统研究中,对多种表型的联合分析对于揭示各种性状与遗传变异之间的功能相互作用至关重要,但是在联合分析的广泛使用中,数据在维度上的增长已经成为一个非常具有挑战性的问题。为了处理变量的过度,我们考虑切片逆回归(SIR)方法。具体来说,我们提出了一种新颖的基于SIR的关联检验,该检验在检验多个预测因子和多个结局之间的关联方面具有良好的鲁棒性.
    结果:我们在具有各种数量的单核苷酸多态性的低维和高维设置中进行模拟研究,并考虑性状的相关结构。仿真结果表明,该方法优于现有方法。我们还成功地将我们的方法应用于ADNI数据集的遗传关联研究。仿真研究和实际数据分析都表明,与竞争对手相比,基于SIR的关联测试是有效的,并且具有更高的效率。
    结论:本文考虑了具有低维和高维反应和基因型的几种情况。我们基于SIR的方法将估计的I型误差控制在预先指定的水平α。
    BACKGROUND: Joint analysis of multiple phenotypes in studies of biological systems such as Genome-Wide Association Studies is critical to revealing the functional interactions between various traits and genetic variants, but growth of data in dimensionality has become a very challenging problem in the widespread use of joint analysis. To handle the excessiveness of variables, we consider the sliced inverse regression (SIR) method. Specifically, we propose a novel SIR-based association test that is robust and powerful in testing the association between multiple predictors and multiple outcomes.
    RESULTS: We conduct simulation studies in both low- and high-dimensional settings with various numbers of Single-Nucleotide Polymorphisms and consider the correlation structure of traits. Simulation results show that the proposed method outperforms the existing methods. We also successfully apply our method to the genetic association study of ADNI dataset. Both the simulation studies and real data analysis show that the SIR-based association test is valid and achieves a higher efficiency compared with its competitors.
    CONCLUSIONS: Several scenarios with low- and high-dimensional responses and genotypes are considered in this paper. Our SIR-based method controls the estimated type I error at the pre-specified level α .
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    提出了一种新的方法来快速预测具有树脂缺失缺陷的碳/环氧复合材料的拉伸强度。使用降维方法和Chebyshev多项式开发了单变量Chebyshev预测模型(UCPM)。为了提高计算效率,减少人工建模工作量,在模型构建过程中使用Python建立了有限元模型的参数化脚本。要验证模型,使用真空辅助树脂灌注(VARI)工艺制备具有不同缺陷尺寸的试样,测试了试样的力学性能,并将模型预测结果与实验结果进行了比较分析。此外,检查了顺序(第9次)对UCPM预测准确性的影响,并使用统计误差评估模型的性能。结果表明,该预测模型具有较高的预测精度,与实验结果相比,最大预测误差为5.20%。较低的订单导致不合适,而增加阶数可以提高UCPM的预测精度。然而,如果订单太高,可能会出现过拟合,导致预测精度下降。
    A novel method is proposed to quickly predict the tensile strength of carbon/epoxy composites with resin-missing defects. The univariate Chebyshev prediction model (UCPM) was developed using the dimension reduction method and Chebyshev polynomials. To enhance the computational efficiency and reduce the manual modeling workload, a parameterization script for the finite element model was established using Python during the model construction process. To validate the model, specimens with different defect sizes were prepared using the vacuum assistant resin infusion (VARI) process, the mechanical properties of the specimens were tested, and the model predictions were analyzed in comparison with the experimental results. Additionally, the impact of the order (second-ninth) on the predictive accuracy of the UCPM was examined, and the performance of the model was evaluated using statistical errors. The results demonstrate that the prediction model has a high prediction accuracy, with a maximum prediction error of 5.20% compared to the experimental results. A low order resulted in underfitting, while increasing the order can improve the prediction accuracy of the UCPM. However, if the order is too high, overfitting may occur, leading to a decrease in the prediction accuracy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    作为物种共存的生态策略,一些物种适应广泛的栖息地,而其他人则专注于特定的环境。这样的“通才”和“专家”通过物种之间复杂的相互作用网络实现正常的生态平衡。然而,在一般的理论框架内,尚未阐明这些相互作用在维持通才和专业物种共存中的作用。这里,我们基于网络降维方法分析了一类共生竞争相互作用生态系统中专家和通才物种共存的生态机制。我们发现,生态专家和通才可以根据他们各自相互作用的数量来识别。我们还发现,使用真实世界的经验网络模拟,生态通才的移除会导致当地生态系统的崩溃,很少观察到生态专家的流失。
    As an ecological strategy for species coexistence, some species adapt to a wide range of habitats, while others specialize in particular environments. Such \'generalists\' and \'specialists\' achieve normal ecological balance through a complex network of interactions between species. However, the role of these interactions in maintaining the coexistence of generalist and specialist species has not been elucidated within a general theoretical framework. Here, we analyze the ecological mechanism for the coexistence of specialist and generalist species in a class of mutualistic and competitive interaction ecosystems based on the network dimension reduction method. We find that ecological specialists and generalists can be identified based on the number of their respective interactions. We also find, using real-world empirical network simulations, that the removal of ecological generalists can lead to the collapse of local ecosystems, which is rarely observed with the loss of ecological specialists.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:酶在维持生物体的生命中起着不可替代的重要作用。酶的酶委员会(EC)编号表明其基本功能。在过去的二十年中,正确识别给定酶的EC数的第一位(家族类别)是热门话题。先前的几种方法采用功能结构域组成来代表酶。然而,它会导致维度灾难,从而降低了方法的效率。另一方面,大多数以前的方法只能处理属于一个家族的酶。事实上,几种酶属于两个或两个以上的家族类别。
    结果:在这项研究中,快速高效的多标签分类器,名为PredictEFC,是设计的。要构造这个分类器,设计了一种新的特征提取方案,用于处理酶的功能域信息,计算训练数据集中七个家族类别中每个功能域条目的分布。基于这个方案,通过融合其功能域信息和以上统计结果,将每种训练或测试酶编码到7维载体中.采用随机k-标签集(RAKEL)来构建分类器,其中随机森林被选择为基本分类算法。在训练数据集上的两个十倍交叉验证结果表明,PredictEFC的准确性可以达到0.8493和0.8370。在两个数据集上的独立测试表明0.9118和0.8777的准确度值。
    结论:PredictEFC的性能略低于直接使用功能结构域组成的分类器。然而,它的效率大大提高。运行时间小于直接使用功能域组成的分类器的时间的十分之一。Inadditional,PredictEFC的效用优于使用传统降维方法和一些以前的方法的分类器,该分类器可以移植用于预测其他物种的酶家族类别。最后,在http://124.221.158.221/上提供的Web服务器设置为易于使用。
    BACKGROUND: Enzymes play an irreplaceable and important role in maintaining the lives of living organisms. The Enzyme Commission (EC) number of an enzyme indicates its essential functions. Correct identification of the first digit (family class) of the EC number for a given enzyme is a hot topic in the past twenty years. Several previous methods adopted functional domain composition to represent enzymes. However, it would lead to dimension disaster, thereby reducing the efficiency of the methods. On the other hand, most previous methods can only deal with enzymes belonging to one family class. In fact, several enzymes belong to two or more family classes.
    RESULTS: In this study, a fast and efficient multi-label classifier, named PredictEFC, was designed. To construct this classifier, a novel feature extraction scheme was designed for processing functional domain information of enzymes, which counting the distribution of each functional domain entry across seven family classes in the training dataset. Based on this scheme, each training or test enzyme was encoded into a 7-dimenion vector by fusing its functional domain information and above statistical results. Random k-labelsets (RAKEL) was adopted to build the classifier, where random forest was selected as the base classification algorithm. The two tenfold cross-validation results on the training dataset shown that the accuracy of PredictEFC can reach 0.8493 and 0.8370. The independent test on two datasets indicated the accuracy values of 0.9118 and 0.8777.
    CONCLUSIONS: The performance of PredictEFC was slightly lower than the classifier directly using functional domain composition. However, its efficiency was sharply improved. The running time was less than one-tenth of the time of the classifier directly using functional domain composition. In additional, the utility of PredictEFC was superior to the classifiers using traditional dimensionality reduction methods and some previous methods, and this classifier can be transplanted for predicting enzyme family classes of other species. Finally, a web-server available at http://124.221.158.221/ was set up for easy usage.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    高通量技术使高维设置变得越来越普遍,为高维调解方法的发展提供了机会。我们旨在通过总结和讨论高维中介分析的最新进展,为使用高维中介分析的研究人员提供有用的指导,并为生物统计学家开发高维中介分析提供思路。当将单个和多个中介分析扩展到高维设置时,该方法仍然面临许多挑战。高维调解方法的发展试图解决这些问题,比如筛选真正的调解员,通过变量选择估计中介效应,降低中介维度来解决变量之间的相关性,并利用复合零假设测试来测试它们。虽然高维调解的这些问题在一定程度上得到了解决,一些挑战依然存在。首先,当选择变量进行中介时,很少考虑中介之间的相关性。第二,在不纳入现有生物学知识的情况下缩小尺度使得结果难以解释。此外,对于高维中介分析中的严格序贯可忽略性假设,仍然缺乏敏感性分析方法。分析师在使用每种方法时需要考虑它们的适用性,而生物统计学家可以考虑方法的扩展和改进。
    High-throughput technologies have made high-dimensional settings increasingly common, providing opportunities for the development of high-dimensional mediation methods. We aimed to provide useful guidance for researchers using high-dimensional mediation analysis and ideas for biostatisticians to develop it by summarizing and discussing recent advances in high-dimensional mediation analysis. The method still faces many challenges when extended single and multiple mediation analyses to high-dimensional settings. The development of high-dimensional mediation methods attempts to address these issues, such as screening true mediators, estimating mediation effects by variable selection, reducing the mediation dimension to resolve correlations between variables, and utilizing composite null hypothesis testing to test them. Although these problems regarding high-dimensional mediation have been solved to some extent, some challenges remain. First, the correlation between mediators are rarely considered when the variables are selected for mediation. Second, downscaling without incorporating prior biological knowledge makes the results difficult to interpret. In addition, a method of sensitivity analysis for the strict sequential ignorability assumption in high-dimensional mediation analysis is still lacking. An analyst needs to consider the applicability of each method when utilizing them, while a biostatistician could consider extensions and improvements in the methodology.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:空间转录组学技术充分利用了空间位置信息,组织形态特征,和转录谱。整合这些数据可以极大地促进我们对形态学背景下细胞生物学的理解。
    方法:我们通过结合图神经网络开发了一种创新的空间聚类方法,称为STGNNks,去噪自动编码器,和k-sum聚类。首先,对空间分辨转录组学数据进行预处理,构造混合邻接矩阵。接下来,通过基于深图信息的图卷积网络,基因表达式和空间上下文被集成以学习点嵌入特征。第三,通过基于零膨胀负二项式(ZINB)的去噪自动编码器,将学习的特征映射到低维空间。第四,结合k均值聚类和比率割聚类算法,开发了一种k和聚类算法来识别空间域。最后,它实现了空间轨迹推断,空间可变基因识别,和差异表达基因检测基于伪时空方法在六个10x基因组铯数据集。
    结果:我们将我们提出的STGNNks方法与其他五种空间聚类方法进行了比较,CCST,Seurat,stLearn,Scanpy和SEDR。第一次,机器学习领域的四个内部指标,也就是说,轮廓系数,戴维斯-博尔丁指数,Caliniski-Harabasz指数,和S_Dbw索引,用于使用CCST测量STGNNks的聚类性能,Seurat,stLearn,Scanpy和SEDR在五个没有标签的空间转录组学数据集上(即,成年小鼠大脑(FFPE),成年小鼠肾脏(FFPE),人类乳腺癌(A组2),人类乳腺癌(FFPE),和人类淋巴结)。并应用两个外部指标,包括调整后的Rand指数(ARI)和归一化互信息(NMI),以真实标签评估上述六种方法对人类乳腺癌的性能(第1部分)。比较实验表明,STGNNks获得了最小的Davies-Bouldin和S_Dbw值和最大的剪影系数,Caliniski-Harabasz,ARI和NMI,显著优于上述五种空间转录组学分析算法。此外,我们在上述5个未标记的数据集上检测到每个簇的前6个空间可变基因和前5个差异表达基因。具有分层布局的伪时空树图显示了人类乳腺癌(A部分1)在从三个浸润性导管癌区域分支到多个导管癌原位癌亚组的三个分支中的进展。
    结论:我们预计STGNNks可以有效改善空间转录组学数据分析,并进一步促进相关疾病的诊断和治疗。这些代码可在https://github.com/plhhnu/STGNNks上公开获得。
    BACKGROUND: Spatial transcriptomics technologies fully utilize spatial location information, tissue morphological features, and transcriptional profiles. Integrating these data can greatly advance our understanding about cell biology in the morphological background.
    METHODS: We developed an innovative spatial clustering method called STGNNks by combining graph neural network, denoising auto-encoder, and k-sums clustering. First, spatial resolved transcriptomics data are preprocessed and a hybrid adjacency matrix is constructed. Next, gene expressions and spatial context are integrated to learn spots\' embedding features by a deep graph infomax-based graph convolutional network. Third, the learned features are mapped to a low-dimensional space through a zero-inflated negative binomial (ZINB)-based denoising auto-encoder. Fourth, a k-sums clustering algorithm is developed to identify spatial domains by combining k-means clustering and the ratio-cut clustering algorithms. Finally, it implements spatial trajectory inference, spatially variable gene identification, and differentially expressed gene detection based on the pseudo-space-time method on six 10x Genomics Visium datasets.
    RESULTS: We compared our proposed STGNNks method with five other spatial clustering methods, CCST, Seurat, stLearn, Scanpy and SEDR. For the first time, four internal indicators in the area of machine learning, that is, silhouette coefficient, the Davies-Bouldin index, the Caliniski-Harabasz index, and the S_Dbw index, were used to measure the clustering performance of STGNNks with CCST, Seurat, stLearn, Scanpy and SEDR on five spatial transcriptomics datasets without labels (i.e., Adult Mouse Brain (FFPE), Adult Mouse Kidney (FFPE), Human Breast Cancer (Block A Section 2), Human Breast Cancer (FFPE), and Human Lymph Node). And two external indicators including adjusted Rand index (ARI) and normalized mutual information (NMI) were applied to evaluate the performance of the above six methods on Human Breast Cancer (Block A Section 1) with real labels. The comparison experiments elucidated that STGNNks obtained the smallest Davies-Bouldin and S_Dbw values and the largest Silhouette Coefficient, Caliniski-Harabasz, ARI and NMI, significantly outperforming the above five spatial transcriptomics analysis algorithms. Furthermore, we detected the top six spatially variable genes and the top five differentially expressed genes in each cluster on the above five unlabeled datasets. And the pseudo-space-time tree plot with hierarchical layout demonstrated a flow of Human Breast Cancer (Block A Section 1) progress in three clades branching from three invasive ductal carcinoma regions to multiple ductal carcinoma in situ sub-clusters.
    CONCLUSIONS: We anticipate that STGNNks can efficiently improve spatial transcriptomics data analysis and further boost the diagnosis and therapy of related diseases. The codes are publicly available at https://github.com/plhhnu/STGNNks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    二维金属有机骨架(MOFs)的演化和形成过程主要来自晶体的各向异性生长,导致光催化性能的变化。实现各向异性电子转移方向和降维策略的协同组合至关重要。在这项研究中,提出了一种通过溶剂分子的配位有效阻止晶体生长增加的新方法,成功合成了无杂质的二维纳米片Zn-PTC,具有出色的析氢反应(HER)性能(15.4mmolg-1h-1)。结构和光物理表征验证了成功防止晶体积聚,同时通过瞬态光谱建立结构各向异性和固有电荷转移模式之间的相关性。这些发现明确地表明,电子转移沿[001]方向起着关键作用的氧化还原性能的纳米Zn-PTC。随后,通过耦合光催化性能和密度泛函理论(DFT)模拟计算,探索了载流子扩散动力学,揭示了沿配体到金属电荷转移(LMCT)方向的有效降维是实现卓越光催化性能的关键。
    The evolution and formation process of two-dimensional metal-organic frameworks (MOFs) primarily arise from the anisotropic growth of crystals, leading to variations in photocatalytic performance. It is crucial to achieve a synergistic combination of anisotropic electron transfer direction and dimension reduction strategies. In this study, a novel approach that effectively blocks crystal growth accretion through the coordination of solvent molecules is presented, achieving the successful synthesis of impurity-free two-dimensional nanosheet Zn-PTC with exceptional hydrogen evolution reaction (HER) performance (15.4 mmol g-1  h-1 ). The structural and photophysical characterizations validate the successful prevention of crystal accretion, while establishing correlation between structural anisotropy and intrinsic charge transfer mode through transient spectroscopy. These findings unequivocally demonstrate that electron transfer along the [001] direction plays a pivotal role in the redox performance of nano-Zn-PTC. Subsequently, by coupling the photocatalytic performance and density functional theory (DFT) simulation calculations, the carrier diffusion kinetics is explored, revealing that effective dimension reduction along the ligand-to-metal charge transfer (LMCT) direction is the key to achieving superior photocatalytic performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项研究提出了一种简单的方法,用于通过因子加载来选择输入变量,并将这些变量输入到小波神经网络(WNN)模型中以预测垂直地面反作用力(vGRF)。使用运动捕获系统和仪表式跑步机收集了9名后足前锋在12、14和16km/h的运动学数据和vGRF。通过因子加载筛选输入变量,并利用WNN预测vGRF。选择了9个运动学变量,对应于九个主成分,主要集中在膝关节和踝关节。vGRF在不同速度下的预测结果是有效和准确的,即,多重相关系数(CMC)>0.98(0.984-0.988),归一化均方根误差(NRMSE)<15%(9.34-11.51%)。冲击力的NRMSE(8.18-10.01%),主动力(4.92-7.42%),峰值时间(7.16-12.52%)小于15%。有一个小数字(峰值,4.12-6.18%;时间,4.71-6.76%)超过使用Bland-Altman方法的95%置信区间(CI)。膝关节是估计vGRF的最佳位置,接着是脚踝。在12、14和16km/h的峰值和峰值时间预测vGRF具有很高的准确性和一致性。因此,因子加载可能是筛选人工神经网络中运动学变量的有效方法。
    This study proposed a simple method for selecting input variables by factor loading and inputting these variables into a wavelet neural network (WNN) model to predict vertical ground reaction force (vGRF). The kinematic data and vGRF of 9 rearfoot strikers at 12, 14, and 16 km/h were collected using a motion capture system and an instrumented treadmill. The input variables were screened by factor loading and utilized to predict vGRF with the WNN. Nine kinematic variables were selected, corresponding to nine principal components, mainly focusing on the knee and ankle joints. The prediction results of vGRF were effective and accurate at different speeds, namely, the coefficient of multiple correlation (CMC) > 0.98 (0.984-0.988), the normalized root means square error (NRMSE) < 15% (9.34-11.51%). The NRMSEs of impact force (8.18-10.01%), active force (4.92-7.42%), and peak time (7.16-12.52%) were less than 15%. There was a small number (peak, 4.12-6.18%; time, 4.71-6.76%) exceeding the 95% confidence interval (CI) using the Bland-Altman method. The knee joint was the optimal location for estimating vGRF, followed by the ankle. There were high accuracy and agreement for predicting vGRF with the peak and peak time at 12, 14, and 16 km/h. Therefore, factor loading could be a valid method to screen kinematic variables in artificial neural networks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    新疆长绒棉以其卓越的品质而闻名。然而,在机械采摘过程中容易受到塑料薄膜的污染。为了解决籽棉薄膜去除棘手的问题,提出了一种基于高光谱图像和AlexNet-PCA的技术来识别籽棉的无色透明薄膜。该方法包括对高光谱图像进行黑白校正,高光谱数据的降维,以及卷积神经网络(CNN)模型的训练和测试。关键技术是找到降低高光谱数据维数的最佳途径,从而降低了计算成本。论文最大的创新是结合CNN和降维方法实现了透明塑料薄膜的高精度智能识别。使用三种降维方法和三种CNN架构进行了实验,以寻求用于塑料薄膜识别的最佳模型。结果表明,AlexNet-PCA-12在降维方面实现了最高的识别精度和性价比。在实际应用中排序测试,本文提出的方法对塑料薄膜的去除率达到97.02%,为籽棉杂聚物的高精度鉴别提供了现代理论模型和有效方法。
    Long-staple cotton from Xinjiang is renowned for its exceptional quality. However, it is susceptible to contamination with plastic film during mechanical picking. To address the issue of tricky removal of film in seed cotton, a technique based on hyperspectral images and AlexNet-PCA is proposed to identify the colorless and transparent film of the seed cotton. The method consists of black and white correction of hyperspectral images, dimensionality reduction of hyperspectral data, and training and testing of convolutional neural network (CNN) models. The key technique is to find the optimal way to reduce the dimensionality of the hyperspectral data, thus reducing the computational cost. The biggest innovation of the paper is the combination of CNNs and dimensionality reduction methods to achieve high-precision intelligent recognition of transparent plastic films. Experiments with three dimensionality reduction methods and three CNN architectures are conducted to seek the optimal model for plastic film recognition. The results demonstrate that AlexNet-PCA-12 achieves the highest recognition accuracy and cost performance in dimensionality reduction. In the practical application sorting tests, the method proposed in this paper achieved a 97.02% removal rate of plastic film, which provides a modern theoretical model and effective method for high-precision identification of heteropolymers in seed cotton.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号