fine-tuning

微调
  • 文章类型: Journal Article
    背景:解码人类基因组序列需要对DNA序列功能性进行全面分析。通过计算和实验方法,研究人员已经研究了基因型与表型的关系,并生成了有助于解开复杂遗传蓝图的重要数据集。因此,最近开发的人工智能方法可以用来解释这些DNA序列的功能。
    方法:本研究探讨了深度学习的使用,特别是预训练的基因组模型,如DNA_bert_6和human_gpt2-v1,在解释和表示人类基因组序列。最初,我们精心构建了多个连接基因型和表型的数据集,以微调这些模型,从而实现精确的DNA序列分类.此外,我们评估了序列长度对分类结果的影响,并使用HERV数据集分析了模型隐藏层中特征提取的影响.为了增强我们对模型识别的表型特异性模式的理解,我们进行浓缩,具有高平均局部代表权重(ALRW)评分的人内源性逆转录病毒(HERV)序列中特定基序的致病性和保守性分析。
    结果:我们构建了多个基因型-表型数据集,与随机基因组序列相比,这些数据集显示出值得称道的分类性能,特别是在HERV数据集中,实现了二进制和多分类精度,F1值分别超过0.935和0.888。值得注意的是,HERV数据集的微调不仅提高了我们识别和区分DNA序列中不同信息类型的能力,而且还成功地在ALRW评分较高的区域中识别出与神经系统疾病和癌症相关的特定基序.随后对这些基序的分析揭示了物种对环境压力的适应性反应及其与病原体的共同进化。
    结论:这些发现突出了预先训练的基因组模型在学习DNA序列表征方面的潜力。特别是在利用HERV数据集时,并为未来的研究工作提供有价值的见解。这项研究代表了一种创新的策略,将预先训练的基因组模型表示与分析基因组序列功能的经典方法相结合。从而促进基因组学和人工智能之间的交叉受精。
    BACKGROUND: Decoding human genomic sequences requires comprehensive analysis of DNA sequence functionality. Through computational and experimental approaches, researchers have studied the genotype-phenotype relationship and generate important datasets that help unravel complicated genetic blueprints. Thus, the recently developed artificial intelligence methods can be used to interpret the functions of those DNA sequences.
    METHODS: This study explores the use of deep learning, particularly pre-trained genomic models like DNA_bert_6 and human_gpt2-v1, in interpreting and representing human genome sequences. Initially, we meticulously constructed multiple datasets linking genotypes and phenotypes to fine-tune those models for precise DNA sequence classification. Additionally, we evaluate the influence of sequence length on classification results and analyze the impact of feature extraction in the hidden layers of our model using the HERV dataset. To enhance our understanding of phenotype-specific patterns recognized by the model, we perform enrichment, pathogenicity and conservation analyzes of specific motifs in the human endogenous retrovirus (HERV) sequence with high average local representation weight (ALRW) scores.
    RESULTS: We have constructed multiple genotype-phenotype datasets displaying commendable classification performance in comparison with random genomic sequences, particularly in the HERV dataset, which achieved binary and multi-classification accuracies and F1 values exceeding 0.935 and 0.888, respectively. Notably, the fine-tuning of the HERV dataset not only improved our ability to identify and distinguish diverse information types within DNA sequences but also successfully identified specific motifs associated with neurological disorders and cancers in regions with high ALRW scores. Subsequent analysis of these motifs shed light on the adaptive responses of species to environmental pressures and their co-evolution with pathogens.
    CONCLUSIONS: These findings highlight the potential of pre-trained genomic models in learning DNA sequence representations, particularly when utilizing the HERV dataset, and provide valuable insights for future research endeavors. This study represents an innovative strategy that combines pre-trained genomic model representations with classical methods for analyzing the functionality of genome sequences, thereby promoting cross-fertilization between genomics and artificial intelligence.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    微调是迁移学习中的一项重要技术,在缺乏训练数据的任务中取得了显著的成功。然而,由于当源域和目标域之间的数据分布差异较大时,难以提取单源域微调的有效特征,我们提出了一种基于多源域的迁移学习框架,称为自适应多源域协作微调(AMCF)。AMCF利用多个源域模型进行协作微调,从而提高模型在目标任务中的特征提取能力。具体来说,AMCF采用自适应多源域层选择策略,为多个源域模型中的目标任务定制合适的层微调方案,旨在提取更有效的特征。此外,设计了一种新的多源域协同损失函数,便于各源域模型精确提取目标数据特征。同时,它致力于最小化各种源域模型之间的输出差异,增强了源域模型对目标数据的适应性。为了验证AMCF的有效性,它适用于迁移学习中常用的七个公共视觉分类数据集,并与最广泛使用的单源域微调方法进行了比较。实验结果表明,与现有的微调方法相比,我们的方法不仅提高了模型中特征提取的准确性,而且为目标任务提供了精确的层微调方案,从而显著提高微调性能。
    Fine-tuning is an important technique in transfer learning that has achieved significant success in tasks that lack training data. However, as it is difficult to extract effective features for single-source domain fine-tuning when the data distribution difference between the source and the target domain is large, we propose a transfer learning framework based on multi-source domain called adaptive multi-source domain collaborative fine-tuning (AMCF) to address this issue. AMCF utilizes multiple source domain models for collaborative fine-tuning, thereby improving the feature extraction capability of model in the target task. Specifically, AMCF employs an adaptive multi-source domain layer selection strategy to customize appropriate layer fine-tuning schemes for the target task among multiple source domain models, aiming to extract more efficient features. Furthermore, a novel multi-source domain collaborative loss function is designed to facilitate the precise extraction of target data features by each source domain model. Simultaneously, it works towards minimizing the output difference among various source domain models, thereby enhancing the adaptability of the source domain model to the target data. In order to validate the effectiveness of AMCF, it is applied to seven public visual classification datasets commonly used in transfer learning, and compared with the most widely used single-source domain fine-tuning methods. Experimental results demonstrate that, in comparison with the existing fine-tuning methods, our method not only enhances the accuracy of feature extraction in the model but also provides precise layer fine-tuning schemes for the target task, thereby significantly improving the fine-tuning performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    加密流量的广泛使用对网络管理和网络安全提出了挑战。传统的基于机器学习的加密流量分类方法不再满足管理和安全的需求。深度学习技术在加密流量分类中的应用显著提高了模型的准确性。本研究主要关注网络分析和网络安全领域的加密流量分类。为解决现有基于深度学习的加密流量分类方法在计算内存消耗和可解释性方面的不足,我们介绍了一种参数有效的微调方法,用于有效地调整加密流量分类模型的参数。对各种分类场景进行了实验,包括Tor流量服务分类和恶意流量分类,使用多个公共数据集。与最先进的深度学习模型架构进行了公平的比较。结果表明,所提出的方法显着降低了微调参数的规模和计算资源的使用,同时实现了与现有最佳模型相当的性能。此外,通过分析预训练模型的参数和结构,解释了预训练模型中加密流量表示的学习机制。这一比较验证了该模型表现出层次结构的假设,组织清晰,和独特的特征。
    The widespread use of encrypted traffic poses challenges to network management and network security. Traditional machine learning-based methods for encrypted traffic classification no longer meet the demands of management and security. The application of deep learning technology in encrypted traffic classification significantly improves the accuracy of models. This study focuses primarily on encrypted traffic classification in the fields of network analysis and network security. To address the shortcomings of existing deep learning-based encrypted traffic classification methods in terms of computational memory consumption and interpretability, we introduce a Parameter-Efficient Fine-Tuning method for efficiently tuning the parameters of an encrypted traffic classification model. Experimentation is conducted on various classification scenarios, including Tor traffic service classification and malicious traffic classification, using multiple public datasets. Fair comparisons are made with state-of-the-art deep learning model architectures. The results indicate that the proposed method significantly reduces the scale of fine-tuning parameters and computational resource usage while achieving performance comparable to that of the existing best models. Furthermore, we interpret the learning mechanism of encrypted traffic representation in the pre-training model by analyzing the parameters and structure of the model. This comparison validates the hypothesis that the model exhibits hierarchical structure, clear organization, and distinct features.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    生成大语言模型(LLM)在各种自然语言处理任务中取得了显著的成功。包括问答(QA)和对话系统。然而,大多数模型都是在英语数据上训练的,在提供中文答案方面缺乏很强的泛化能力。这种局限性在中医QA等专业领域尤为明显,由于缺乏微调和高质量数据集,性能受到影响。为了解决这个问题,我们介绍MedChatZH,基于LLaMA架构的变压器解码器优化的中医QA对话模型。继续对精选的中文医学书籍语料库进行预培训,然后使用精心挑选的医学指导数据集进行微调,在现实世界的医学对话数据集上,MedChatZH的表现优于几个中文对话基线。我们的模型,代码,和数据集在GitHub(https://github.com/tyang816/MedChatZH)上公开提供,以鼓励对中药和LLM的进一步研究。
    Generative Large Language Models (LLMs) have achieved significant success in various natural language processing tasks, including Question-Answering (QA) and dialogue systems. However, most models are trained on English data and lack strong generalization in providing answers in Chinese. This limitation is especially evident in specialized domains like traditional Chinese medical QA, where performance suffers due to the absence of fine-tuning and high-quality datasets. To address this, we introduce MedChatZH, a dialogue model optimized for Chinese medical QA based on transformer decoder with LLaMA architecture. Continued pre-training on a curated corpus of Chinese medical books is followed by fine-tuning with a carefully selected medical instruction dataset, resulting in MedChatZH outperforming several Chinese dialogue baselines on a real-world medical dialogue dataset. Our model, code, and dataset are publicly available on GitHub (https://github.com/tyang816/MedChatZH) to encourage further research in traditional Chinese medicine and LLMs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    昆虫对环境的适应依赖于一系列由分子和生理过程控制的复杂行为。在过去的几十年里,越来越多的研究揭示了非编码RNA(ncRNA)在调节昆虫行为中的作用。ncRNAs通过快速响应环境刺激在昆虫的行为可塑性中承担特别关键的作用。ncRNA还通过微调靶基因的表达有助于维持昆虫的体内平衡。然而,尚未对ncRNAs在调节昆虫行为中的作用进行全面审查。这里,我们介绍了我们对ncRNAs如何调节各种昆虫行为的理解的最新进展,包括飞行和运动,社会行为,繁殖,学习和记忆,和喂养。我们完善了ncRNAs调节神经功能的复杂机制,电机,生殖,和其他生理系统,以及果蝇等昆虫的基因表达,社会性昆虫,蝗虫,还有蚊子.此外,我们讨论了ncRNA介导的昆虫行为未来研究的潜在途径。
    The adaptation of insects to environments relies on a sophisticated set of behaviors controlled by molecular and physiological processes. Over the past several decades, accumulating studies have unveiled the roles of non-coding RNAs (ncRNAs) in regulating insect behaviors. ncRNAs assume particularly pivotal roles in the behavioral plasticity of insects by rapidly responding to environmental stimuli. ncRNAs also contribute to the maintenance of homeostasis of insects by fine-tuning the expression of target genes. However, a comprehensive review of ncRNAs\' roles in regulating insect behaviors has yet to be conducted. Here, we present the recent progress in our understanding of how ncRNAs regulate various insect behaviors, including flight and movement, social behavior, reproduction, learning and memory, and feeding. We refine the intricate mechanisms by which ncRNAs modulate the function of neural, motor, reproductive, and other physiological systems, as well as gene expression in insects like fruit flies, social insects, locusts, and mosquitos. Furthermore, we discuss potential avenues for future studies in ncRNA-mediated insect behaviors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    介绍:目前,肝癌的发病率每年都在上升。肝肿瘤的精确识别对于临床医生制定治疗策略和对抗肝癌至关重要。到目前为止,肝肿瘤轮廓是通过劳动密集型和主观的手动标记得出的。计算机在肝肿瘤分割领域得到了广泛的应用。尽管如此,肝肿瘤分割仍然是一个巨大的挑战,由于不同的体积范围,形状,和遇到的图像强度。方法:在本文中,我们介绍了一种创新的解决方案,称为注意力连接网络(AC-Net)设计用于自动肝肿瘤分割。建立在U型网络架构上,我们的方法包含2个关键注意模块:轴向注意模块(AAM)和视觉转换模块(VTM),它取代了传统的跳过连接以无缝集成空间特征。AAM通过计算跨特征图的轴向注意力来促进特征融合,虽然VTM在最低分辨率的特征地图上运行,采用多头自我注意,并将输出重塑为特征图,以进行后续连接。此外,我们采用适合我们方法的专门损失函数。我们的方法从使用LiTS2017数据集进行预训练AC-Net开始,随后使用来自湖北省肿瘤医院的计算机断层扫描(CT)和磁共振成像(MRI)数据对其进行微调。结果:AC-Net在CT数据上的性能指标如下:骰子相似系数(DSC)为0.90,Jaccard系数(JC)为0.82,召回率为0.92,平均对称表面距离(ASSD)为4.59,Hausdorff距离(HD)为11.96,精度为0.89。对于MRI数据上的AC-Net,指标为DSC为0.80,JC为0.70,召回率为0.82,ASSD为7.58,HD为30.26,精度为0.84。结论:比较实验表明,在湖北省肿瘤医院数据集上进行测试时,AC-Net表现出出色的肿瘤识别准确性。在实际临床应用中表现出高度的竞争力。此外,消融实验为本文提出的每个模块的功效提供了确凿的证据。对于那些感兴趣的人,可以在以下GitHub存储库中访问此研究文章的代码:https://github.com/killian-zero/py_tumor-segmentation。git.
    Introduction: Currently, the incidence of liver cancer is on the rise annually. Precise identification of liver tumors is crucial for clinicians to strategize the treatment and combat liver cancer. Thus far, liver tumor contours have been derived through labor-intensive and subjective manual labeling. Computers have gained widespread application in the realm of liver tumor segmentation. Nonetheless, liver tumor segmentation remains a formidable challenge owing to the diverse range of volumes, shapes, and image intensities encountered. Methods: In this article, we introduce an innovative solution called the attention connect network (AC-Net) designed for automated liver tumor segmentation. Building upon the U-shaped network architecture, our approach incorporates 2 critical attention modules: the axial attention module (AAM) and the vision transformer module (VTM), which replace conventional skip-connections to seamlessly integrate spatial features. The AAM facilitates feature fusion by computing axial attention across feature maps, while the VTM operates on the lowest resolution feature maps, employing multihead self-attention, and reshaping the output into a feature map for subsequent concatenation. Furthermore, we employ a specialized loss function tailored to our approach. Our methodology begins with pretraining AC-Net using the LiTS2017 dataset and subsequently fine-tunes it using computed tomography (CT) and magnetic resonance imaging (MRI) data sourced from Hubei Cancer Hospital. Results: The performance metrics for AC-Net on CT data are as follows: dice similarity coefficient (DSC) of 0.90, Jaccard coefficient (JC) of 0.82, recall of 0.92, average symmetric surface distance (ASSD) of 4.59, Hausdorff distance (HD) of 11.96, and precision of 0.89. For AC-Net on MRI data, the metrics are DSC of 0.80, JC of 0.70, recall of 0.82, ASSD of 7.58, HD of 30.26, and precision of 0.84. Conclusion: The comparative experiments highlight that AC-Net exhibits exceptional tumor recognition accuracy when tested on the Hubei Cancer Hospital dataset, demonstrating highly competitive performance for practical clinical applications. Furthermore, the ablation experiments provide conclusive evidence of the efficacy of each module proposed in this article. For those interested, the code for this research article can be accessed at the following GitHub repository: https://github.com/killian-zero/py_tumor-segmentation.git.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:各种AI软件之间存在明显的自动分割不一致。在这项研究中,我们开发了一种新颖的卷积神经网络(CNN)微调工作流程,以实现精确和鲁棒的局部分割。
    方法:数据集包括湖北省肿瘤医院数据集,西妥昔单抗头颈部公共数据集,和魁北克公共数据集。七个危险器官(OAR),包括脑干,左腮腺,食道,左视神经,视神经交叉,下颌骨,和咽部收缩,被选中。首先将来自四个商业AI软件的自动分割结果与手动描绘进行比较。然后在湖北省肿瘤医院的40个样本和10个样本上训练并测试了带有注意力模块的新的多尺度轻量级残差CNN模型(命名为HN-Net),分别。为了提高网络的准确性和泛化能力,微调工作流程利用不确定性估计方法,从西妥昔单抗头颈公共数据集中自动选择具有价值的候选样本,以进行进一步培训.在湖北省肿瘤医院数据集和/或整个魁北克公共数据集上评估分割性能。
    结果:通过四个AI软件观察到七个OAR的平均Dice值和Hausdorff距离值的最大差异为0.13和0.7mm。拟议的HN-Net实现了比AI软件高0.14的平均Dice值,它的表现也优于其他流行的CNN模型(HN-Net:0.79,U-Net:0.78,U-Net++:0.78,U-Net-Multi-scale:0.77,AI软件:0.65)。此外,使用本地数据集和外部公共数据集的HN-Net微调工作流进一步改进了自动分割,平均Dice值增加了0.02。
    结论:商业AI软件的描述需要仔细审查,本地化的进一步培训对于临床实践是必要的。通过使用本地数据集和外部公共数据集,可以合理地采用所提出的微调工作流程来实现准确且可靠的自动分割模型。
    OBJECTIVE: Obvious inconsistencies in auto-segmentations exist among various AI software. In this study, we have developed a novel convolutional neural network (CNN) fine-tuning workflow to achieve precise and robust localized segmentation.
    METHODS: The datasets include Hubei Cancer Hospital dataset, Cetuximab Head and Neck Public Dataset, and Québec Public Dataset. Seven organs-at-risks (OARs), including brain stem, left parotid gland, esophagus, left optic nerve, optic chiasm, mandible, and pharyngeal constrictor, were selected. The auto-segmentation results from four commercial AI software were first compared with the manual delineations. Then a new multi-scale lightweight residual CNN model with an attention module (named as HN-Net) was trained and tested on 40 samples and 10 samples from Hubei Cancer Hospital, respectively. To enhance the network\'s accuracy and generalization ability, the fine-tuning workflow utilized an uncertainty estimation method for automatic selection of candidate samples of worthiness from Cetuximab Head and Neck Public Dataset for further training. The segmentation performances were evaluated on the Hubei Cancer Hospital dataset and/or the entire Québec Public Dataset.
    RESULTS: A maximum difference of 0.13 and 0.7 mm in average Dice value and Hausdorff distance value for the seven OARs were observed by four AI software. The proposed HN-Net achieved an average Dice value of 0.14 higher than that of the AI software, and it also outperformed other popular CNN models (HN-Net: 0.79, U-Net: 0.78, U-Net++: 0.78, U-Net-Multi-scale: 0.77, AI software: 0.65). Additionally, the HN-Net fine-tuning workflow by using the local datasets and external public datasets further improved the automatic segmentation with the average Dice value by 0.02.
    CONCLUSIONS: The delineations of commercial AI software need to be carefully reviewed, and localized further training is necessary for clinical practice. The proposed fine-tuning workflow could be feasibly adopted to implement an accurate and robust auto-segmentation model by using local datasets and external public datasets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    DNA5-甲基胞嘧啶(5mC)广泛存在于多细胞真核生物中,在各种发育和生理过程以及各种人类疾病中起着重要作用。因此,准确检测5mC位点至关重要。虽然目前的测序技术可以绘制全基因组5mC位点,这些实验方法既昂贵又耗时。为了实现对5mC位点的快速准确预测,我们提出了一种新的计算方法,BERT-5mC.首先,我们通过使用人类启动子序列作为语言语料库来预训练特定领域的BERT(来自转换器的双向编码器表示)模型。BERT是一种基于Transformer的深度双向语言表示模型。第二,我们基于5mC训练数据集对特定领域的BERT模型进行了微调,以构建模型。交叉验证结果表明,我们的模型实现了0.966的AUROC,高于其他最先进的方法,如iPromoter-5mC,5mC_Pred,和BiLSTM-5mC。此外,我们的模型是在独立测试集上评估的,这表明我们的模型实现了0.966的AUROC,也高于其他最先进的方法。此外,我们分析了BERT产生的注意力权重,以鉴定与5mC修饰密切相关的多个核苷酸分布.为了便于使用我们的模型,我们建立了一个网络服务器,可以自由访问:http://5mc-pred。zhulab.org.cn.
    DNA 5-methylcytosine (5mC) is widely present in multicellular eukaryotes, which plays important roles in various developmental and physiological processes and a wide range of human diseases. Thus, it is essential to accurately detect the 5mC sites. Although current sequencing technologies can map genome-wide 5mC sites, these experimental methods are both costly and time-consuming. To achieve a fast and accurate prediction of 5mC sites, we propose a new computational approach, BERT-5mC. First, we pre-trained a domain-specific BERT (bidirectional encoder representations from transformers) model by using human promoter sequences as language corpus. BERT is a deep two-way language representation model based on Transformer. Second, we fine-tuned the domain-specific BERT model based on the 5mC training dataset to build the model. The cross-validation results show that our model achieves an AUROC of 0.966 which is higher than other state-of-the-art methods such as iPromoter-5mC, 5mC_Pred, and BiLSTM-5mC. Furthermore, our model was evaluated on the independent test set, which shows that our model achieves an AUROC of 0.966 that is also higher than other state-of-the-art methods. Moreover, we analyzed the attention weights generated by BERT to identify a number of nucleotide distributions that are closely associated with 5mC modifications. To facilitate the use of our model, we built a webserver which can be freely accessed at: http://5mc-pred.zhulab.org.cn.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    情感识别是情感计算中的一个关键研究课题,由于其在各个领域的潜在应用。目前,基于深度学习框架的基于脑电图(EEG)信号的情感识别方法已经证明了有效的应用并取得了令人印象深刻的性能。然而,在基于EEG的情感识别中,由于受试者之间的个体差异,跨受试者EEG情绪识别的性能显着下降。为了应对这一挑战,提出了一种混合迁移学习策略,并且具有少量微调网络(DFF-Net)的域自适应设计用于跨主体的EEG情感识别。第一步涉及专门用于EEG情感识别的领域自适应学习模块的设计,被称为Emo-DA模块。在此之后,Emo-DA模块用于在源和目标域上预训练模型。随后,在目标域上进行微调,专门用于跨主题EEG情感识别测试。这种全面的方法有效地利用了域适应和微调的属性,在跨主题脑电情绪识别这一具有挑战性的任务中,模型的准确性得到了显著提高。提出的DFF-Net超越了跨学科EEG情感识别任务中的最新方法,SEED数据集上的平均识别准确率为93.37%,SEED-IV数据集上的平均识别准确率为82.32%。
    Emotion recognition constitutes a pivotal research topic within affective computing, owing to its potential applications across various domains. Currently, emotion recognition methods based on deep learning frameworks utilizing electroencephalogram (EEG) signals have demonstrated effective application and achieved impressive performance. However, in EEG-based emotion recognition, there exists a significant performance drop in cross-subject EEG Emotion recognition due to inter-individual differences among subjects. In order to address this challenge, a hybrid transfer learning strategy is proposed, and the Domain Adaptation with a Few-shot Fine-tuning Network (DFF-Net) is designed for cross-subject EEG emotion recognition. The first step involves the design of a domain adaptive learning module specialized for EEG emotion recognition, known as the Emo-DA module. Following this, the Emo-DA module is utilized to pre-train a model on both the source and target domains. Subsequently, fine-tuning is performed on the target domain specifically for the purpose of cross-subject EEG emotion recognition testing. This comprehensive approach effectively harnesses the attributes of domain adaptation and fine-tuning, resulting in a noteworthy improvement in the accuracy of the model for the challenging task of cross-subject EEG emotion recognition. The proposed DFF-Net surpasses the state-of-the-art methods in the cross-subject EEG emotion recognition task, achieving an average recognition accuracy of 93.37% on the SEED dataset and 82.32% on the SEED-IV dataset.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    活力是影响水稻产量和品质的重要因素之一。快速准确地检测水稻种子活力对水稻生产具有重要意义。在这项研究中,利用近红外高光谱成像技术和迁移学习相结合的方法检测水稻种子活力。研究了4个人工老化水稻种子品种(永优12、永优1540、苏香精100、龙井优1212)。建立不同的卷积神经网络(CNN)模型来检测水稻种子的活力。两种转移策略,微调和MixStyle,用于在不同水稻品种之间传递知识以进行活力检测。实验结果表明,永优12的卷积神经网络模型通过MixStyle迁移知识对永优1540、苏相精100、龙景优1212的活力进行分类,精度达到90.00%,80.33%,和85.00%的验证集,分别,与每个品种的初始建模性能更好或接近。MixStyle统计基于跨源域训练样本的概率混合实例级特征。训练实例时,可以合成新的域,这增加了源域的域多样性,从而提高训练模型的泛化能力。这项研究将有助于快速,准确地检测大型作物种子。
    Vigor is one of the important factors that affects rice yield and quality. Rapid and accurate detection of rice seed vigor is of great importance for rice production. In this study, near-infrared hyperspectral imaging technique and transfer learning were combined to detect rice seed vigor. Four varieties of artificial-aged rice seeds (Yongyou12, Yongyou1540, Suxiangjing100, and Longjingyou1212) were studied. Different convolutional neural network (CNN) models were built to detect the vigor of the rice seeds. Two transfer strategies, fine-tuning and MixStyle, were used to transfer knowledge among different rice varieties for vigor detection. The experimental results showed that the convolutional neural network model of Yongyou12 classified the vigor of Yongyou1540, Suxiangjing100, and Longjingyou1212 through MixStyle transfer knowledge, and the accuracy reached 90.00%, 80.33%, and 85.00% in validation sets, respectively, which was better or close to the initial modeling performances of each variety. MixStyle statistics are based on probabilistic mixed instance-level features of cross-source domain training samples. When training instances, new domains can be synthesized, which increases the domain diversity of the source domain, thereby improving the generalization ability of the trained model. This study would help rapid and accurate detection of a large varieties of crop seeds.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号