language model

语言模型
  • 文章类型: Journal Article
    分子识别特征(MoRFs)是无序蛋白质的特定功能片段,在调节无膜细胞器的相变中起着至关重要的作用,并且经常充当细胞相互作用网络的中心位点。随着无序蛋白质和严重疾病之间的联系不断被发现,识别MoRFs已经变得越来越重要。由于实验验证的MoRF数量有限,现有MoRF预测算法的性能不够好,仍需改进。在这项研究中,我们提出了一个名为MoRF_ESM的模型,它利用深度学习蛋白质表示来预测无序蛋白质中的MoRFs。该方法采用预训练的ESM-2蛋白质语言模型来生成注意力图矩阵形式的残基的嵌入表示。这些表示与自学习的TextCNN模型相结合,用于特征提取和预测。此外,在MoRF_ESM模型的末尾加入了平均步骤,以细化输出并生成最终预测结果。与基准数据集上其他令人印象深刻的方法相比,MoRF_ESM方法展示了最先进的性能,当在TEST1上测试时,实现[公式:参见文本]比其他方法更高的AUC,并且当在TEST2上测试时,实现[公式:参见文本]比其他方法更高的AUC。这些结果表明,ESM-2和TextCNN的组合可以有效地提取与蛋白质结构和功能相关的深层进化特征,同时捕获位于蛋白质序列中的浅层模式特征,并且很好地胜任了MoRFs的预测任务。鉴于ESM-2是一种高度通用的蛋白质语言模型,本研究中提出的方法可以很容易地应用于涉及蛋白质序列分类的其他任务。
    Molecular recognition features (MoRFs) are particular functional segments of disordered proteins, which play crucial roles in regulating the phase transition of membrane-less organelles and frequently serve as central sites in cellular interaction networks. As the association between disordered proteins and severe diseases continues to be discovered, identifying MoRFs has gained growing significance. Due to the limited number of experimentally validated MoRFs, the performance of existing MoRF\'s prediction algorithms is not good enough and still needs to be improved. In this research, we present a model named MoRF_ESM, which utilizes deep-learning protein representations to predict MoRFs in disordered proteins. This approach employs a pretrained ESM-2 protein language model to generate embedding representations of residues in the form of attention map matrices. These representations are combined with a self-learned TextCNN model for feature extraction and prediction. In addition, an averaging step was incorporated at the end of the MoRF_ESM model to refine the output and generate final prediction results. In comparison to other impressive methods on benchmark datasets, the MoRF_ESM approach demonstrates state-of-the-art performance, achieving [Formula: see text] higher AUC than other methods when tested on TEST1 and achieving [Formula: see text] higher AUC than other methods when tested on TEST2. These results imply that the combination of ESM-2 and TextCNN can effectively extract deep evolutionary features related to protein structure and function, along with capturing shallow pattern features located in protein sequences, and is well qualified for the prediction task of MoRFs. Given that ESM-2 is a highly versatile protein language model, the methodology proposed in this study can be readily applied to other tasks involving the classification of protein sequences.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    近几十年来,抗体已经成为对抗疾病不可或缺的疗法,尤其是病毒感染。然而,有限的结构信息和劳动密集型的工程过程阻碍了它们的发展。幸运的是,深度学习方法的重大进步通过利用同源蛋白质的共同进化信息,促进了蛋白质结构和功能的精确预测。尽管取得了这些进展,由于其独特的进化和抗原结合区的高度灵活性,预测抗体的构象仍然具有挑战性.这里,为了应对这一挑战,我们提出了生物启发的抗体语言模型(BALM)。该模型是在一个庞大的数据集上训练的,该数据集包含3.36亿个40%的非冗余未标记抗体序列,捕获抗体特有的独特和保守特性。值得注意的是,BALM展示了在四个抗原结合预测任务中的卓越表现。此外,我们介绍BALMFold,从BALM派生的端到端方法,能够从单个序列快速预测完整的原子抗体结构。值得注意的是,BALMFold优于那些成熟的方法,如AlphaFold2,IgFold,抗体基准中的ESMFold和OmegaFold,通过减少对不必要试验的需求,显示出促进创新工程和简化治疗性抗体开发的巨大潜力。BALMFold结构预测服务器可在https://beamlab-sh.com/models/BALMFold免费获得。
    In recent decades, antibodies have emerged as indispensable therapeutics for combating diseases, particularly viral infections. However, their development has been hindered by limited structural information and labor-intensive engineering processes. Fortunately, significant advancements in deep learning methods have facilitated the precise prediction of protein structure and function by leveraging co-evolution information from homologous proteins. Despite these advances, predicting the conformation of antibodies remains challenging due to their unique evolution and the high flexibility of their antigen-binding regions. Here, to address this challenge, we present the Bio-inspired Antibody Language Model (BALM). This model is trained on a vast dataset comprising 336 million 40% nonredundant unlabeled antibody sequences, capturing both unique and conserved properties specific to antibodies. Notably, BALM showcases exceptional performance across four antigen-binding prediction tasks. Moreover, we introduce BALMFold, an end-to-end method derived from BALM, capable of swiftly predicting full atomic antibody structures from individual sequences. Remarkably, BALMFold outperforms those well-established methods like AlphaFold2, IgFold, ESMFold and OmegaFold in the antibody benchmark, demonstrating significant potential to advance innovative engineering and streamline therapeutic antibody development by reducing the need for unnecessary trials. The BALMFold structure prediction server is freely available at https://beamlab-sh.com/models/BALMFold.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    生物医学命名实体识别(BioNER)是生物医学文本挖掘中最基本的任务之一。它旨在自动识别和分类文本中的生物医学实体。最近,基于深度学习的方法已应用于生物医学命名实体识别,并显示出令人鼓舞的结果。然而,许多生物实体是多义性和模棱两可的,这是生物医学命名实体识别任务的主要障碍之一。深度学习方法需要大量的训练数据,数据的缺乏也影响了模型识别的性能。为了解决多义词和数据不足的问题,对于生物医学命名实体识别的任务,基于BiLSTM-CRF架构,提出了一种融合语言模型的多任务学习框架。我们的模型使用语言模型来设计上下文的差分编码,可以获得动态词向量来区分不同数据集中的词。此外,我们使用多任务学习方法共同共享不同类型实体的动态词向量,以提高每种类型实体的识别性能。实验结果表明,该模型通过区分编码减少了由多义词引起的误报,并通过在不同实体数据之间共享信息来提高每个子任务的性能。与其他最先进的方法相比,我们的模型在四个典型的训练集中取得了优异的结果,并在F1值中取得了最好的结果。
    Biomedical Named Entity Recognition (BioNER) is one of the most basic tasks in biomedical text mining, which aims to automatically identify and classify biomedical entities in text. Recently, deep learning-based methods have been applied to Biomedical Named Entity Recognition and have shown encouraging results. However, many biological entities are polysemous and ambiguous, which is one of the main obstacles to the task of biomedical named entity recognition. Deep learning methods require large amounts of training data, so the lack of data also affect the performance of model recognition. To solve the problem of polysemous words and insufficient data, for the task of biomedical named entity recognition, we propose a multi-task learning framework fused with language model based on the BiLSTM-CRF architecture. Our model uses a language model to design a differential encoding of the context, which could obtain dynamic word vectors to distinguish words in different datasets. Moreover, we use a multi-task learning method to collectively share the dynamic word vector of different types of entities to improve the recognition performance of each type of entity. Experimental results show that our model reduces the false positives caused by polysemous words through differentiated coding, and improves the performance of each subtask by sharing information between different entity data. Compared with other state-of-the art methods, our model achieved superior results in four typical training sets, and achieved the best results in F1 values.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    揭示蛋白质与其他分子的结合位点,如核酸,肽,或者小配体,揭示了疾病机制的阐明和新药的设计。随着序列数据库中蛋白质的爆炸性增长,如何从序列中准确有效地识别这些结合位点变得至关重要。然而,目前的方法主要依赖于昂贵的多序列比对或实验性蛋白质结构,限制了它们的基因组规模应用。此外,这些方法还没有充分探索蛋白质结构的几何形状。这里,我们提议GPSite,同时预测DNA结合残基的多任务网络,RNA,肽,蛋白质,ATP,HEM,和蛋白质上的金属离子。GPSite接受了蛋白质语言模型的信息序列嵌入和预测结构的训练,同时以端到端的方式全面提取残差和关系几何上下文。实验表明,在各种基准数据集上,GPSite大大超过了最先进的基于序列和基于结构的方法,即使结构没有得到很好的预测。GPSite的低计算成本使超过568,000个序列的快速基因组规模结合残基注释成为可能。提供机会揭示未探索的结合位点与分子功能的关联,生物过程,和遗传变异。可以在https://bio-web1上自由访问GPSite网络服务器和注释数据库。nscc-gz.cn/app/GPSite.
    Revealing protein binding sites with other molecules, such as nucleic acids, peptides, or small ligands, sheds light on disease mechanism elucidation and novel drug design. With the explosive growth of proteins in sequence databases, how to accurately and efficiently identify these binding sites from sequences becomes essential. However, current methods mostly rely on expensive multiple sequence alignments or experimental protein structures, limiting their genome-scale applications. Besides, these methods haven\'t fully explored the geometry of the protein structures. Here, we propose GPSite, a multi-task network for simultaneously predicting binding residues of DNA, RNA, peptide, protein, ATP, HEM, and metal ions on proteins. GPSite was trained on informative sequence embeddings and predicted structures from protein language models, while comprehensively extracting residual and relational geometric contexts in an end-to-end manner. Experiments demonstrate that GPSite substantially surpasses state-of-the-art sequence-based and structure-based approaches on various benchmark datasets, even when the structures are not well-predicted. The low computational cost of GPSite enables rapid genome-scale binding residue annotations for over 568,000 sequences, providing opportunities to unveil unexplored associations of binding sites with molecular functions, biological processes, and genetic variants. The GPSite webserver and annotation database can be freely accessed at https://bio-web1.nscc-gz.cn/app/GPSite.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    通过自监督学习(SSL)预训练的语言模型已被广泛用于研究蛋白质序列,而针对基因组序列开发的模型很少,并且仅限于单个物种。由于缺乏来自不同物种的基因组,这些模型不能有效地利用进化信息。在这项研究中,我们开发了SpliceBERT,通过掩蔽语言建模对来自72种脊椎动物的初级核糖核酸(RNA)序列进行预训练的语言模型,并将其应用于基于序列的RNA剪接建模。对不同物种进行预训练的SpliceBERT可以有效地鉴定进化上保守的元素。同时,学习到的隐藏状态和注意力权重可以表征剪接位点的生物学特性。因此,SpliceBERT在几个下游任务中显示出有效的效果:对剪接的变异效应的零射预测,预测人类的分支点,以及剪接位点的跨物种预测。我们的研究强调了在不同物种上预先训练基因组语言模型的重要性,并表明SSL是一种有前途的方法,可以增强我们对基因组序列调控逻辑的理解。
    Language models pretrained by self-supervised learning (SSL) have been widely utilized to study protein sequences, while few models were developed for genomic sequences and were limited to single species. Due to the lack of genomes from different species, these models cannot effectively leverage evolutionary information. In this study, we have developed SpliceBERT, a language model pretrained on primary ribonucleic acids (RNA) sequences from 72 vertebrates by masked language modeling, and applied it to sequence-based modeling of RNA splicing. Pretraining SpliceBERT on diverse species enables effective identification of evolutionarily conserved elements. Meanwhile, the learned hidden states and attention weights can characterize the biological properties of splice sites. As a result, SpliceBERT was shown effective on several downstream tasks: zero-shot prediction of variant effects on splicing, prediction of branchpoints in humans, and cross-species prediction of splice sites. Our study highlighted the importance of pretraining genomic language models on a diverse range of species and suggested that SSL is a promising approach to enhance our understanding of the regulatory logic underlying genomic sequences.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    有益细菌大部分仍未被开发。缺乏系统的方法,了解益生菌群落特征变得具有挑战性,导致不同出版物之间关于其益生菌作用的各种结论。我们开发了基于语言模型的元益生菌来快速检测宏基因组中的益生菌,在模拟基准数据集中展示卓越的性能。对益生菌治疗个体的肠道宏基因组进行测试,它揭示了超出训练数据的干预菌株来源的垃圾箱和其他益生菌相关垃圾箱的自生性,例如质粒样的仓。对这些垃圾箱的分析揭示了各种益生菌机制和bai操纵子作为益生菌的潜在标记。在不同的健康疾病队列中,这些垃圾箱在健康个体中更常见,象征着它们的益生菌作用,但是基于这些垃圾箱丰度的相关健康预测面临着跨疾病挑战.为了更好地理解益生菌的异质性,我们使用metaProbiotics从全球肠道宏基因组数据构建了一个全面的益生菌基因组集。该组的模块分析表明,患病个体通常缺乏某些益生菌基因模块,在不同的疾病中缺失的模块存在显著差异。此外,同一益生菌上的不同基因模块对各种疾病具有异质性作用。因此,我们认为益生菌群落的基因功能完整性在维持肠道稳态方面比仅仅增加特定基因丰度更为重要。不分青红皂白地添加益生菌可能不会促进健康。我们预计,基于创新语言模型的metaProbiotics工具将利用大规模宏基因组数据促进新的益生菌发现,并促进细菌益生菌效应的系统研究。metaProbiotics计划可在https://github.com/zhenchengfang/metaProbiotics上免费下载。
    Beneficial bacteria remain largely unexplored. Lacking systematic methods, understanding probiotic community traits becomes challenging, leading to various conclusions about their probiotic effects among different publications. We developed language model-based metaProbiotics to rapidly detect probiotic bins from metagenomes, demonstrating superior performance in simulated benchmark datasets. Testing on gut metagenomes from probiotic-treated individuals, it revealed the probioticity of intervention strains-derived bins and other probiotic-associated bins beyond the training data, such as a plasmid-like bin. Analyses of these bins revealed various probiotic mechanisms and bai operon as probiotic Ruminococcaceae\'s potential marker. In different health-disease cohorts, these bins were more common in healthy individuals, signifying their probiotic role, but relevant health predictions based on the abundance profiles of these bins faced cross-disease challenges. To better understand the heterogeneous nature of probiotics, we used metaProbiotics to construct a comprehensive probiotic genome set from global gut metagenomic data. Module analysis of this set shows that diseased individuals often lack certain probiotic gene modules, with significant variation of the missing modules across different diseases. Additionally, different gene modules on the same probiotic have heterogeneous effects on various diseases. We thus believe that gene function integrity of the probiotic community is more crucial in maintaining gut homeostasis than merely increasing specific gene abundance, and adding probiotics indiscriminately might not boost health. We expect that the innovative language model-based metaProbiotics tool will promote novel probiotic discovery using large-scale metagenomic data and facilitate systematic research on bacterial probiotic effects. The metaProbiotics program can be freely downloaded at https://github.com/zhenchengfang/metaProbiotics.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    测试时间增强(TTA)是一种完善的技术,涉及在推理阶段汇总测试输入的转换示例。目标是增强模型性能并减少预测的不确定性。尽管它的优点是不需要额外的训练或超参数调整,并且适用于任何现有模型,TTA在NLP领域仍处于早期阶段。这部分是由于难以辨别不同转化样品的贡献,这会对预测产生负面影响。为了解决这些问题,我们提出了选择性测试时间增强,叫做STTA,其旨在通过识别可靠的样品来选择用于聚集的最有益的转化样品。此外,我们分析并实证验证了为什么TTA对某些文本数据增强方法敏感,并揭示了为什么某些数据增强方法会导致错误的预测。通过广泛的实验,我们证明STTA是一种简单有效的方法,可以在各种文本分类任务中产生有希望的结果。
    Test-time augmentation (TTA) is a well-established technique that involves aggregating transformed examples of test inputs during the inference stage. The goal is to enhance model performance and reduce the uncertainty of predictions. Despite its advantages of not requiring additional training or hyperparameter tuning, and being applicable to any existing model, TTA is still in its early stages in the field of NLP. This is partly due to the difficulty of discerning the contribution of different transformed samples, which can negatively impact predictions. In order to address these issues, we propose Selective Test-Time Augmentation, called STTA, which aims to select the most beneficial transformed samples for aggregation by identifying reliable samples. Furthermore, we analyze and empirically verify why TTA is sensitive to some text data augmentation methods and reveal why some data augmentation methods lead to erroneous predictions. Through extensive experiments, we demonstrate that STTA is a simple and effective method that can produce promising results in various text classification tasks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:ChatGPT可能充当研究助手,以帮助组织思考方向并总结研究成果。然而,很少有研究检查质量,相似性(摘要与原始摘要相似),以及研究人员提供全文基础研究论文时ChatGPT生成的摘要的准确性。
    目的:我们旨在评估人工智能(AI)模型在生成基础临床前研究摘要中的适用性。
    方法:我们选择了《自然》杂志的30篇基础研究论文,基因组生物学,和生物精神病学。不包括摘要,我们将全文输入到ChatPDF中,基于ChatGPT的语言模型的应用,我们提示它生成与原始论文中使用的相同样式的摘要。总共邀请了8位专家来评估这些摘要的质量(基于0-10的李克特量表),并确定哪些摘要是由ChatPDF生成的,使用盲目的方法。还评估了这些摘要与原始摘要的相似性以及AI内容的准确性。
    结果:ChatGPT生成的摘要质量低于实际摘要的质量(10分Likert量表:平均值4.72,SD2.09与平均值8.09,SD1.03;P<.001)。非结构化格式的质量差异显着(平均差-4.33;95%CI-4.79至-3.86;P<.001),但在4小标题结构化格式中最小(平均差-2.33;95%CI-2.79至-1.86)。在30份ChatGPT生成的摘要中,3显示错误的结论,和10个被确定为AI内容。原始摘要和生成摘要之间的平均相似性百分比不高(2.10%-4.40%)。蒙蔽的审阅者在猜测使用ChatGPT编写的摘要时达到了93%(224/240)的准确率。
    结论:使用ChatGPT生成科学摘要可能不会导致使用人类编写的真实全文时的相似性问题。然而,ChatGPT生成的摘要的质量次优,他们的准确率不是100%。
    ChatGPT may act as a research assistant to help organize the direction of thinking and summarize research findings. However, few studies have examined the quality, similarity (abstracts being similar to the original one), and accuracy of the abstracts generated by ChatGPT when researchers provide full-text basic research papers.
    We aimed to assess the applicability of an artificial intelligence (AI) model in generating abstracts for basic preclinical research.
    We selected 30 basic research papers from Nature, Genome Biology, and Biological Psychiatry. Excluding abstracts, we inputted the full text into ChatPDF, an application of a language model based on ChatGPT, and we prompted it to generate abstracts with the same style as used in the original papers. A total of 8 experts were invited to evaluate the quality of these abstracts (based on a Likert scale of 0-10) and identify which abstracts were generated by ChatPDF, using a blind approach. These abstracts were also evaluated for their similarity to the original abstracts and the accuracy of the AI content.
    The quality of ChatGPT-generated abstracts was lower than that of the actual abstracts (10-point Likert scale: mean 4.72, SD 2.09 vs mean 8.09, SD 1.03; P<.001). The difference in quality was significant in the unstructured format (mean difference -4.33; 95% CI -4.79 to -3.86; P<.001) but minimal in the 4-subheading structured format (mean difference -2.33; 95% CI -2.79 to -1.86). Among the 30 ChatGPT-generated abstracts, 3 showed wrong conclusions, and 10 were identified as AI content. The mean percentage of similarity between the original and the generated abstracts was not high (2.10%-4.40%). The blinded reviewers achieved a 93% (224/240) accuracy rate in guessing which abstracts were written using ChatGPT.
    Using ChatGPT to generate a scientific abstract may not lead to issues of similarity when using real full texts written by humans. However, the quality of the ChatGPT-generated abstracts was suboptimal, and their accuracy was not 100%.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    生成对抗网络(GAN)已经成功地产生了功能性蛋白质序列。然而,传统的GAN经常遭受固有的随机性,导致获得所需序列的概率较低。由于湿实验室实验的高成本,计算机辅助抗体优化的主要目标是从各种可能性中识别出高质量的候选抗体,然而,提高GANs产生这些所需抗体的能力是一个挑战。在这项研究中,我们提出并评估了一种新的GAN,称为语言模型引导的抗体生成对抗网络(AbGAN-LMG)。这个GAN使用语言模型作为输入,利用这些模型的强大代表性能力来提高GAN的高质量抗体的生成。我们对AbGAN-LMG针对COVID-19(SARS-CoV-2)和中东呼吸综合征(MERS-CoV)产生的抗体文库和序列进行了全面评估。结果表明,AbGAN-LMG已经了解了抗体的基本特征,并且它改善了产生的文库的多样性。此外,当使用AZD-8895作为优化的目标抗体生成序列时,超过50%的生成序列表现出比AZD-8895本身更好的显影性。通过分子对接,与AZD-8895相比,我们鉴定出70种抗体对SARS-CoV-2的野生型受体结合域(RBD)具有更高的亲和力.总之,AbGAN-LMG表明,与GAN结合使用的语言模型可以生成更高质量的文库和候选序列,从而提高了抗体优化的效率。AbGAN-LMG可在http://39.102.71.224:88/获得。
    Generative adversarial networks (GANs) have successfully generated functional protein sequences. However, traditional GANs often suffer from inherent randomness, resulting in a lower probability of obtaining desirable sequences. Due to the high cost of wet-lab experiments, the main goal of computer-aided antibody optimization is to identify high-quality candidate antibodies from a large range of possibilities, yet improving the ability of GANs to generate these desired antibodies is a challenge. In this study, we propose and evaluate a new GAN called the Language Model Guided Antibody Generative Adversarial Network (AbGAN-LMG). This GAN uses a language model as an input, harnessing such models\' powerful representational capabilities to improve the GAN\'s generation of high-quality antibodies. We conducted a comprehensive evaluation of the antibody libraries and sequences generated by AbGAN-LMG for COVID-19 (SARS-CoV-2) and Middle East Respiratory Syndrome (MERS-CoV). Results indicate that AbGAN-LMG has learned the fundamental characteristics of antibodies and that it improved the diversity of the generated libraries. Additionally, when generating sequences using AZD-8895 as the target antibody for optimization, over 50% of the generated sequences exhibited better developability than AZD-8895 itself. Through molecular docking, we identified 70 antibodies that demonstrated higher affinity for the wild-type receptor-binding domain (RBD) of SARS-CoV-2 compared to AZD-8895. In conclusion, AbGAN-LMG demonstrates that language models used in conjunction with GANs can enable the generation of higher-quality libraries and candidate sequences, thereby improving the efficiency of antibody optimization. AbGAN-LMG is available at http://39.102.71.224:88/.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:由于注册临床营养师供应不足,中国糖尿病患者的营养管理是一项重大挑战。为了解决这个问题,创建了一个基于人工智能(AI)的营养师程序,该程序使用高级语言和图像识别模型。该程序可以从患者的膳食图像中识别成分,并提供营养指导和饮食建议。
    目的:本研究的主要目的是评估支持该计划的模型的能力。
    方法:通过多步骤过程评估AI营养师计划对2型糖尿病(T2DM)患者的潜力。首先,我们在2型糖尿病患者和内分泌学家中进行了一项调查,以确定饮食习惯方面的知识差距.然后通过中国注册营养师考试对ChatGPT和GPT4.0进行测试,以评估他们提供循证饮食建议的熟练程度。将ChatGPT对有关医学营养治疗的常见问题的回答与专业营养师的专家回答进行比较,以评估其熟练程度。该模型的食品建议经过仔细审查,以确保与专家建议保持一致。开发了基于深度学习的图像识别模型,用于成分级别的食品识别,并将其性能与现有模型进行了比较。最后,开发了一个用户友好的应用程序,整合语言和图像识别模型的功能,以潜在地改善对T2DM患者的护理。
    结果:大多数患者(182/206,88.4%)需要更直接和全面的营养管理和教育。ChatGPT和GPT4.0都通过了中国注册营养师考试。ChatGPT的食品建议主要符合最佳实践,除了某些食物,如根茎类蔬菜和干豆。专业营养师对ChatGPT对常见问题的回答的评论在很大程度上是积极的,168人中有162人提供好评。多标签图像识别模型评估表明,DinoV2模型的平均F1得分为0.825,表明识别成分的准确性很高。
    结论:模型评估是有希望的。基于AI的营养师计划现在已经准备好进行有监督的试点研究。
    Nutritional management for patients with diabetes in China is a significant challenge due to the low supply of registered clinical dietitians. To address this, an artificial intelligence (AI)-based nutritionist program that uses advanced language and image recognition models was created. This program can identify ingredients from images of a patient\'s meal and offer nutritional guidance and dietary recommendations.
    The primary objective of this study is to evaluate the competence of the models that support this program.
    The potential of an AI nutritionist program for patients with type 2 diabetes mellitus (T2DM) was evaluated through a multistep process. First, a survey was conducted among patients with T2DM and endocrinologists to identify knowledge gaps in dietary practices. ChatGPT and GPT 4.0 were then tested through the Chinese Registered Dietitian Examination to assess their proficiency in providing evidence-based dietary advice. ChatGPT\'s responses to common questions about medical nutrition therapy were compared with expert responses by professional dietitians to evaluate its proficiency. The model\'s food recommendations were scrutinized for consistency with expert advice. A deep learning-based image recognition model was developed for food identification at the ingredient level, and its performance was compared with existing models. Finally, a user-friendly app was developed, integrating the capabilities of language and image recognition models to potentially improve care for patients with T2DM.
    Most patients (182/206, 88.4%) demanded more immediate and comprehensive nutritional management and education. Both ChatGPT and GPT 4.0 passed the Chinese Registered Dietitian examination. ChatGPT\'s food recommendations were mainly in line with best practices, except for certain foods like root vegetables and dry beans. Professional dietitians\' reviews of ChatGPT\'s responses to common questions were largely positive, with 162 out of 168 providing favorable reviews. The multilabel image recognition model evaluation showed that the Dino V2 model achieved an average F1 score of 0.825, indicating high accuracy in recognizing ingredients.
    The model evaluations were promising. The AI-based nutritionist program is now ready for a supervised pilot study.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号