本研究提出了一种数据驱动的方法,用于使用食品和饲料快速预警系统(RASFF)和世界卫生组织(WHO)的全球环境监测系统(GEMS)对乳制品中与化学和微生物污染物相关的食品安全警报进行分类。这项研究旨在通过探索性数据分析,根据微生物和化学危害的存在和严重程度对其进行优先级排序,并使用机器学习(ML)方法对化学危害的严重程度进行分类。它确定了单核细胞增生李斯特菌,大肠杆菌,沙门氏菌,假单胞菌属。,葡萄球菌属。,蜡样芽孢杆菌,梭菌属。,以及在乳制品中优先考虑的微生物危害。该研究还根据其存在和严重程度优先考虑了十大化学危害。这些危害包括硝酸盐,亚硝酸盐,Ergocornine,3-MCPD酯,铅,砷,曲霉毒素A,镉,水银,和黄曲霉毒素(G1、B1、G2、B2、G5和M1)。使用ML技术,将食品安全警报分类为“严重”或“非严重”的准确率高达98%。此外,研究确定了参考剂量(RfD),物质量,通知类型,产品,和物质是影响ML模型性能的最重要功能。这些ML模型(决策树,随机森林,k-最近的邻居,线性判别分析,和支持向量机)也在与乳制品中化学污染物相关的RASFF警报的外部数据集上进行了验证。他们实现了高达95.1%的准确度。这项研究的结果证明了模型的稳健性和分类能力,食品安全警报与乳制品中的化学污染物,即使是新数据。这些结果可以促进更有效的机器学习模型的开发,用于对与乳制品中化学污染物相关的食品安全警报进行分类。强调开发准确有效的分类模型以及时干预的重要性。
This study presents a data-driven approach for classifying food safety alerts related to chemical and microbial contaminants in dairy products using the Rapid Alert System for Food and Feed (RASFF) and the World Health Organization (WHO)\'s Global Environmental Monitoring System (GEMS) food contaminants databases. This research aimed to prioritise microbial and chemical hazards based on their presence and severity through exploratory data analysis and to classify the severity of chemical hazards using machine learning (ML) approaches. It identified Listeria monocytogenes, Escherichia coli, Salmonella, Pseudomonas spp., Staphylococcus spp., Bacillus cereus, Clostridium spp., and Cronobacter sakazakii as the microbial hazards of priority in dairy products. The study also prioritised the top ten chemical hazards based on their presence and severity. These hazards include nitrate, nitrite, ergocornine, 3-MCPD ester, lead, arsenic, ochratoxin A, cadmium, mercury, and aflatoxin (G1, B1, G2, B2, G5 and M1). Using ML techniques, the accuracy rate of classifying food safety alerts as either \'serious\' or \'non-serious\' was up to 98 %. Additionally, the study identified Reference dose (RfD), substance amount, notification type, product, and substance as the most important features affecting the ML models\' performance. These ML models (decision trees, random forests, k-nearest neighbors, linear discriminant analysis, and support vector machines) were also validated on an external dataset of RASFF alerts related to chemical contaminants in dairy products. They achieved an accuracy of up to 95.1 %. The study\'s findings demonstrate the models\' robustness and ability to classify food safety alerts related to chemical contaminants in dairy products, even on new data. These results can enhance the development of more effective machine-learning models for classifying food safety alerts related to chemical contaminants in dairy products, highlighting the importance of developing accurate and efficient classification models for timely intervention.