背景:非贫血性地中海贫血(TT)在华南地区TT病例中占很高比例。
目的:使用人工智能(AI)分析红细胞形态和机器学习(ML)来识别非贫血人群中的TT基因携带者。
方法:收集76个TT基因携带者和97个对照的数字形态学数据。基于AI技术的迈瑞MC-100i用于定量分析异常红细胞的百分比。Further,使用ML构建预测模型。
结果:非贫血TT携带者占TT病例的60%以上。选择随机森林作为预测模型,命名为TT@Normal。TT@Normal算法在训练中表现突出,验证,和外部验证集,可以有效地识别非贫血人群中的TT携带者。TT@Normal模型中的前三个重量是靶细胞,微细胞,和泪滴细胞。异常红细胞的百分比升高应引起人们对TT基因携带者的强烈怀疑。TT@Normal可以被提升并用作可视化和共享工具。它可以通过URL链接访问,并且可以由医务人员在线使用,以预测非贫血人群中TT基因携带的可能性。
结论:基于ML的模型TT@Normal可以有效地识别非贫血人群中的TT携带者。靶细胞百分比升高,微细胞,泪滴细胞应该引起人们对TT基因载体的强烈怀疑。
BACKGROUND: Non-anemic thalassemia trait (TT) accounted for a high proportion of TT cases in South China.
OBJECTIVE: To use artificial intelligence (AI) analysis of erythrocyte morphology and machine learning (ML) to identify TT gene carriers in a non-anemic population.
METHODS: Digital morphological data from 76 TT gene carriers and 97 controls were collected. The AI technology-based Mindray MC-100i was used to quantitatively analyze the percentage of abnormal erythrocytes. Further, ML was used to construct a prediction model.
RESULTS: Non-anemic TT carriers accounted for over 60% of the TT cases. Random Forest was selected as the prediction model and named TT@Normal. The TT@Normal algorithm showed outstanding performance in the training, validation, and external validation sets and could efficiently identify TT carriers in the non-anemic population. The top three weights in the TT@Normal model were the target cells, microcytes, and teardrop cells. Elevated percentages of abnormal erythrocytes should raise a strong suspicion of being a TT gene carrier. TT@Normal could be promoted and used as a visualization and sharing tool. It is accessible through a URL link and can be used by medical staff online to predict the possibility of TT gene carriage in a non-anemic population.
CONCLUSIONS: The ML-based model TT@Normal could efficiently identify TT carriers in non-anemic people. Elevated percentages of target cells, microcytes, and teardrop cells should raise a strong suspicion of being a TT gene carrier.