背景:这项描述性研究评估了完整性,协议,和代表性的种族记录在英国(英国)临床实践研究数据链(CPRD)初级保健数据库单独和,对于那些在英格兰注册了全科医生的病人来说,当与医院事件统计(HES)的二级护理数据相关联时。
方法:在2021年5月建立的所有英国患者的CPRDGOLD和CPRDAurum数据库中,对所有患者进行了种族记录评估。在对英国的分析中,英文数据来自CPRD-HES组合,而来自北爱尔兰的数据,苏格兰,威尔士只从CPRD抽走。在每个数据集中评估每位患者的种族记录一致性(CPRDGOLD,CPRDAurum,和HES数据集)以及最高级别种族分类的数据集之间(“亚洲”,\'黑色\',\'混合\',\'白色\',\'其他\')。通过比较CPRD-HES最高级别分类的种族分布与英国下放政府2011年人口普查的种族分布来评估代表性。此外,将CPRD-HES与2019年国家统计局(ONS2019)的英格兰和威尔士的实验性种族分布以及2021年5月的英国种族分布进行了比较,这些种族分布来自NHSDigital的全科医学提取服务数据,用于大流行规划和研究与HES数据链接(GDPPR-HES)。
结果:在CPRD-HES中,在英国,目前登记的81.7%的患者在初级保健中有种族记录。对于有多个种族记录的患者,个别一级和二级保健数据集中的不匹配种族<10%.在CPRD和HES记录的具有种族的英国患者中,93.3%的记录符合最高级别的分类;然而,“混合”和“其他”族裔群体的协议水平明显较低。与2011年英国人口普查相比,CPRD-HES的比例较低(80.3%与87.2%)和实验ONS2019数据(80.4%与84.3%)。CPRD-HES与GDPPR-HES的种族分布一致(“白色”80.4%与80.7%);然而,归类为“其他”的比例较小(1.1%与2.8%)。
结论:CPRD-HES在所有种族类别中都具有合适的代表性,与其他数据来源的英国普通人群相比,少数族裔群体的代表性过高,被归类为“其他”的比例较小。CPRD-HES数据可用于研究典型代表性不足群体的健康风险和结果。
This descriptive study assessed the completeness, agreement, and representativeness of ethnicity recording in the United Kingdom (UK) Clinical Practice Research Datalink (CPRD) primary care databases alone and, for those patients registered with a GP in England, when linked to secondary care data from Hospital Episode Statistics (HES).
Ethnicity records were assessed for all patients in the May 2021 builds of the CPRD GOLD and CPRD Aurum databases for all UK patients. In analyses of the UK, English data was from combined CPRD-HES, whereas data from Northern Ireland, Scotland, and Wales drew from CPRD only. The agreement of ethnicity records per patient was assessed within each dataset (CPRD GOLD, CPRD Aurum, and HES datasets) and between datasets at the highest level ethnicity categorisation (\'Asian\', \'black\', \'mixed\', \'white\', \'other\'). Representativeness was assessed by comparing the ethnic distributions at the highest-level categorisation of CPRD-HES to those from the Census 2011 across the UK\'s devolved administrations. Additionally, CPRD-HES was compared to the experimental ethnic distributions for England and Wales from the Office for National Statistics in 2019 (ONS2019) and the English ethnic distribution from May 2021 from NHS Digital\'s General Practice Extraction Service Data for Pandemic Planning and Research with HES data linkage (GDPPR-HES).
In CPRD-HES, 81.7% of currently registered patients in the UK had ethnicity recorded in primary care. For patients with multiple ethnicity records, mismatched ethnicity within individual primary and secondary care datasets was < 10%. Of English patients with ethnicity recorded in both CPRD and HES, 93.3% of records matched at the highest-level categorisation; however, the level of agreement was markedly lower in the \'mixed\' and \'other\' ethnic groups. CPRD-HES was less proportionately \'white\' compared to the UK Census 2011 (80.3% vs. 87.2%) and experimental ONS2019 data (80.4% vs. 84.3%). CPRD-HES was aligned with the ethnic distribution from GDPPR-HES (\'white\' 80.4% vs. 80.7%); however, with a smaller proportion classified as \'other\' (1.1% vs. 2.8%).
CPRD-HES has suitable representation of all ethnic categories with some overrepresentation of minority ethnic groups and a smaller proportion classified as \'other\' compared to the UK general population from other data sources. CPRD-HES data is useful for studying health risks and outcomes in typically underrepresented groups.