Linked electronic health records for research on a nationwide cohort of more than 54 million people in England: data resource
- PMID: 33827854
- PMCID: PMC8413899
- DOI: 10.1136/bmj.n826
Linked electronic health records for research on a nationwide cohort of more than 54 million people in England: data resource
Abstract
Objective: To describe a novel England-wide electronic health record (EHR) resource enabling whole population research on covid-19 and cardiovascular disease while ensuring data security and privacy and maintaining public trust.
Design: Data resource comprising linked person level records from national healthcare settings for the English population, accessible within NHS Digital's new trusted research environment.
Setting: EHRs from primary care, hospital episodes, death registry, covid-19 laboratory test results, and community dispensing data, with further enrichment planned from specialist intensive care, cardiovascular, and covid-19 vaccination data.
Participants: 54.4 million people alive on 1 January 2020 and registered with an NHS general practitioner in England.
Main measures of interest: Confirmed and suspected covid-19 diagnoses, exemplar cardiovascular conditions (incident stroke or transient ischaemic attack and incident myocardial infarction) and all cause mortality between 1 January and 31 October 2020.
Results: The linked cohort includes more than 96% of the English population. By combining person level data across national healthcare settings, data on age, sex, and ethnicity are complete for around 95% of the population. Among 53.3 million people with no previous diagnosis of stroke or transient ischaemic attack, 98 721 had a first ever incident stroke or transient ischaemic attack between 1 January and 31 October 2020, of which 30% were recorded only in primary care and 4% only in death registry records. Among 53.2 million people with no previous diagnosis of myocardial infarction, 62 966 had an incident myocardial infarction during follow-up, of which 8% were recorded only in primary care and 12% only in death registry records. A total of 959 470 people had a confirmed or suspected covid-19 diagnosis (714 162 in primary care data, 126 349 in hospital admission records, 776 503 in covid-19 laboratory test data, and 50 504 in death registry records). Although 58% of these were recorded in both primary care and covid-19 laboratory test data, 15% and 18%, respectively, were recorded in only one.
Conclusions: This population-wide resource shows the importance of linking person level data across health settings to maximise completeness of key characteristics and to ascertain cardiovascular events and covid-19 diagnoses. Although this resource was initially established to support research on covid-19 and cardiovascular disease to benefit clinical care and public health and to inform healthcare policy, it can broaden further to enable a wide range of research.
© Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY. No commercial re-use. See rights and permissions. Published by BMJ.
Conflict of interest statement
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: support from the funders listed above; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work. SH works as a data scientist and data curator for NHS Digital, which holds and processes the data.
Figures
Comment in
-
Covid-19, open science, and the CVD-COVID-UK initiative.BMJ. 2021 Apr 7;373:n898. doi: 10.1136/bmj.n898. BMJ. 2021. PMID: 33827892 No abstract available.
References
-
- Cavallaro F, Lugg-Widger F, Cannings-John R, Harron K. Open Letter: Reducing barriers to data access for research in the public interest—lessons from covid-19. BMJ Opinion 2020. https://blogs.bmj.com/bmj/2020/07/06/reducing-barriers-to-data-access-fo...
-
- Jones KH, Ford DV, Lyons RA. The SAIL Databank: 10 years of spearheading data privacy and research utility, 2007-2017. Swansea University. [cited 2021 Feb 19]. https://saildatabank.com/
-
- McGurnaghan SJ, Weir A, Bishop J, et al. Public Health Scotland COVID-19 Health Protection Study Group. Scottish Diabetes Research Network Epidemiology Group . Risks of and risk factors for COVID-19 disease in people with diabetes: a cohort study of the total population of Scotland. Lancet Diabetes Endocrinol 2021;9:82-93. 10.1016/S2213-8587(20)30405-8 - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- 16896/CRUK_/Cancer Research UK/United Kingdom
- WT_/Wellcome Trust/United Kingdom
- PG/15/33/31394/BHF_/British Heart Foundation/United Kingdom
- MR/L003120/1/MRC_/Medical Research Council/United Kingdom
- SP/18/3/33801/BHF_/British Heart Foundation/United Kingdom
- CH/17/1/32804/BHF_/British Heart Foundation/United Kingdom
- CH/12/2/29428/BHF_/British Heart Foundation/United Kingdom
- MC_PC_20059/MRC_/Medical Research Council/United Kingdom
- MR/K014811/1/MRC_/Medical Research Council/United Kingdom
- MC_PC_20051/MRC_/Medical Research Council/United Kingdom
- MR/S004149/1/MRC_/Medical Research Council/United Kingdom
- FS/18/5/33319/BHF_/British Heart Foundation/United Kingdom
- MR/S004149/2/MRC_/Medical Research Council/United Kingdom
- MC_PC_18029/MRC_/Medical Research Council/United Kingdom
- MR/K006584/1/MRC_/Medical Research Council/United Kingdom
- MC_UU_00011/4/MRC_/Medical Research Council/United Kingdom
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials