Selection of medical diagnostic codes for analysis of electronic patient records. Application to stroke in a primary care database
- PMID: 19777060
- PMCID: PMC2744876
- DOI: 10.1371/journal.pone.0007168
Selection of medical diagnostic codes for analysis of electronic patient records. Application to stroke in a primary care database
Abstract
Background: Electronic patient records from primary care databases are increasingly used in public health and health services research but methods used to identify cases with disease are not well described. This study aimed to evaluate the relevance of different codes for the identification of acute stroke in a primary care database, and to evaluate trends in the use of different codes over time.
Methods: Data were obtained from the General Practice Research Database from 1997 to 2006. All subjects had a minimum of 24 months of up-to-standard record before the first recorded stroke diagnosis. Initially, we identified stroke cases using a supplemented version of the set of codes for prevalent stroke used by the Office for National Statistics in Key health statistics from general practice 1998 (ONS codes). The ONS codes were then independently reviewed by four raters and a restricted set of 121 codes for 'acute stroke' was identified but the kappa statistic was low at 0.23.
Results: Initial extraction of data using the ONS codes gave 48,239 cases of stroke from 1997 to 2006. Application of the restricted set of codes reduced this to 39,424 cases. There were 2,288 cases whose index medical codes were for 'stroke annual review' and 3,112 for 'stroke monitoring'. The frequency of stroke review and monitoring codes as index codes increased from 9 per year in 1997 to 1,612 in 2004, 1,530 in 2005 and 1,424 in 2006. The one year mortality of cases with the restricted set of codes was 29.1% but for 'stroke annual review,' 4.6% and for 'stroke monitoring codes', 5.7%.
Conclusion: In the analysis of electronic patient records, different medical codes for a single condition may have varying clinical and prognostic significance; utilisation of different medical codes may change over time; researchers with differing clinical or epidemiological experience may have differing interpretations of the relevance of particular codes. There is a need for greater transparency in the selection of sets of codes for different conditions, for the reporting of sensitivity analyses using different sets of codes, as well as sharing of code sets among researchers.
Conflict of interest statement
Similar articles
-
Potential application of item-response theory to interpretation of medical codes in electronic patient records.BMC Med Res Methodol. 2011 Dec 16;11:168. doi: 10.1186/1471-2288-11-168. BMC Med Res Methodol. 2011. PMID: 22176509 Free PMC article.
-
Validity of international classification of disease codes to identify ischemic stroke and intracranial hemorrhage among individuals with associated diagnosis of atrial fibrillation.Circ Cardiovasc Qual Outcomes. 2015 Jan;8(1):8-14. doi: 10.1161/CIRCOUTCOMES.113.000371. Epub 2015 Jan 13. Circ Cardiovasc Qual Outcomes. 2015. PMID: 25587093 Free PMC article.
-
Administrative data underestimate acute ischemic stroke events and thrombolysis treatments: Data from a multicenter validation survey in Italy.PLoS One. 2018 Mar 13;13(3):e0193776. doi: 10.1371/journal.pone.0193776. eCollection 2018. PLoS One. 2018. PMID: 29534079 Free PMC article.
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
-
A Systematic Review of Case-Identification Algorithms Based on Italian Healthcare Administrative Databases for Three Relevant Diseases of the Cardiovascular System: Acute Myocardial Infarction, Ischemic Heart Disease, and Stroke.Epidemiol Prev. 2019 Jul-Aug;43(4 Suppl 2):37-50. doi: 10.19191/EP19.4.S2.P037.091. Epidemiol Prev. 2019. PMID: 31650805
Cited by
-
Multiple risk factor control, mortality and cardiovascular events in type 2 diabetes and chronic kidney disease: a population-based cohort study.BMJ Open. 2018 May 8;8(5):e019950. doi: 10.1136/bmjopen-2017-019950. BMJ Open. 2018. PMID: 29739781 Free PMC article.
-
Completeness and diagnostic validity of recording acute myocardial infarction events in primary care, hospital care, disease registry, and national mortality records: cohort study.BMJ. 2013 May 20;346:f2350. doi: 10.1136/bmj.f2350. BMJ. 2013. PMID: 23692896 Free PMC article.
-
ClinicalCodes: an online clinical codes repository to improve the validity and reproducibility of research using electronic medical records.PLoS One. 2014 Jun 18;9(6):e99825. doi: 10.1371/journal.pone.0099825. eCollection 2014. PLoS One. 2014. PMID: 24941260 Free PMC article.
-
Psoriasis and the Risk of Major Cardiovascular Events: Cohort Study Using the Clinical Practice Research Datalink.J Invest Dermatol. 2015 Sep;135(9):2189-2197. doi: 10.1038/jid.2015.87. Epub 2015 Mar 5. J Invest Dermatol. 2015. PMID: 25742120
-
Potential application of item-response theory to interpretation of medical codes in electronic patient records.BMC Med Res Methodol. 2011 Dec 16;11:168. doi: 10.1186/1471-2288-11-168. BMC Med Res Methodol. 2011. PMID: 22176509 Free PMC article.
References
-
- Lawson DH, Sherman V, Hollowell J. The General Practice Research Database. Scientific and Ethical Advisory Group. QJM. 1998;91:445–452. - PubMed
-
- National Statistics. London: National Statistics; 2000. Key health statistics from general practice 1998. Series MB6 No 2.
-
- Toschke AM, Wolfe C, Heuschmann P, Rudd A, Gulliford M. The effect of antihypertensive treatment after stroke. An analysis of the General Practice Research Database (GPRD). Cerebrovascular Diseases (in press) 2009 - PubMed
-
- Hollowell J. The GPRD. Quality of morbidity data. Population Trends. 1997;87:36–40. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical