Potential application of item-response theory to interpretation of medical codes in electronic patient records
- PMID: 22176509
- PMCID: PMC3261214
- DOI: 10.1186/1471-2288-11-168
Abstract
Background: Electronic patient records are generally coded using extensive sets of codes but the significance of the utilisation of individual codes may be unclear. Item response theory (IRT) models are used to characterise the psychometric properties of items included in tests and questionnaires. This study asked whether the properties of medical codes in electronic patient records may be characterised through the application of item response theory models.
Methods: Data were provided by a cohort of 47,845 participants from 414 family practices in the UK General Practice Research Database (GPRD) with a first stroke between 1997 and 2006. Each eligible stroke code, out of a set of 202 OXMIS and Read codes, was coded as either recorded or not recorded for each participant. A two-parameter IRT model was fitted using marginal maximum likelihood estimation. Estimated parameters from the model were considered to characterise each code with respect to the latent trait of stroke diagnosis. The location parameter is referred to as a calibration parameter, while the slope parameter is referred to as a discrimination parameter.
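As an illustration of the roles of the two parameters, the sketch below evaluates the standard logistic two-parameter item response function, P(recorded | theta) = 1 / (1 + exp(-a (theta - b))), where a is the discrimination (slope) and b is the calibration (location). The abstract does not state the exact parameterisation the authors used, so this form is an assumption, and the function name `two_pl_probability` and the parameter values are hypothetical, chosen only to fall within the ranges reported in the Results.

```python
import math


def two_pl_probability(theta, discrimination, calibration):
    """Probability that a code is recorded under a standard logistic 2PL model.

    theta          -- latent trait value for a participant (assumed scale)
    discrimination -- slope parameter a (how sharply the probability rises)
    calibration    -- location parameter b (trait level where P = 0.5)
    """
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - calibration)))


# Hypothetical parameter values, not estimates for any actual code in the study.
for theta in (0.0, 4.47, 8.0):
    p = two_pl_probability(theta, discrimination=2.78, calibration=4.47)
    print(f"theta={theta:5.2f}  P(recorded)={p:.4f}")
```

Under this form, a code with a high calibration value is recorded with probability 0.5 only at a high level of the latent trait, and a larger discrimination value makes the transition from "rarely recorded" to "usually recorded" steeper around that point.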
Results: There were 79,874 stroke code occurrences available for analysis. Utilisation of codes varied between family practices with intraclass correlation coefficients of up to 0.25 for the most frequently used codes. IRT analyses were restricted to 110 Read codes. Calibration and discrimination parameters were estimated for 77 (70%) codes that were endorsed for 1,942 stroke patients. Parameters were not estimated for the remaining more frequently used codes. Discrimination parameter values ranged from 0.67 to 2.78, while calibration parameter values ranged from 4.47 to 11.58. The two-parameter model gave a better fit to the data than either the one- or three-parameter models. However, high chi-square values for about a fifth of the stroke codes were suggestive of poor item fit.
Conclusion: The application of item response theory models to coded electronic patient records might contribute to identifying medical codes that offer poor discrimination or low calibration. This might indicate the need for improved coding sets or a requirement for improved clinical coding practice. However, in this study estimates were only obtained for a small proportion of participants and there was some evidence of poor model fit. There was also evidence of variation in the utilisation of codes between family practices, raising the possibility that, in practice, properties of codes may vary for different coders.