Identifying and categorizing spurious weight data in electronic medical records
- PMID: 29566188
- DOI: 10.1093/ajcn/nqx056
Identifying and categorizing spurious weight data in electronic medical records
Abstract
Background: Spurious weights compromise the validity of summary measures, such as averages and trends. Even rare errors in weight records can undermine the utility of electronic medical record (EMR) data.
Objective: We sought to estimate the prevalence of spurious weight values in a large EMR, to ascertain the likely causes, and to develop and test straightforward algorithms for identifying spurious weight data.
Design: Using EMR data from 10,000 randomly selected patients aged ≥65 y in the VA system, we examined the percentage of weight change across various time intervals, from 1 to 3000 d. We examined descriptive results and developed 3 algorithms to categorize degree of weight change over time. On the basis of distributions, we identified cases that were most likely spurious. We manually reviewed these and categorized the type of error.
Results: The data followed the expected distributions. The algorithms reliably identified spurious weight. Approximately 0.8% of all weights in the record appeared to be spurious and ∼1 in 5 patient charts included ≥1 spurious weight value. The most common type of error involved the misentry of a single digit (e.g., 148 for 178).
Conclusions: Spurious weights are common in EMRs. Straightforward algorithms can identify and remove them, and thus enhance the reliability of EMR data.
Comment in
-
Addressing inaccurate measures of body weights in epidemiologic and clinical surveillance data involving older adults.Am J Clin Nutr. 2018 Mar 1;107(3):301-302. doi: 10.1093/ajcn/nqy032. Am J Clin Nutr. 2018. PMID: 29566202 No abstract available.
Similar articles
-
Measuring data reliability for preventive services in electronic medical records.BMC Health Serv Res. 2012 May 14;12:116. doi: 10.1186/1472-6963-12-116. BMC Health Serv Res. 2012. PMID: 22583552 Free PMC article.
-
Illustrating the patient journey through the care continuum: Leveraging structured primary care electronic medical record (EMR) data in Ontario, Canada using chronic obstructive pulmonary disease as a case study.Int J Med Inform. 2020 Aug;140:104159. doi: 10.1016/j.ijmedinf.2020.104159. Epub 2020 May 19. Int J Med Inform. 2020. PMID: 32473567
-
EMR-LIP: A lightweight framework for standardizing the preprocessing of longitudinal irregular data in electronic medical records.Comput Methods Programs Biomed. 2025 Feb;259:108521. doi: 10.1016/j.cmpb.2024.108521. Epub 2024 Nov 24. Comput Methods Programs Biomed. 2025. PMID: 39615196
-
EMR in Eye Units.Rom J Ophthalmol. 2020 Jan-Mar;64(1):21-24. Rom J Ophthalmol. 2020. PMID: 32292853 Free PMC article. Review.
-
Usability and Safety in Electronic Medical Records Interface Design: A Review of Recent Literature and Guideline Formulation.Hum Factors. 2015 Aug;57(5):805-34. doi: 10.1177/0018720815576827. Epub 2015 Mar 23. Hum Factors. 2015. PMID: 25850118 Review.
Cited by
-
Hypocrisy Around Medical Patient Data: Issues of Access for Biomedical Research, Data Quality, Usefulness for the Purpose and Omics Data as Game Changer.Asian Bioeth Rev. 2019 Jun 1;11(2):189-207. doi: 10.1007/s41649-019-00085-3. eCollection 2019 Jun. Asian Bioeth Rev. 2019. PMID: 33717311 Free PMC article.
-
Supporting Weight Management during COVID-19: A Randomized Controlled Trial of a Web-Based, ACT-Based, Guided Self-Help Intervention.Obes Facts. 2022;15(4):550-559. doi: 10.1159/000524031. Epub 2022 Apr 13. Obes Facts. 2022. PMID: 35417910 Free PMC article. Clinical Trial.
-
Supporting Weight Management during COVID-19 (SWiM-C): twelve-month follow-up of a randomised controlled trial of a web-based, ACT-based, guided self-help intervention.Int J Obes (Lond). 2023 Jan;47(1):51-59. doi: 10.1038/s41366-022-01232-x. Epub 2022 Nov 11. Int J Obes (Lond). 2023. PMID: 36369513 Free PMC article. Clinical Trial.
-
A machine-learned model for predicting weight loss success using weight change features early in treatment.NPJ Digit Med. 2024 Nov 29;7(1):344. doi: 10.1038/s41746-024-01299-y. NPJ Digit Med. 2024. PMID: 39613928 Free PMC article.
-
Is it time to stop sweeping data cleaning under the carpet? A novel algorithm for outlier management in growth data.PLoS One. 2020 Jan 24;15(1):e0228154. doi: 10.1371/journal.pone.0228154. eCollection 2020. PLoS One. 2020. PMID: 31978151 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical