Validation Relaxation: A Quality Assurance Strategy for Electronic Data Collection
- PMID: 28821474
- PMCID: PMC5581386
- DOI: 10.2196/jmir.7813
Validation Relaxation: A Quality Assurance Strategy for Electronic Data Collection
Abstract
Background: The use of mobile devices for data collection in developing world settings is becoming increasingly common and may offer advantages in data collection quality and efficiency relative to paper-based methods. However, mobile data collection systems can hamper many standard quality assurance techniques due to the lack of a hardcopy backup of data. Consequently, mobile health data collection platforms have the potential to generate datasets that appear valid, but are susceptible to unidentified database design flaws, areas of miscomprehension by enumerators, and data recording errors.
Objective: We describe the design and evaluation of a strategy for estimating data error rates and assessing enumerator performance during electronic data collection, which we term "validation relaxation." Validation relaxation involves the intentional omission of data validation features for select questions to allow for data recording errors to be committed, detected, and monitored.
Methods: We analyzed data collected during a cluster sample population survey in rural Liberia using an electronic data collection system (Open Data Kit). We first developed a classification scheme for types of detectable errors and validation alterations required to detect them. We then implemented the following validation relaxation techniques to enable data error conduct and detection: intentional redundancy, removal of "required" constraint, and illogical response combinations. This allowed for up to 11 identifiable errors to be made per survey. The error rate was defined as the total number of errors committed divided by the number of potential errors. We summarized crude error rates and estimated changes in error rates over time for both individuals and the entire program using logistic regression.
Results: The aggregate error rate was 1.60% (125/7817). Error rates did not differ significantly between enumerators (P=.51), but decreased for the cohort with increasing days of application use, from 2.3% at survey start (95% CI 1.8%-2.8%) to 0.6% at day 45 (95% CI 0.3%-0.9%; OR=0.969; P<.001). The highest error rate (84/618, 13.6%) occurred for an intentional redundancy question for a birthdate field, which was repeated in separate sections of the survey. We found low error rates (0.0% to 3.1%) for all other possible errors.
Conclusions: A strategy of removing validation rules on electronic data capture platforms can be used to create a set of detectable data errors, which can subsequently be used to assess group and individual enumerator error rates, their trends over time, and categories of data collection that require further training or additional quality control measures. This strategy may be particularly useful for identifying individual enumerators or systematic data errors that are responsive to enumerator training and is best applied to questions for which errors cannot be prevented through training or software design alone. Validation relaxation should be considered as a component of a holistic data quality assurance strategy.
Keywords: data accuracy; data collection; eHealth; mHealth; questionnaire design; research methodology; survey methodology; surveys.
©Avi Kenny, Nicholas Gordon, Thomas Griffiths, John D Kraemer, Mark J Siedner. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 18.08.2017.
Conflict of interest statement
Conflicts of Interest: None declared.
Figures
Similar articles
-
Design and implementation of a mobile health electronic data capture platform that functions in fully-disconnected settings: a pilot study in rural Liberia.BMC Med Inform Decis Mak. 2020 Feb 22;20(1):39. doi: 10.1186/s12911-020-1059-6. BMC Med Inform Decis Mak. 2020. PMID: 32087731 Free PMC article.
-
Evaluation of Electronic and Paper-Pen Data Capturing Tools for Data Quality in a Public Health Survey in a Health and Demographic Surveillance Site, Ethiopia: Randomized Controlled Crossover Health Care Information Technology Evaluation.JMIR Mhealth Uhealth. 2019 Feb 11;7(2):e10995. doi: 10.2196/10995. JMIR Mhealth Uhealth. 2019. PMID: 30741642 Free PMC article. Clinical Trial.
-
Ordering errors, objections and invariance in utility survey responses: a framework for understanding who, why and what to do.Appl Health Econ Health Policy. 2011 Jul 1;9(4):225-41. doi: 10.2165/11590480-000000000-00000. Appl Health Econ Health Policy. 2011. PMID: 21682351
-
Avoiding and identifying errors in health technology assessment models: qualitative study and methodological review.Health Technol Assess. 2010 May;14(25):iii-iv, ix-xii, 1-107. doi: 10.3310/hta14250. Health Technol Assess. 2010. PMID: 20501062 Review.
-
Global Health and Emergency Care: Overcoming Clinical Research Barriers.Acad Emerg Med. 2017 Apr;24(4):484-493. doi: 10.1111/acem.13142. Epub 2017 Mar 17. Acad Emerg Med. 2017. PMID: 27976457
Cited by
-
The added value of a mobile application of Community Case Management on referral, re-consultation and hospitalization rates of children aged under 5 years in two districts in Northern Malawi: study protocol for a pragmatic, stepped-wedge cluster-randomized controlled trial.Trials. 2017 Oct 11;18(1):475. doi: 10.1186/s13063-017-2213-z. Trials. 2017. PMID: 29020976 Free PMC article. Clinical Trial.
-
Design and implementation of a mobile health electronic data capture platform that functions in fully-disconnected settings: a pilot study in rural Liberia.BMC Med Inform Decis Mak. 2020 Feb 22;20(1):39. doi: 10.1186/s12911-020-1059-6. BMC Med Inform Decis Mak. 2020. PMID: 32087731 Free PMC article.
-
A Community Health Worker Intervention to Increase Childhood Disease Treatment Coverage in Rural Liberia: A Controlled Before-and-After Evaluation.Am J Public Health. 2018 Sep;108(9):1252-1259. doi: 10.2105/AJPH.2018.304555. Epub 2018 Jul 19. Am J Public Health. 2018. PMID: 30024811 Free PMC article.
-
Measuring health system responsiveness in a national community health worker primary care programme in rural Liberia.Int J Qual Health Care. 2023 May 17;35(2):mzad027. doi: 10.1093/intqhc/mzad027. Int J Qual Health Care. 2023. PMID: 37098220 Free PMC article.
-
Impact of the Liberian National Community Health Assistant Program on childhood illness care in Grand Bassa County, Liberia.PLOS Glob Public Health. 2022 Jun 30;2(6):e0000668. doi: 10.1371/journal.pgph.0000668. eCollection 2022. PLOS Glob Public Health. 2022. PMID: 36962465 Free PMC article.
References
-
- Wang R, Strong D. Beyond accuracy: what data quality means to data consumers. J Manag Inf Syst. 1996;12(4):5–33. doi: 10.1080/07421222.1996.11518099. - DOI
-
- Agmon N, Ahituv N. Assessing data reliability in an information system. J Manag Inf Syst. 1987;4(2):34–44. doi: 10.1080/07421222.1987.11517792. - DOI
-
- Levitt SH, Aeppli DM, Potish RA, Lee CK, Nierengarten ME. Influences on inferences. Effect of errors in data on statistical evaluation. Cancer. 1993 Oct 01;72(7):2075–82. http://onlinelibrary.wiley.com/resolve/openurl?genre=article&sid=nlm:pub... - PubMed
-
- Barchard KA, Pace LA. Preventing human error: the impact of data entry methods on data accuracy and statistical results. Comput Human Behav. 2011 Sep;27(5):1834–1839. doi: 10.1016/j.chb.2011.04.004. - DOI
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources