COVID-19 surveillance data quality issues: a national consecutive case series
- PMID: 34872992
- PMCID: PMC8649880
- DOI: 10.1136/bmjopen-2020-047623
Abstract
Objectives: High-quality data are crucial for guiding decision-making and practising evidence-based healthcare, especially if previous knowledge is lacking. Nevertheless, data quality frailties have been exposed worldwide during the current COVID-19 pandemic. Focusing on a major Portuguese epidemiological surveillance dataset, our study aims to assess COVID-19 data quality issues and suggest possible solutions.
Settings: On 27 April 2020, the Portuguese Directorate-General of Health (DGS) made available a dataset (DGSApril) for researchers, upon request. On 4 August, an updated dataset (DGSAugust) was also obtained.
Participants: All confirmed COVID-19 cases notified through the medical component of the National System for Epidemiological Surveillance until the end of June.
Primary and secondary outcome measures: Data completeness and consistency.
Results: DGSAugust did not follow the same data format and variables as DGSApril, and a significant amount of missing data and inconsistencies were found (eg, 4075 cases from DGSApril were apparently not included in DGSAugust). Several variables also showed a low degree of completeness and/or changed their values from one dataset to the other (eg, for the variable 'underlying conditions', more than half of cases showed different information between datasets). There were also significant inconsistencies between the numbers of cases and deaths due to COVID-19 shown in DGSAugust and those in the daily public DGS reports.
Conclusions: Important quality issues in the Portuguese COVID-19 surveillance datasets were described. These issues can limit the usability of surveillance data for informing good decisions and performing useful research. Major improvements in surveillance datasets are therefore urgently needed (for example, simplification of data entry processes, constant monitoring of data, and increased training and awareness of healthcare providers), as low data quality may lead to deficient pandemic control.
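As an illustration of the completeness and consistency measures named above, the following is a minimal Python sketch and not the authors' method. It assumes the two extracts share a stable case identifier, which the abstract does not confirm. The field names (`case_id`, `age`, `underlying_conditions`) and the toy records are hypothetical and do not come from the DGS datasets.

```python
# Toy extracts. Field names and values are hypothetical, not the real DGS variables.
april = [
    {"case_id": 1, "age": 34, "underlying_conditions": "none"},
    {"case_id": 2, "age": None, "underlying_conditions": "diabetes"},
    {"case_id": 3, "age": 71, "underlying_conditions": None},
]
august = [
    {"case_id": 1, "age": 34, "underlying_conditions": "asthma"},
    {"case_id": 2, "age": 58, "underlying_conditions": "diabetes"},
]


def completeness(records, variable):
    """Share of records with a non-missing value for `variable`."""
    if not records:
        return 0.0
    filled = sum(1 for r in records if r.get(variable) not in (None, ""))
    return filled / len(records)


def missing_cases(earlier, later, key="case_id"):
    """IDs present in the earlier extract but absent from the later one."""
    later_ids = {r[key] for r in later}
    return sorted(r[key] for r in earlier if r[key] not in later_ids)


def changed_values(earlier, later, variable, key="case_id"):
    """IDs found in both extracts whose value for `variable` differs."""
    later_by_id = {r[key]: r for r in later}
    return sorted(
        r[key]
        for r in earlier
        if r[key] in later_by_id
        and r.get(variable) != later_by_id[r[key]].get(variable)
    )


if __name__ == "__main__":
    print("age completeness (April):", completeness(april, "age"))
    print("cases dropped in August:", missing_cases(april, august))
    print("changed 'underlying_conditions':",
          changed_values(april, august, "underlying_conditions"))
```

Without a shared identifier, records would have to be matched on a combination of other variables, which makes both the dropped-case and changed-value counts approximate.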
Keywords: COVID-19; epidemiology; health informatics; information management; public health; statistics & research methods.
© Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.
Conflict of interest statement
Competing interests: None declared.
Similar articles
- Timeliness and completeness of laboratory-based surveillance of COVID-19 cases in England. Public Health. 2021 May;194:163-166. doi: 10.1016/j.puhe.2021.03.012. Epub 2021 Apr 1. PMID: 33945929 Free PMC article.
- COVID-19 Case Surveillance: Trends in Person-Level Case Data Completeness, United States, April 5-September 30, 2020. Public Health Rep. 2021 Jul-Aug;136(4):466-474. doi: 10.1177/00333549211006973. Epub 2021 Mar 31. PMID: 33789540 Free PMC article.
- Surveillance Metrics of SARS-CoV-2 Transmission in Central Asia: Longitudinal Trend Analysis. J Med Internet Res. 2021 Feb 3;23(2):e25799. doi: 10.2196/25799. PMID: 33475513 Free PMC article.
- COVID-19 on the Nile: Review on the Management and Outcomes of the COVID-19 Pandemic in the Arab Republic of Egypt from February to August 2020. Int J Environ Res Public Health. 2021 Feb 8;18(4):1588. doi: 10.3390/ijerph18041588. PMID: 33567519 Free PMC article. Review.
- Thoracic imaging tests for the diagnosis of COVID-19. Cochrane Database Syst Rev. 2020 Sep 30;9:CD013639. doi: 10.1002/14651858.CD013639.pub2. Update in: Cochrane Database Syst Rev. 2020 Nov 26;11:CD013639. doi: 10.1002/14651858.CD013639.pub3. PMID: 32997361 Updated.
Cited by
- Empowering open data sharing for social good: a privacy-aware approach. Sci Data. 2025 Feb 12;12(1):248. doi: 10.1038/s41597-025-04506-x. PMID: 39939361 Free PMC article.
- Vaccine effectiveness of inactivated and mRNA COVID-19 vaccine platform during Delta and Omicron wave in Jakarta, Indonesia: A test-negative case-control study. PLoS One. 2025 Jun 9;20(6):e0320779. doi: 10.1371/journal.pone.0320779. eCollection 2025. PMID: 40489510 Free PMC article.
- Challenges and Opportunities for Global Genomic Surveillance Strategies in the COVID-19 Era. Viruses. 2022 Nov 16;14(11):2532. doi: 10.3390/v14112532. PMID: 36423141 Free PMC article. Review.
- Public health surveillance perspectives from provincial COVID-19 experiences, South Africa 2021. Jamba. 2024 Oct 17;16(1):1625. doi: 10.4102/jamba.v16i1.1625. eCollection 2024. PMID: 39507563 Free PMC article.
- Consistency as a Data Quality Measure for German Corona Consensus Items Mapped from National Pandemic Cohort Network Data Collections. Methods Inf Med. 2023 Jun;62(S 01):e47-e56. doi: 10.1055/a-2006-1086. Epub 2023 Jan 3. PMID: 36596462 Free PMC article.