Unrepresentative big surveys significantly overestimated US vaccine uptake
- PMID: 34880504
- PMCID: PMC8653636
- DOI: 10.1038/s41586-021-04198-4
Unrepresentative big surveys significantly overestimated US vaccine uptake
Abstract
Surveys are a crucial tool for understanding public opinion and behaviour, and their accuracy depends on maintaining statistical representativeness of their target populations by minimizing biases from all sources. Increasing data size shrinks confidence intervals but magnifies the effect of survey bias: an instance of the Big Data Paradox1. Here we demonstrate this paradox in estimates of first-dose COVID-19 vaccine uptake in US adults from 9 January to 19 May 2021 from two large surveys: Delphi-Facebook2,3 (about 250,000 responses per week) and Census Household Pulse4 (about 75,000 every two weeks). In May 2021, Delphi-Facebook overestimated uptake by 17 percentage points (14-20 percentage points with 5% benchmark imprecision) and Census Household Pulse by 14 (11-17 percentage points with 5% benchmark imprecision), compared to a retroactively updated benchmark the Centers for Disease Control and Prevention published on 26 May 2021. Moreover, their large sample sizes led to miniscule margins of error on the incorrect estimates. By contrast, an Axios-Ipsos online panel5 with about 1,000 responses per week following survey research best practices6 provided reliable estimates and uncertainty quantification. We decompose observed error using a recent analytic framework1 to explain the inaccuracy in the three surveys. We then analyse the implications for vaccine hesitancy and willingness. We show how a survey of 250,000 respondents can produce an estimate of the population mean that is no more accurate than an estimate from a simple random sample of size 10. Our central message is that data quality matters more than data quantity, and that compensating the former with the latter is a mathematically provable losing proposition.
© 2021. The Author(s), under exclusive licence to Springer Nature Limited.
Conflict of interest statement
The authors declare no competing interests.
Figures








Comment in
-
What surveys really say.Nature. 2021 Dec;600(7890):614-615. doi: 10.1038/d41586-021-03604-1. Nature. 2021. PMID: 34880485 No abstract available.
Similar articles
-
Leveraging 13 million responses to the U.S. COVID-19 Trends and Impact Survey to examine vaccine hesitancy, vaccination, and mask wearing, January 2021-February 2022.BMC Public Health. 2022 Oct 13;22(1):1911. doi: 10.1186/s12889-022-14286-3. BMC Public Health. 2022. PMID: 36229804 Free PMC article.
-
Understanding the determinants of vaccine hesitancy in the United States: A comparison of social surveys and social media.PLoS One. 2024 Jun 6;19(6):e0301488. doi: 10.1371/journal.pone.0301488. eCollection 2024. PLoS One. 2024. PMID: 38843170 Free PMC article.
-
COVID-19 Vaccine Acceptance and Uptake in Bangkok, Thailand: Cross-sectional Online Survey.JMIR Public Health Surveill. 2023 Apr 13;9:e40186. doi: 10.2196/40186. JMIR Public Health Surveill. 2023. PMID: 36811852 Free PMC article.
-
COVID-19 Vaccine Hesitancy Among Chinese Population: A Large-Scale National Study.Front Immunol. 2021 Nov 29;12:781161. doi: 10.3389/fimmu.2021.781161. eCollection 2021. Front Immunol. 2021. PMID: 34912346 Free PMC article.
-
COVID-19 vaccine hesitancy in Turkey: A systematic review and meta-analysis.Epidemiol Infect. 2023 Nov 24;151:e199. doi: 10.1017/S0950268823001875. Epidemiol Infect. 2023. PMID: 37997650 Free PMC article.
Cited by
-
Priming close social contact protective behaviors enhances protective social norms perceptions, protection views, and self-protective behaviors during disasters.Int J Disaster Risk Reduct. 2022 Oct 1;80:103135. doi: 10.1016/j.ijdrr.2022.103135. Epub 2022 Jun 27. Int J Disaster Risk Reduct. 2022. PMID: 35784266 Free PMC article.
-
Pregnant women's attitudes and behaviours towards antenatal vaccination against Influenza and COVID-19 in the Liverpool City Region, United Kingdom: Cross-sectional survey.Vaccine X. 2023 Sep 16;15:100387. doi: 10.1016/j.jvacx.2023.100387. eCollection 2023 Dec. Vaccine X. 2023. PMID: 37753114 Free PMC article.
-
Disparities in mobile phone ownership reflect inequities in access to healthcare.PLOS Digit Health. 2023 Jul 6;2(7):e0000270. doi: 10.1371/journal.pdig.0000270. eCollection 2023 Jul. PLOS Digit Health. 2023. PMID: 37410708 Free PMC article.
-
Vaccine rhetoric on social media and COVID-19 vaccine uptake rates: A triangulation using self-reported vaccine acceptance.Soc Sci Med. 2024 May;348:116775. doi: 10.1016/j.socscimed.2024.116775. Epub 2024 Mar 15. Soc Sci Med. 2024. PMID: 38579627 Free PMC article.
-
REFORMS: Consensus-based Recommendations for Machine-learning-based Science.Sci Adv. 2024 May 3;10(18):eadk3452. doi: 10.1126/sciadv.adk3452. Epub 2024 May 1. Sci Adv. 2024. PMID: 38691601 Free PMC article. Review.
References
-
- Meng X-L. Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox, and the 2016 US presidential election. Ann. Appl. Stat. 2018;12:685–726. doi: 10.1214/18-AOAS1161SF. - DOI
-
- Barkay, N. et al. Weights and methodology brief for the COVID-19 Symptom Survey by University of Maryland and Carnegie Mellon University, in partnership with Facebook. Preprint at https://arxiv.org/abs/2009.14675 (2020).
-
- Kreuter F, et al. Partnering with Facebook on a university-based rapid turn-around global survey. Surv. Res. Methods. 2020;14:159–163.
-
- Fields, J. F. et al. Design and Operation of the 2020 Household Pulse Survey (U.S. Census Bureau, 2020).
-
- Jackson, C., Newall, M. & Yi, J. Axios Ipsos Coronavirus Index (2021).
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical