When survey science met web tracking: Presenting an error framework for metered data
- PMID: 37064430
- PMCID: PMC10100245
- DOI: 10.1111/rssa.12956
When survey science met web tracking: Presenting an error framework for metered data
Abstract
Metered data, also called web-tracking data, are generally collected from a sample of participants who willingly install or configure, onto their devices, technologies that track digital traces left when people go online (e.g., URLs visited). Since metered data allow for the observation of online behaviours unobtrusively, it has been proposed as a useful tool to understand what people do online and what impacts this might have on online and offline phenomena. It is crucial, nevertheless, to understand its limitations. Although some research have explored the potential errors of metered data, a systematic categorisation and conceptualisation of these errors are missing. Inspired by the Total Survey Error, we present a Total Error framework for digital traces collected with Meters (TEM). The TEM framework (1) describes the data generation and the analysis process for metered data and (2) documents the sources of bias and variance that may arise in each step of this process. Using a case study we also show how the TEM can be applied in real life to identify, quantify and reduce metered data errors. Results suggest that metered data might indeed be affected by the error sources identified in our framework and, to some extent, biased. This framework can help improve the quality of both stand-alone metered data research projects, as well as foster the understanding of how and when survey and metered data can be combined.
Keywords: digital trace data; error framework; metered data; passive data; total survey error; web‐tracking.
© 2022 The Authors. Journal of the Royal Statistical Society: Series A (Statistics in Society) published by John Wiley & Sons Ltd on behalf of Royal Statistical Society.
Figures
References
-
- Amaya, A. , Biemer, P.P. & Kinyon, D. (2020) Total error in a big data world: adapting the TSE framework to big data. Journal of Survey Statistics and Methodology, 8, 89–119.
-
- Bach, R.L. , Kern, C. , Amaya, A. , Keusch, F. , Kreuter, F. , Hecht, J. et al. (2019) Predicting voting behavior using digital trace data. Social Science Computer Review, 39, 862–883. 089443931988289. Available at: 10.1177/0894439319882896 - DOI
-
- Barberá, P. (2015) Birds of the same feather tweet together: Bayesian ideal point estimation using twitter data. Political Analysis, 23, 76–91.
-
- Biemer, P.P. (2010) Total survey error: design, implementation, and evaluation. Public Opinion Quarterly, 74, 817–848.
-
- Bosch, O.J. (2022) Track me but not really: Tracking undercoverage in metered data collection . Available at: 10.31219/osf.io/2grpa. - DOI
LinkOut - more resources
Full Text Sources
Miscellaneous