Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec;185(Suppl 2):S408-S436.
doi: 10.1111/rssa.12956. Epub 2022 Nov 6.

When survey science met web tracking: Presenting an error framework for metered data

Affiliations

When survey science met web tracking: Presenting an error framework for metered data

Oriol J Bosch et al. J R Stat Soc Ser A Stat Soc. 2022 Dec.

Abstract

Metered data, also called web-tracking data, are generally collected from a sample of participants who willingly install or configure, onto their devices, technologies that track digital traces left when people go online (e.g., URLs visited). Since metered data allow for the observation of online behaviours unobtrusively, it has been proposed as a useful tool to understand what people do online and what impacts this might have on online and offline phenomena. It is crucial, nevertheless, to understand its limitations. Although some research have explored the potential errors of metered data, a systematic categorisation and conceptualisation of these errors are missing. Inspired by the Total Survey Error, we present a Total Error framework for digital traces collected with Meters (TEM). The TEM framework (1) describes the data generation and the analysis process for metered data and (2) documents the sources of bias and variance that may arise in each step of this process. Using a case study we also show how the TEM can be applied in real life to identify, quantify and reduce metered data errors. Results suggest that metered data might indeed be affected by the error sources identified in our framework and, to some extent, biased. This framework can help improve the quality of both stand-alone metered data research projects, as well as foster the understanding of how and when survey and metered data can be combined.

Keywords: digital trace data; error framework; metered data; passive data; total survey error; web‐tracking.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Reproduction of the total survey error framework by Groves et al. (2009)
FIGURE 2
FIGURE 2
Data collection and analysis process for metered data. The stars indicate those processes that are different or non‐existent for opt‐in metered online panels.

References

    1. Amaya, A. , Biemer, P.P. & Kinyon, D. (2020) Total error in a big data world: adapting the TSE framework to big data. Journal of Survey Statistics and Methodology, 8, 89–119.
    1. Bach, R.L. , Kern, C. , Amaya, A. , Keusch, F. , Kreuter, F. , Hecht, J. et al. (2019) Predicting voting behavior using digital trace data. Social Science Computer Review, 39, 862–883. 089443931988289. Available at: 10.1177/0894439319882896 - DOI
    1. Barberá, P. (2015) Birds of the same feather tweet together: Bayesian ideal point estimation using twitter data. Political Analysis, 23, 76–91.
    1. Biemer, P.P. (2010) Total survey error: design, implementation, and evaluation. Public Opinion Quarterly, 74, 817–848.
    1. Bosch, O.J. (2022) Track me but not really: Tracking undercoverage in metered data collection . Available at: 10.31219/osf.io/2grpa. - DOI

LinkOut - more resources