Estimating uncertainty in respondent-driven sampling using a tree bootstrap method
- PMID: 27930328
- PMCID: PMC5187726
- DOI: 10.1073/pnas.1617258113
Abstract
Respondent-driven sampling (RDS) is a network-based form of chain-referral sampling used to estimate attributes of populations that are difficult to access using standard survey tools. Although it has grown quickly in popularity since its introduction, the statistical properties of RDS estimates remain elusive. In particular, the sampling variability of these estimates has been shown to be much higher than previously acknowledged, and even methods designed to account for RDS result in misleadingly narrow confidence intervals. In this paper, we introduce a tree bootstrap method for estimating uncertainty in RDS estimates based on resampling recruitment trees. We use simulations from known social networks to show that the tree bootstrap method not only outperforms existing methods but also captures the high variability of RDS, even in extreme cases with high design effects. We also apply the method to data from injecting drug users in Ukraine. Unlike other methods, the tree bootstrap depends only on the structure of the sampled recruitment trees, not on the attributes being measured on the respondents, so correlations between attributes can be estimated as well as variability. Our results suggest that it is possible to accurately assess the high level of uncertainty inherent in RDS.
Keywords: HIV; hard-to-reach population; injecting drug user; snowball sampling; social network.
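The abstract describes the tree bootstrap only as "resampling recruitment trees", so the following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes that seeds are resampled with replacement and that, at every level below, each selected respondent's recruits are in turn resampled with replacement. The names `children`, `seeds`, `values`, `tree_bootstrap_sample`, and `tree_bootstrap_interval` are hypothetical, the toy data are invented for illustration, and a plain sample mean stands in for whichever RDS estimator would be used in practice.

```python
import random
from statistics import mean


def tree_bootstrap_sample(children, seeds, rng):
    """Draw one bootstrap resample of respondent ids (repeats allowed).

    Assumed scheme: resample seeds with replacement, then recursively
    resample each selected node's recruits with replacement.
    """
    sample = []
    # Level 0: resample the seeds with replacement.
    frontier = [rng.choice(seeds) for _ in seeds]
    while frontier:
        sample.extend(frontier)
        next_frontier = []
        for node in frontier:
            recruits = children.get(node, [])
            # Draw as many recruits as this node originally had, with replacement.
            next_frontier.extend(rng.choice(recruits) for _ in recruits)
        frontier = next_frontier
    return sample


def tree_bootstrap_interval(children, seeds, values, n_boot=1000, alpha=0.05, seed=0):
    """Percentile interval for the mean of `values` over tree-bootstrap resamples."""
    rng = random.Random(seed)
    estimates = sorted(
        # Placeholder estimator: unweighted mean, not an RDS-specific estimator.
        mean(values[i] for i in tree_bootstrap_sample(children, seeds, rng))
        for _ in range(n_boot)
    )
    lower = estimates[int((alpha / 2) * n_boot)]
    upper = estimates[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Toy recruitment forest (invented): two seeds, recruiter -> list of recruits.
children = {"s1": ["a", "b"], "a": ["c"], "s2": ["d"]}
seeds = ["s1", "s2"]
# Hypothetical binary attribute measured on each respondent.
values = {"s1": 1, "a": 0, "b": 1, "c": 0, "s2": 0, "d": 1}

lower, upper = tree_bootstrap_interval(children, seeds, values)
print(f"95% percentile interval for the mean: ({lower:.2f}, {upper:.2f})")
```

Because each resample is a list of respondent ids rather than of attribute values, the same resamples can be reused for any attribute, which is consistent with the abstract's remark that the method depends only on the structure of the recruitment trees.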
Conflict of interest statement
The authors declare no conflict of interest.
