Determining the sample size for a cluster-randomised trial using knowledge elicitation: Bayesian hierarchical modelling of the intracluster correlation coefficient
- PMID: 37036110
- PMCID: PMC10262340
- DOI: 10.1177/17407745231164569
Determining the sample size for a cluster-randomised trial using knowledge elicitation: Bayesian hierarchical modelling of the intracluster correlation coefficient
Abstract
Background: The intracluster correlation coefficient is a key input parameter for sample size determination in cluster-randomised trials. Sample size is very sensitive to small differences in the intracluster correlation coefficient, so it is vital to have a robust intracluster correlation coefficient estimate. This is often problematic because either a relevant intracluster correlation coefficient estimate is not available or the available estimate is imprecise due to being based on small-scale studies with low numbers of clusters. Misspecification may lead to an underpowered or inefficiently large and potentially unethical trial.
Methods: We apply a Bayesian approach to produce an intracluster correlation coefficient estimate and hence propose sample size for a planned cluster-randomised trial of the effectiveness of a systematic voiding programme for post-stroke incontinence. A Bayesian hierarchical model is used to combine intracluster correlation coefficient estimates from other relevant trials making use of the wealth of intracluster correlation coefficient information available in published research. We employ knowledge elicitation process to assess the relevance of each intracluster correlation coefficient estimate to the planned trial setting. The team of expert reviewers assigned relevance weights to each study, and each outcome within the study, hence informing parameters of Bayesian modelling. To measure the performance of experts, agreement and reliability methods were applied.
Results: The 34 intracluster correlation coefficient estimates extracted from 16 previously published trials were combined in the Bayesian hierarchical model using aggregated relevance weights elicited from the experts. The intracluster correlation coefficients available from external sources were used to construct a posterior distribution of the targeted intracluster correlation coefficient which was summarised as a posterior median with a 95% credible interval informing researchers about the range of plausible sample size values. The estimated intracluster correlation coefficient determined a sample size of between 450 (25 clusters) and 480 (20 clusters), compared to 500-600 from a classical approach. The use of quantiles, and other parameters, from the estimated posterior distribution is illustrated and the impact on sample size described.
Conclusion: Accounting for uncertainty in an unknown intracluster correlation coefficient, trials can be designed with a more robust sample size. The approach presented provides the possibility of incorporating intracluster correlation coefficients from various cluster-randomised trial settings which can differ from the planned study, with the difference being accounted for in the modelling. By using expert knowledge to elicit relevance weights and synthesising the externally available intracluster correlation coefficient estimates, information is used more efficiently than in a classical approach, where the intracluster correlation coefficient estimates tend to be less robust and overly conservative. The intracluster correlation coefficient estimate constructed is likely to produce a smaller sample size on average than the conventional strategy of choosing a conservative intracluster correlation coefficient estimate. This may therefore result in substantial time and resources savings.
Keywords: Bayesian hierarchical model; cluster-randomised trial; intracluster correlation coefficient; knowledge elicitation; post-stroke incontinence; sample size determination.
Conflict of interest statement
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Figures



Similar articles
-
Intracluster correlation coefficients and coefficients of variation for perinatal outcomes from five cluster-randomised controlled trials in low and middle-income countries: results and methodological implications.Trials. 2011 Jun 14;12:151. doi: 10.1186/1745-6215-12-151. Trials. 2011. PMID: 21672223 Free PMC article. Clinical Trial.
-
Power and sample size calculations for cluster randomized trials with binary outcomes when intracluster correlation coefficients vary by treatment arm.Clin Trials. 2022 Feb;19(1):42-51. doi: 10.1177/17407745211059845. Epub 2021 Dec 8. Clin Trials. 2022. PMID: 34879711 Free PMC article.
-
Hybrid sample size calculations for cluster randomised trials using assurance.Clin Trials. 2025 Feb 11:17407745241312635. doi: 10.1177/17407745241312635. Online ahead of print. Clin Trials. 2025. PMID: 39935065
-
A systematic review of sample size estimation accuracy on power in malaria cluster randomised trials measuring epidemiological outcomes.BMC Med Res Methodol. 2024 Oct 15;24(1):238. doi: 10.1186/s12874-024-02361-9. BMC Med Res Methodol. 2024. PMID: 39407101 Free PMC article.
-
Analysing cluster randomised controlled trials using GLMM, GEE1, GEE2, and QIF: results from four case studies.BMC Med Res Methodol. 2023 Dec 13;23(1):293. doi: 10.1186/s12874-023-02107-z. BMC Med Res Methodol. 2023. PMID: 38093221 Free PMC article.
Cited by
-
Intracluster correlation coefficients from cluster randomized trials conducted within the NCI Community Oncology Research Program (NCORP).J Natl Cancer Inst Monogr. 2025 Mar 1;2025(68):65-72. doi: 10.1093/jncimonographs/lgae048. J Natl Cancer Inst Monogr. 2025. PMID: 39989039
-
Interventions for Compassion Fatigue in Healthcare Providers-A Systematic Review of Randomised Controlled Trials.Healthcare (Basel). 2024 Jan 11;12(2):171. doi: 10.3390/healthcare12020171. Healthcare (Basel). 2024. PMID: 38255060 Free PMC article. Review.
References
-
- Eldridge S, Kerry S. A practical guide to cluster randomised trials in health services research. New York: Wiley, 2012.
-
- Campbell MJ, Donner A, Klar N. Developments in cluster randomized trials and statistics in medicine. Stat Med 2007; 26: 2–19. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources