Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method
- PMID: 35962318
- PMCID: PMC9375419
- DOI: 10.1186/s12874-022-01699-2
Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method
Abstract
Background: Cluster randomised trials (CRTs) are often designed with a small number of clusters, but it is not clear which analysis methods are optimal when the outcome is binary. This simulation study aimed to determine (i) whether cluster-level analysis (CL), generalised linear mixed models (GLMM), and generalised estimating equations with sandwich variance (GEE) approaches maintain acceptable type-one error including the impact of non-normality of cluster effects and low prevalence, and if so (ii) which methods have the greatest power. We simulated CRTs with 8-30 clusters, altering the cluster-size, outcome prevalence, intracluster correlation coefficient, and cluster effect distribution. We analysed each dataset with weighted and unweighted CL; GLMM with adaptive quadrature and restricted pseudolikelihood; GEE with Kauermann-and-Carroll and Fay-and-Graubard sandwich variance using independent and exchangeable working correlation matrices. P-values were from a t-distribution with degrees of freedom (DoF) as clusters minus cluster-level parameters; GLMM pseudolikelihood also used Satterthwaite and Kenward-Roger DoF.
Results: Unweighted CL, GLMM pseudolikelihood, and Fay-and-Graubard GEE with independent or exchangeable working correlation matrix controlled type-one error in > 97% scenarios with clusters minus parameters DoF. Cluster-effect distribution and prevalence of outcome did not usually affect analysis method performance. GEE had the least power. With 20-30 clusters, GLMM had greater power than CL with varying cluster-size but similar power otherwise; with fewer clusters, GLMM had lower power with common cluster-size, similar power with medium variation, and greater power with large variation in cluster-size.
Conclusion: We recommend that CRTs with ≤ 30 clusters and a binary outcome use an unweighted CL or restricted pseudolikelihood GLMM both with DoF clusters minus cluster-level parameters.
Keywords: Cluster level analysis; Cluster randomised trial; Cluster-level analysis; Comparison of methods; Generalised estimating equations; Generalised linear mixed model; Small number of clusters.
© 2022. The Author(s).
Conflict of interest statement
The authors of this article have no competing interests to declare.
Figures






Similar articles
-
Comparison of small-sample standard-error corrections for generalised estimating equations in stepped wedge cluster randomised trials with a binary outcome: A simulation study.Stat Methods Med Res. 2021 Feb;30(2):425-439. doi: 10.1177/0962280220958735. Epub 2020 Sep 24. Stat Methods Med Res. 2021. PMID: 32970526 Free PMC article.
-
Comparing denominator degrees of freedom approximations for the generalized linear mixed model in analyzing binary outcome in small sample cluster-randomized trials.BMC Med Res Methodol. 2015 Apr 23;15:38. doi: 10.1186/s12874-015-0026-x. BMC Med Res Methodol. 2015. PMID: 25899170 Free PMC article.
-
Small sample performance of bias-corrected sandwich estimators for cluster-randomized trials with binary outcomes.Stat Med. 2015 Jan 30;34(2):281-96. doi: 10.1002/sim.6344. Epub 2014 Oct 24. Stat Med. 2015. PMID: 25345738 Free PMC article.
-
Analysing cluster randomised controlled trials using GLMM, GEE1, GEE2, and QIF: results from four case studies.BMC Med Res Methodol. 2023 Dec 13;23(1):293. doi: 10.1186/s12874-023-02107-z. BMC Med Res Methodol. 2023. PMID: 38093221 Free PMC article.
-
Generalized estimating equations in cluster randomized trials with a small number of clusters: Review of practice and simulation study.Clin Trials. 2016 Aug;13(4):445-9. doi: 10.1177/1740774516643498. Epub 2016 Apr 19. Clin Trials. 2016. PMID: 27094487 Review.
Cited by
-
Re-analysis of data from cluster randomised trials to explore the impact of model choice on estimates of odds ratios: study protocol.Trials. 2024 Dec 18;25(1):818. doi: 10.1186/s13063-024-08653-1. Trials. 2024. PMID: 39695707 Free PMC article.
-
Design of field trials for the evaluation of transmissible vaccines in animal populations.PLoS Comput Biol. 2025 Feb 3;21(2):e1012779. doi: 10.1371/journal.pcbi.1012779. eCollection 2025 Feb. PLoS Comput Biol. 2025. PMID: 39899630 Free PMC article.
-
Impact of the WellCheck smartphone app linked to electronic health records on clinical outcomes in patients with type 2 diabetes: Study protocol for primary care-based, prospective, multicenter, cluster-randomized, pragmatic clinical trials.PLoS One. 2025 Aug 7;20(8):e0329003. doi: 10.1371/journal.pone.0329003. eCollection 2025. PLoS One. 2025. PMID: 40773465 Free PMC article.
-
Demystifying estimands in cluster-randomised trials.Stat Methods Med Res. 2024 Jul;33(7):1211-1232. doi: 10.1177/09622802241254197. Epub 2024 May 23. Stat Methods Med Res. 2024. PMID: 38780480 Free PMC article.
-
Cluster randomized controlled trial analysis at the cluster level: The clan command.Stata J. 2023 Sep;23(3):754-773. doi: 10.1177/1536867X231196294. Epub 2023 Sep 22. Stata J. 2023. PMID: 37850046 Free PMC article.
References
-
- Hayes RJ and Moulton LH. Cluster Randomised Trials. New York: CRC Press; 2017.
-
- Elff M, Heisig P, Schaeffer M, et al. Multilevel Analysis with Few Clusters: Improving Likelihood-based Methods to Provide Unbiased Estimates and Accurate Inference. Br J Polit Sci. 2019;51(1):412–26. 10.1017/S0007123419000097.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources