. 2022 Aug 12;22(1):222.

doi: 10.1186/s12874-022-01699-2.

Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method

Jennifer A Thompson¹, Clemence Leyrat², Katherine L Fielding³, Richard J Hayes³

Affiliations

¹ Department of Infectious Disease, London School of Hygiene & Tropical Medicine, London, UK. Jennifer.Thompson@lshtm.ac.uk.
² Department of Medical Statistics, London School of Hygiene & Tropical Medicine, London, UK.
³ Department of Infectious Disease, London School of Hygiene & Tropical Medicine, London, UK.

PMID: 35962318
PMCID: PMC9375419
DOI: 10.1186/s12874-022-01699-2

Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method

Jennifer A Thompson et al. BMC Med Res Methodol. 2022.

. 2022 Aug 12;22(1):222.

doi: 10.1186/s12874-022-01699-2.

Authors

Jennifer A Thompson¹, Clemence Leyrat², Katherine L Fielding³, Richard J Hayes³

Affiliations

¹ Department of Infectious Disease, London School of Hygiene & Tropical Medicine, London, UK. Jennifer.Thompson@lshtm.ac.uk.
² Department of Medical Statistics, London School of Hygiene & Tropical Medicine, London, UK.
³ Department of Infectious Disease, London School of Hygiene & Tropical Medicine, London, UK.

PMID: 35962318
PMCID: PMC9375419
DOI: 10.1186/s12874-022-01699-2

Abstract

Background: Cluster randomised trials (CRTs) are often designed with a small number of clusters, but it is not clear which analysis methods are optimal when the outcome is binary. This simulation study aimed to determine (i) whether cluster-level analysis (CL), generalised linear mixed models (GLMM), and generalised estimating equations with sandwich variance (GEE) approaches maintain acceptable type-one error including the impact of non-normality of cluster effects and low prevalence, and if so (ii) which methods have the greatest power. We simulated CRTs with 8-30 clusters, altering the cluster-size, outcome prevalence, intracluster correlation coefficient, and cluster effect distribution. We analysed each dataset with weighted and unweighted CL; GLMM with adaptive quadrature and restricted pseudolikelihood; GEE with Kauermann-and-Carroll and Fay-and-Graubard sandwich variance using independent and exchangeable working correlation matrices. P-values were from a t-distribution with degrees of freedom (DoF) as clusters minus cluster-level parameters; GLMM pseudolikelihood also used Satterthwaite and Kenward-Roger DoF.

Results: Unweighted CL, GLMM pseudolikelihood, and Fay-and-Graubard GEE with independent or exchangeable working correlation matrix controlled type-one error in > 97% scenarios with clusters minus parameters DoF. Cluster-effect distribution and prevalence of outcome did not usually affect analysis method performance. GEE had the least power. With 20-30 clusters, GLMM had greater power than CL with varying cluster-size but similar power otherwise; with fewer clusters, GLMM had lower power with common cluster-size, similar power with medium variation, and greater power with large variation in cluster-size.

Conclusion: We recommend that CRTs with ≤ 30 clusters and a binary outcome use an unweighted CL or restricted pseudolikelihood GLMM both with DoF clusters minus cluster-level parameters.

Keywords: Cluster level analysis; Cluster randomised trial; Cluster-level analysis; Comparison of methods; Generalised estimating equations; Generalised linear mixed model; Small number of clusters.

PubMed Disclaimer

Conflict of interest statement

The authors of this article have no competing interests to declare.

Figures

**Fig. 1**
Performance measures of cluster-level analysis methods by number of clusters (rows), cluster size and outcome prevalence (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error. Each dot represents a scenario summarised over the 1000 repetitions. All 864 scenarios are shown for each measure

**Fig. 2**
Performance measures of GLMM methods by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error

**Fig. 3**
Performance measures of GEE methods by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error

**Fig. 4**
Comparison of bias and type-one error of unweighted cluster-level analysis, GLMM with REPL and DF_CP, and GEE with FG standard errors and DF_CP by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error

**Fig. 5**
Power comparison of unweighted cluster-level analysis (CL.UNW), GLMM with REPL and DF_CP (REPL), and GEE with FG standard errors and DF_CP (FG.I) (columns) by number of clusters (rows), ICC (y axis), and variability of cluster size (colour)

**Fig. 6**
Motivating example results analysed by all methods considered in the simulation study. Left panel shows odds ratios and confidence intervals, right panel shows p values. Rows are analysis methods

See this image and copyright information in PMC

Cited by

Re-analysis of data from cluster randomised trials to explore the impact of model choice on estimates of odds ratios: study protocol.
Hemming K, Thompson JY, Taljaard M, Watson SI, Kasza J, Thompson JA, Kahan BC, Copas AJ. Hemming K, et al. Trials. 2024 Dec 18;25(1):818. doi: 10.1186/s13063-024-08653-1. Trials. 2024. PMID: 39695707 Free PMC article.
Design of field trials for the evaluation of transmissible vaccines in animal populations.
Sheen JK, Kennedy-Shaffer L, Levy MZ, Metcalf CJE. Sheen JK, et al. PLoS Comput Biol. 2025 Feb 3;21(2):e1012779. doi: 10.1371/journal.pcbi.1012779. eCollection 2025 Feb. PLoS Comput Biol. 2025. PMID: 39899630 Free PMC article.
Impact of the WellCheck smartphone app linked to electronic health records on clinical outcomes in patients with type 2 diabetes: Study protocol for primary care-based, prospective, multicenter, cluster-randomized, pragmatic clinical trials.
Sang H, Kim S, Hwang J, Woo S, Hwang Y, Kim J, Lee S, Yon DK, Rhee SY. Sang H, et al. PLoS One. 2025 Aug 7;20(8):e0329003. doi: 10.1371/journal.pone.0329003. eCollection 2025. PLoS One. 2025. PMID: 40773465 Free PMC article.
Demystifying estimands in cluster-randomised trials.
Kahan BC, Blette BS, Harhay MO, Halpern SD, Jairath V, Copas A, Li F. Kahan BC, et al. Stat Methods Med Res. 2024 Jul;33(7):1211-1232. doi: 10.1177/09622802241254197. Epub 2024 May 23. Stat Methods Med Res. 2024. PMID: 38780480 Free PMC article.
Cluster randomized controlled trial analysis at the cluster level: The clan command.
Thompson JA, Leurent B, Nash S, Moulton LH, Hayes RJ. Thompson JA, et al. Stata J. 2023 Sep;23(3):754-773. doi: 10.1177/1536867X231196294. Epub 2023 Sep 22. Stata J. 2023. PMID: 37850046 Free PMC article.

See all "Cited by" articles

References

1. Kahan BC, Forbes G, Ali Y, et al. Increased risk of type I errors in cluster randomised trials with small or medium numbers of clusters: a review, reanalysis, and simulation study. Trials. 2016;17:438. doi: 10.1186/s13063-016-1571-2. - DOI - PMC - PubMed
1. Hayes RJ and Moulton LH. Cluster Randomised Trials. New York: CRC Press; 2017.
1. Donner A, Klar N. Methods for comparing event rates in intervention studies when the unit of allocation is a cluster. Am J Epidemiol. 1994;140:279–289. doi: 10.1093/oxfordjournals.aje.a117247. - DOI - PubMed
1. Boneau CA. The effects of violations of assumptions underlying the t test. Psychol Bull. 1960;57:49–64. doi: 10.1037/h0041412. - DOI - PubMed
1. Elff M, Heisig P, Schaeffer M, et al. Multilevel Analysis with Few Clusters: Improving Likelihood-based Methods to Provide Unbiased Estimates and Accurate Inference. Br J Polit Sci. 2019;51(1):412–26. 10.1017/S0007123419000097.

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method

Affiliations

Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources