Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 12;22(1):222.
doi: 10.1186/s12874-022-01699-2.

Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method

Affiliations

Cluster randomised trials with a binary outcome and a small number of clusters: comparison of individual and cluster level analysis method

Jennifer A Thompson et al. BMC Med Res Methodol. .

Abstract

Background: Cluster randomised trials (CRTs) are often designed with a small number of clusters, but it is not clear which analysis methods are optimal when the outcome is binary. This simulation study aimed to determine (i) whether cluster-level analysis (CL), generalised linear mixed models (GLMM), and generalised estimating equations with sandwich variance (GEE) approaches maintain acceptable type-one error including the impact of non-normality of cluster effects and low prevalence, and if so (ii) which methods have the greatest power. We simulated CRTs with 8-30 clusters, altering the cluster-size, outcome prevalence, intracluster correlation coefficient, and cluster effect distribution. We analysed each dataset with weighted and unweighted CL; GLMM with adaptive quadrature and restricted pseudolikelihood; GEE with Kauermann-and-Carroll and Fay-and-Graubard sandwich variance using independent and exchangeable working correlation matrices. P-values were from a t-distribution with degrees of freedom (DoF) as clusters minus cluster-level parameters; GLMM pseudolikelihood also used Satterthwaite and Kenward-Roger DoF.

Results: Unweighted CL, GLMM pseudolikelihood, and Fay-and-Graubard GEE with independent or exchangeable working correlation matrix controlled type-one error in > 97% scenarios with clusters minus parameters DoF. Cluster-effect distribution and prevalence of outcome did not usually affect analysis method performance. GEE had the least power. With 20-30 clusters, GLMM had greater power than CL with varying cluster-size but similar power otherwise; with fewer clusters, GLMM had lower power with common cluster-size, similar power with medium variation, and greater power with large variation in cluster-size.

Conclusion: We recommend that CRTs with ≤ 30 clusters and a binary outcome use an unweighted CL or restricted pseudolikelihood GLMM both with DoF clusters minus cluster-level parameters.

Keywords: Cluster level analysis; Cluster randomised trial; Cluster-level analysis; Comparison of methods; Generalised estimating equations; Generalised linear mixed model; Small number of clusters.

PubMed Disclaimer

Conflict of interest statement

The authors of this article have no competing interests to declare.

Figures

Fig. 1
Fig. 1
Performance measures of cluster-level analysis methods by number of clusters (rows), cluster size and outcome prevalence (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error. Each dot represents a scenario summarised over the 1000 repetitions. All 864 scenarios are shown for each measure
Fig. 2
Fig. 2
Performance measures of GLMM methods by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error
Fig. 3
Fig. 3
Performance measures of GEE methods by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error
Fig. 4
Fig. 4
Comparison of bias and type-one error of unweighted cluster-level analysis, GLMM with REPL and DFCP, and GEE with FG standard errors and DFCP by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error
Fig. 5
Fig. 5
Power comparison of unweighted cluster-level analysis (CL.UNW), GLMM with REPL and DFCP (REPL), and GEE with FG standard errors and DFCP (FG.I) (columns) by number of clusters (rows), ICC (y axis), and variability of cluster size (colour)
Fig. 6
Fig. 6
Motivating example results analysed by all methods considered in the simulation study. Left panel shows odds ratios and confidence intervals, right panel shows p values. Rows are analysis methods

Similar articles

Cited by

References

    1. Kahan BC, Forbes G, Ali Y, et al. Increased risk of type I errors in cluster randomised trials with small or medium numbers of clusters: a review, reanalysis, and simulation study. Trials. 2016;17:438. doi: 10.1186/s13063-016-1571-2. - DOI - PMC - PubMed
    1. Hayes RJ and Moulton LH. Cluster Randomised Trials. New York: CRC Press; 2017.
    1. Donner A, Klar N. Methods for comparing event rates in intervention studies when the unit of allocation is a cluster. Am J Epidemiol. 1994;140:279–289. doi: 10.1093/oxfordjournals.aje.a117247. - DOI - PubMed
    1. Boneau CA. The effects of violations of assumptions underlying the t test. Psychol Bull. 1960;57:49–64. doi: 10.1037/h0041412. - DOI - PubMed
    1. Elff M, Heisig P, Schaeffer M, et al. Multilevel Analysis with Few Clusters: Improving Likelihood-based Methods to Provide Unbiased Estimates and Accurate Inference. Br J Polit Sci. 2019;51(1):412–26. 10.1017/S0007123419000097.

Publication types

LinkOut - more resources