What is an intracluster correlation coefficient? Crucial concepts for primary care researchers
- PMID: 15209195
- PMCID: PMC1466680
- DOI: 10.1370/afm.141
What is an intracluster correlation coefficient? Crucial concepts for primary care researchers
Abstract
Background: Primary care research often involves clustered samples in which subjects are randomized at a group level but analyzed at an individual level. Analyses that do not take this clustering into account may report significance where none exists. This article explores the causes, consequences, and implications of cluster data.
Methods: Using a case study with accompanying equations, we show that clustered samples are not as statistically efficient as simple random samples.
Results: Similarity among subjects within preexisting groups or clusters reduces the variability of responses in a clustered sample, which erodes the power to detect true differences between study arms. This similarity is expressed by the intracluster correlation coefficient, or p (rho), which compares the within-group variance with the between-group variance. Rho is used in equations along with the cluster size and the number of clusters to calculate the effective sample size (ESS) in a clustered design. The ESS should be used to calculate power in the design phase of a clustered study. Appropriate accounting for similarities among subjects in a cluster almost always results in a net loss of power, requiring increased total subject recruitment. Increasing the number of clusters enhances power more efficiently than does increasing the number of subjects within a cluster.
Conclusions: Primary care research frequently uses clustered designs, whether consciously or unconsciously. Researchers must recognize and understand the implications of clusters to avoid costly sample size errors.
Figures
Similar articles
-
Power and sample size calculations for cluster randomized trials with binary outcomes when intracluster correlation coefficients vary by treatment arm.Clin Trials. 2022 Feb;19(1):42-51. doi: 10.1177/17407745211059845. Epub 2021 Dec 8. Clin Trials. 2022. PMID: 34879711 Free PMC article.
-
Sample size calculations for 3-level cluster randomized trials.Clin Trials. 2008;5(5):486-95. doi: 10.1177/1740774508096476. Clin Trials. 2008. PMID: 18827041
-
Unequal cluster sizes for trials in English and Welsh general practice: implications for sample size calculations.Stat Med. 2001 Feb 15;20(3):377-90. doi: 10.1002/1097-0258(20010215)20:3<377::aid-sim799>3.0.co;2-n. Stat Med. 2001. PMID: 11180308
-
Imputation strategies for missing continuous outcomes in cluster randomized trials.Biom J. 2008 Jun;50(3):329-45. doi: 10.1002/bimj.200710423. Biom J. 2008. PMID: 18537126 Review.
-
Generalized estimating equations in cluster randomized trials with a small number of clusters: Review of practice and simulation study.Clin Trials. 2016 Aug;13(4):445-9. doi: 10.1177/1740774516643498. Epub 2016 Apr 19. Clin Trials. 2016. PMID: 27094487 Review.
Cited by
-
Effect of Patient-Centered Medical Home on Preventive Services for Adolescents and Young Adults.Pediatrics. 2016 Jun;137(6):e20153813. doi: 10.1542/peds.2015-3813. Epub 2016 May 16. Pediatrics. 2016. PMID: 27244851 Free PMC article.
-
Effect of online intervention based on life skills for mental health, self-efficacy and coping skills among Arab adolescents in the Klang Valley, Malaysia: A cluster randomised controlled trial protocol.PLoS One. 2024 Feb 23;19(2):e0298627. doi: 10.1371/journal.pone.0298627. eCollection 2024. PLoS One. 2024. PMID: 38394185 Free PMC article.
-
Coronavirus disease 2019 population-based prevalence, risk factors, hospitalization, and fatality rates in southern Brazil.Int J Infect Dis. 2020 Nov;100:402-410. doi: 10.1016/j.ijid.2020.09.028. Epub 2020 Sep 16. Int J Infect Dis. 2020. PMID: 32949778 Free PMC article.
-
Sample Size Estimates for Cluster-Randomized Trials in Hospital Infection Control and Antimicrobial Stewardship.JAMA Netw Open. 2019 Oct 2;2(10):e1912644. doi: 10.1001/jamanetworkopen.2019.12644. JAMA Netw Open. 2019. PMID: 31584684 Free PMC article.
-
Prevalence, Risk Factors for Exposure, and Socio-Economic Impact of Peste Des Petits Ruminants in Karenga District, Karamoja Region, Uganda.Pathogens. 2022 Jan 2;11(1):54. doi: 10.3390/pathogens11010054. Pathogens. 2022. PMID: 35056002 Free PMC article.
References
-
- Donner A, Klar N. Design and Analysis of Cluster Randomization Trials in Health Research. American ed. New York, NY: Oxford University Press; 2000:9,112–113.
-
- Murray DM, Rooney BL, Hannan PJ, et al. Intraclass correlation among common measures of adolescent smoking. Am J Epidemiol. 1992;140:1038–1050. - PubMed
-
- Murray DM, Short BJ. Intraclass correlation among measures related to alcohol use by young adults. J Studies Alcohol. 1995;56: 681–694. - PubMed
-
- Murray DM, Short BJ. Intraclass correlation among measures related to alcohol use by adolescents Add Behav. 1997;22:1–12. - PubMed
For Further Reading:
-
- Donner A, Klar N. Design and Analysis of Cluster Randomization Trials in Health Research. American ed. New York, NY: Oxford University Press; 2000. [Entire book.]
-
- Cochran WG. Sampling Techniques. New York, NY: John Wiley and Sons; 1977.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical