Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 May-Jun;2(3):204-8.
doi: 10.1370/afm.141.

What is an intracluster correlation coefficient? Crucial concepts for primary care researchers

Affiliations

What is an intracluster correlation coefficient? Crucial concepts for primary care researchers

Shersten Killip et al. Ann Fam Med. 2004 May-Jun.

Abstract

Background: Primary care research often involves clustered samples in which subjects are randomized at a group level but analyzed at an individual level. Analyses that do not take this clustering into account may report significance where none exists. This article explores the causes, consequences, and implications of cluster data.

Methods: Using a case study with accompanying equations, we show that clustered samples are not as statistically efficient as simple random samples.

Results: Similarity among subjects within preexisting groups or clusters reduces the variability of responses in a clustered sample, which erodes the power to detect true differences between study arms. This similarity is expressed by the intracluster correlation coefficient, or p (rho), which compares the within-group variance with the between-group variance. Rho is used in equations along with the cluster size and the number of clusters to calculate the effective sample size (ESS) in a clustered design. The ESS should be used to calculate power in the design phase of a clustered study. Appropriate accounting for similarities among subjects in a cluster almost always results in a net loss of power, requiring increased total subject recruitment. Increasing the number of clusters enhances power more efficiently than does increasing the number of subjects within a cluster.

Conclusions: Primary care research frequently uses clustered designs, whether consciously or unconsciously. Researchers must recognize and understand the implications of clusters to avoid costly sample size errors.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Two-level nesting, or clustering.
Figure 2.
Figure 2.
Three-level nesting.

Similar articles

Cited by

References

    1. Donner A, Klar N. Design and Analysis of Cluster Randomization Trials in Health Research. American ed. New York, NY: Oxford University Press; 2000:9,112–113.
    1. Murray DM, Rooney BL, Hannan PJ, et al. Intraclass correlation among common measures of adolescent smoking. Am J Epidemiol. 1992;140:1038–1050. - PubMed
    1. Murray DM, Short BJ. Intraclass correlation among measures related to alcohol use by young adults. J Studies Alcohol. 1995;56: 681–694. - PubMed
    1. Murray DM, Short BJ. Intraclass correlation among measures related to alcohol use by adolescents Add Behav. 1997;22:1–12. - PubMed

For Further Reading:

    1. Donner A, Klar N. Design and Analysis of Cluster Randomization Trials in Health Research. American ed. New York, NY: Oxford University Press; 2000. [Entire book.]
    1. Cochran WG. Sampling Techniques. New York, NY: John Wiley and Sons; 1977.

Publication types

LinkOut - more resources