Stat Med. 2023 Oct 30;42(24):4333-4348.

doi: 10.1002/sim.9864. Epub 2023 Aug 7.

Rank intraclass correlation for clustered data

Shengxin Tu¹, Chun Li², Donglin Zeng³, Bryan E Shepherd¹

Affiliations

¹ Department of Biostatistics, Vanderbilt University, Nashville, Tennessee, USA.
² Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California, USA.
³ Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.

PMID: 37548059
PMCID: PMC10592008
DOI: 10.1002/sim.9864

Shengxin Tu et al. Stat Med. 2023.


Abstract

Clustered data are common in biomedical research. Observations in the same cluster are often more similar to each other than to observations from other clusters. The intraclass correlation coefficient (ICC), first introduced by R. A. Fisher, is frequently used to measure this degree of similarity. However, the ICC is sensitive to extreme values and skewed distributions, and depends on the scale of the data. It is also not applicable to ordered categorical data. We define the rank ICC as a natural extension of Fisher's ICC to the rank scale, and describe its corresponding population parameter. The rank ICC is simply interpreted as the rank correlation between a random pair of observations from the same cluster. We also extend the definition when the underlying distribution has more than two hierarchies. We describe estimation and inference procedures, show the asymptotic properties of our estimator, conduct simulations to evaluate its performance, and illustrate our method in three real data examples with skewed data, count data, and three-level ordered categorical data.

Keywords: clustered data; intraclass correlation; rank association measures.
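The abstract's interpretation of the rank ICC, as the rank correlation between a random pair of observations from the same cluster, can be illustrated with a minimal sketch. This is not the authors' estimator: it simply applies a Fisher-style pairwise ICC to pooled midranks, weights every cluster equally, and uses the unweighted pooled mean and variance, which is only a sensible pairing when cluster sizes are equal. The function names (`midranks`, `pairwise_icc`, `rank_icc_sketch`) and the simulated data are hypothetical.

```python
import math
import random
from collections import defaultdict


def midranks(values):
    """Average 1-based ranks, so tied values share the same rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of positions start+1 .. end+1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared
        start = end + 1
    return ranks


def pairwise_icc(values, clusters):
    """Fisher-style pairwise ICC with equal weight per cluster (assumption)."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    groups = defaultdict(list)
    for v, c in zip(values, clusters):
        groups[c].append(v - mean)
    terms = []
    for d in groups.values():
        k = len(d)
        if k < 2:
            continue  # a singleton cluster contributes no pairs
        # Sum of d_j * d_j' over ordered pairs j != j'.
        cross = sum(d) ** 2 - sum(e * e for e in d)
        terms.append(cross / (k * (k - 1)))
    return sum(terms) / len(terms) / variance


def rank_icc_sketch(values, clusters):
    """Apply the pairwise ICC to midranks of the pooled observations."""
    return pairwise_icc(midranks(values), clusters)


# Hypothetical simulated data: 50 clusters of 5 observations each,
# a normal cluster effect plus normal within-cluster noise.
rng = random.Random(0)
clusters, x = [], []
for c in range(50):
    effect = rng.gauss(0, 1)
    for _ in range(5):
        clusters.append(c)
        x.append(effect + rng.gauss(0, 1))
exp_x = [math.exp(v) for v in x]

rank_original = rank_icc_sketch(x, clusters)
rank_exponentiated = rank_icc_sketch(exp_x, clusters)
fisher_original = pairwise_icc(x, clusters)
fisher_exponentiated = pairwise_icc(exp_x, clusters)
```

Because exponentiation preserves the ordering of the observations, the rank-based value is unchanged by the transformation, whereas the value computed on the raw scale generally shifts. This mirrors the scale dependence of Fisher's ICC that the abstract describes.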

Conflict of interest statement

The authors declare no potential conflict of interest.

Figures

**FIGURE 1**
Parameters of rank ICC ($γ_{I}$) and Fisher’s ICC ($ρ_{I}$) as a function of the within-cluster correlation ($ρ$) of $X_{ij}$ under normality (Scenario I) and after exponentiating the data (Scenario II).

**FIGURE 2**
Bias and coverage of 95% confidence intervals for our estimator of $γ_{I}$ at different true values of $γ_{I}$ and different numbers of clusters under Scenarios I (normality), II (exponentiated outcomes), and III (exponentiated cluster means). The number of observations per cluster was set at 30.

**FIGURE 3**
Bias and coverage of 95% confidence intervals for our estimator of $γ_{I}$ at different true values of $γ_{I}$ and different cluster sizes under Scenarios I (normality), II (exponentiated outcomes), and III (exponentiated cluster means). The number of clusters was set at 200. “2–50” means the cluster size follows a uniform distribution from 2 to 50, “2/30” means half of the clusters have size 2 and half have 30.

**FIGURE 4**
Bias and coverage of 95% confidence intervals for our estimator of $γ_{I}$ at different true positive and negative values of $γ_{I}$ when the cluster size in the population was 2. The number of clusters was set at 200.

**FIGURE 5**
Root mean squared error (RMSE), bias, and empirical SE of estimates obtained by the four weighting approaches for our estimator of $γ_{I}$. “Equal clusters” refers to assigning equal weights to clusters, “Equal obs” refers to assigning equal weights to observations, “ESS” refers to the iterative weighting approach based on the effective sample size, and “Combination” refers to the iterative weighting approach based on the linear combination of equal weights for clusters and equal weights for observations. We set the tolerance of the two iterative approaches to 0.00001.

**FIGURE 6**
Parameters of rank ICC ($γ_{I}$) as a function of the within-cluster correlation ($ρ$) of $X_{ij}$ when data are continuous or discretized into ordered categorical variables with 3, 5, or 10 levels.

**FIGURE 7**
Bias and coverage of 95% confidence intervals for our estimators of $γ_{I2}$ and $γ_{I3}$ at different true values of $γ_{I2}$ and $γ_{I3}$ and different numbers of level-3 units. The number of level-2 units in a level-3 unit was set at 15. The number of level-1 units in a level-2 unit was set at 2.

**FIGURE 8**
Scatter plot of the first and second uACR measurements of each person in the example of albumin-creatinine ratio.

**FIGURE 9**
Histogram of numbers of seizures of children with untreated epilepsy from the 60 primary healthcare centers in the example of status epilepticus.

**FIGURE 10**
Scatter plot of PHQ-9 scores of male and female partners enrolled in the clustered randomized clinical trial in the example of Patient Health Questionnaire-9 score.
