Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2010;Spec No 1(1):14-39.

Examining distributional characteristics of clusters

Affiliations
  • PMID: 20653176
Review

Examining distributional characteristics of clusters

A von Eye. Bull Soc Sci Med Grand Duche Luxemb. 2010.

Abstract

Standard cluster analysis creates clusters based on the criterion that their members be closer to each other than to members of other clusters. In this article, it is proposed to examine empirical clusters that result from standard clustering, with the goal of assessing whether they contradict distributional assumptions. Four models are proposed. The models consider two data generation processes, the Poisson and the multinormal, as well as two convex shapes of cluster hulls, the spherical and the ellipsoidal. Based on the model, the probability of being in a cluster of a given location, size, and shape is estimated. This probability is compared with the observed proportion of cases. The observed proportion can turn out to be larger, as large, or smaller than expected. Examples are given using simulated and empirical data. The simulation showed that the size of a cluster, the data generation process, and the true distribution of data have the strongest effect on the results obtained with the proposed method. The empirical examples discuss distributional characteristics of cross-sectional and longitudinal clusters of aggressive behavior in adolescents. The examples show that clustering methods do not always yield clusters that contradict distributional assumptions. Some clusters contain even fewer cases than expected.

PubMed Disclaimer

Similar articles