Exploring the Sensitivity of Horn's Parallel Analysis to the Distributional Form of Random Data

Alexis Dinno¹

Affiliations

PMID: 20234802
PMCID: PMC2838619
DOI: 10.1080/00273170902938969

Exploring the Sensitivity of Horn's Parallel Analysis to the Distributional Form of Random Data

Alexis Dinno. Multivariate Behav Res. 2009 May.

. 2009 May;44(3):362-388.

doi: 10.1080/00273170902938969.

Author

Alexis Dinno¹

Affiliation

¹ Center for Tobacco Control Research and Education University of California, San Francisco.

PMID: 20234802
PMCID: PMC2838619
DOI: 10.1080/00273170902938969

Abstract

Horn's parallel analysis (PA) is the method of consensus in the literature on empirical methods for deciding how many components/factors to retain. Different authors have proposed various implementations of PA. Horn's seminal 1965 article, a 1996 article by Thompson and Daniel, and a 2004 article by Hayton, Allen, and Scarpello all make assertions about the requisite distributional forms of the random data generated for use in PA. Readily available software is used to test whether the results of PA are sensitive to several distributional prescriptions in the literature regarding the rank, normality, mean, variance, and range of simulated data on a portion of the National Comorbidity Survey Replication (Pennell et al., 2004) by varying the distributions in each PA. The results of PA were found not to vary by distributional assumption. The conclusion is that PA may be reliably performed with the computationally simplest distributional assumptions about the simulated data.

PubMed Disclaimer

Figures

**Figure 1**
Graphical illustration of parallel analysis on a simulated data set of 50 observations, across 20 variables, with two uncorrelated factors, and %50 total variance. The dashed line connects unadjusted eigenvalues of the observed data, the dotted line connects mean eigenvalues of 600 random 50*20 data sets, and the solid line connects adjusted eigenvalues (i.e. subtracting the mean eigenvalues minus one from the observed eigenvalues). The retention criterion is the point at which the adjusted eigenvalues cross the horizontal line at y=1, which is the same point at which the unadjusted eigenvalues cross the line of mean eigenvalues of the random data sets. The solid adjusted eigenvalue markers are those components (or factors, if using factor analysis) that are retained.

**Figure 2**
Histograms showing the distributions of the first variable from three of nine simulated data sets. All variables have five values (the integers from 1 to 5), and variable distributions based on different parameterizations of the Beta distribution plus an amount of uniform noise.

**Figure 3**
Figures 3a and 3b. Plot connecting the means (black) and 95% quantiles (grey) of 5000 random eigenvalues for simulated data sets with 75 observations and 50 variables for parallel analyses conducted with ten different random data distributions for principal components analysis (3a) and factor analysis (3b). The near perfect overlap of the means and quantiles across the entire range of factors with such a small sample size illustrates the absolute or virtual insensitivity of parallel analysis to the distributional form of simulated data.

See this image and copyright information in PMC

Cited by

Loneliness in Intimate Relationships Scale (LIRS): Development and Validation.
Rokach A, Sha'ked A, Ben-Artzi E. Rokach A, et al. Int J Environ Res Public Health. 2022 Oct 10;19(19):12970. doi: 10.3390/ijerph191912970. Int J Environ Res Public Health. 2022. PMID: 36232275 Free PMC article.
Data-driven human transcriptomic modules determined by independent component analysis.
Zhou W, Altman RB. Zhou W, et al. BMC Bioinformatics. 2018 Sep 17;19(1):327. doi: 10.1186/s12859-018-2338-4. BMC Bioinformatics. 2018. PMID: 30223787 Free PMC article.
Factor Retention in Exploratory Factor Analysis With Missing Data.
Goretzko D. Goretzko D. Educ Psychol Meas. 2022 Jun;82(3):444-464. doi: 10.1177/00131644211022031. Epub 2021 Jun 11. Educ Psychol Meas. 2022. PMID: 35444335 Free PMC article.
Characterizing intergenic transcription at RNA polymerase II binding sites in normal and cancer tissues.
de Langen P, Hammal F, Guéret E, Mouren JC, Spinelli L, Ballester B. de Langen P, et al. Cell Genom. 2023 Sep 29;3(10):100411. doi: 10.1016/j.xgen.2023.100411. eCollection 2023 Oct 11. Cell Genom. 2023. PMID: 37868033 Free PMC article.
Loaded: Gun involvement among opioid users.
Stein MD, Kenney SR, Anderson BJ, Bailey GL. Stein MD, et al. Drug Alcohol Depend. 2018 Jun 1;187:205-211. doi: 10.1016/j.drugalcdep.2018.03.015. Epub 2018 Apr 16. Drug Alcohol Depend. 2018. PMID: 29680676 Free PMC article.

See all "Cited by" articles

References

1. Allen SJ, Hubbard R. Regression equations for the latent roots of random data correlation matrices with unities on the diagonal. Multivariate Behavioral Research. 1986;21:393–96. - PubMed
1. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological) 1995;57:289–300.
1. Benjamini Y, Yekutieli D. The Control of the False Discovery Rate in Multiple Testing under Dependency. The Annals of Statistics. 2001;29:1165–1188.
1. Browne MW, Cudeck R. Alternative Ways of Assessing Model Fit. Sociological Methods & Research. 1992;21:230.
1. Cattell RB. The scree test for the number of factors. Multivariate Behavioral Research. 1966;1:245–276. - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
Molecular Biology Databases
- NIAID Data Ecosystem - Find datasets on Infectious and Immune-mediated Diseases

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Exploring the Sensitivity of Horn's Parallel Analysis to the Distributional Form of Random Data

Affiliation

Exploring the Sensitivity of Horn's Parallel Analysis to the Distributional Form of Random Data

Author

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Molecular Biology Databases