Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Sep;39(5):316-327.
doi: 10.1111/opo.12636. Epub 2019 Aug 18.

Should Pearson's correlation coefficient be avoided?

Affiliations
Review

Should Pearson's correlation coefficient be avoided?

Richard A Armstrong. Ophthalmic Physiol Opt. 2019 Sep.

Abstract

Purpose: To survey the use of Pearson's correlation coefficient (r) and related statistical methods in the ophthalmic literature, to consider the limitations of r, and to suggest suitable alternative methods of analysis.

Recent findings: Searching Ophthalmic and Physiological Optics (OPO), Optometry and Vision Science (OVS), and Clinical and Experimental Optometry (CXO) online archives using correlation and Pearson's r as search terms resulted in 4057 and 281 hits respectively. Coefficient of determination, r square, or r squared received fewer hits (65, 8, and 22 hits respectively). The assumption that r follows a bivariate normal distribution was rarely encountered (3 hits) although several studies applied Spearman's rank correlation (70 hits). The intra-class correlation coefficient (ICC) was widely used (178 hits), but fewer hits were recorded for partial correlation (43 hits) and multiple correlation (13) hits. There was little evidence that the problem of sample size was addressed in correlation studies.

Summary: Investigators should be alert to whether: (1) the relationship between two variables could be non-linear, (2) the data are bivariate normal, (3) r accounts for a significant proportion of the variance in Y, (4) outliers are present, the data are clustered, or have a restricted range, (5) the sample size is appropriate, and (6) a significant correlation indicates causality. In addition, the number of significant digits used to express r and the problems of multiple testing should be addressed. The problems and limitations of r suggest a more cautious approach regarding its use and the application of alternative methods where appropriate.

Keywords: Pearson's correlation coefficient (r); bivariate normal distribution; correlation; curvilinear regression; partial correlation; range restriction.

PubMed Disclaimer

References

    1. Armstrong RA, Eperjesi F & Gilmartin B. The use of correlation and regression methods in optometry. Clin Exp Optom 2005; 88: 81-88.
    1. Pearson K & Lee A. On the laws of inheritance in man. I. Inheritance of physical characteristics. Biometrika 1902; 2: 357.
    1. Haegerstrom-Portnoy G, Schneck ME, Lott LA & Brabyn JA. The relation between visual acuity and other spatial vision measures. Optom Vis Sci 2000; 77: 653-662.
    1. Snedecor GW & Cochran WG. Statistical Methods, 7th edn. Iowa State University Press: Ames, IA, 1980.
    1. Armstrong RA & Hilton A. Statistical Analysis in Microbiology: Statnotes. Wiley-Blackwell: Hoboken, NJ, 2011.

LinkOut - more resources