Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Dec;7(6):451-9.
doi: 10.1186/cc2401. Epub 2003 Nov 5.

Statistics review 7: Correlation and regression

Affiliations

Statistics review 7: Correlation and regression

Viv Bewick et al. Crit Care. 2003 Dec.

Abstract

The present review introduces methods of analyzing the relationship between two quantitative variables. The calculation and interpretation of the sample product moment correlation coefficient and the linear regression equation are discussed and illustrated. Common misuses of the techniques are considered. Tests and confidence intervals for the population parameters are described, and failures of the underlying assumptions are highlighted.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Scatter diagram for ln urea and age
Figure 2
Figure 2
Correlation coefficient (r) = +0.9. Positive linear relationship.
Figure 3
Figure 3
Correlation coefficient (r) = -0.9. Negative linear relationship.
Figure 4
Figure 4
Correlation coefficient (r) = 0.04. No relationship.
Figure 5
Figure 5
Correlation coefficient (r) = -0.03. Nonlinear relationship.
Figure 6
Figure 6
Subgroups in the data resulting in a misleading correlation. All data: r = 0.57; males: r = -0.41; females: r = -0.26.
Figure 7
Figure 7
Regression line for ln urea and age: ln urea = 0.72 + (0.017 × age).
Figure 8
Figure 8
Regression line obtained by minimizing the sums of squares of all of the deviations.
Figure 9
Figure 9
Total, explained and unexplained deviations for a point.
Figure 10
Figure 10
Regression line, its 95% confidence interval and the 95% prediction interval for individual patients.
Figure 11
Figure 11
(a) Scatter diagram of y against x suggests that the relationship is nonlinear. (b) Plot of residuals against fitted values in panel a; the curvature of the relationship is shown more clearly. (c) Scatter diagram of y against x suggests that the variability in y increases with x. (d) Plot of residuals against fitted values for panel c; the increasing variability in y with x is shown more clearly.
Figure 12
Figure 12
Plot of residuals against fitted values for the accident and emergency unit data.
Figure 13
Figure 13
Normal plot of residuals for the accident and emergency unit data.

Similar articles

Cited by

References

    1. Whitley E, Ball J. Statistics review 1: Presenting and summarising data. Crit Care. 2002;6:66–71. doi: 10.1186/cc1455. - DOI - PMC - PubMed
    1. Kirkwood BR, Sterne JAC. Essential Medical Statistics. 2. Oxford: Blackwell Science; 2003.
    1. Whitley E, Ball J. Statistics review 2: Samples and populations. Crit Care. 2002;6:143–148. doi: 10.1186/cc1473. - DOI - PMC - PubMed
    1. Bland M. An Introduction to Medical Statistics. 3. Oxford: Oxford University Press; 2001.
    1. Bland M, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;i:307–310. - PubMed

LinkOut - more resources