Review
Dtsch Arztebl Int. 2010 Nov;107(44):776-82.
doi: 10.3238/arztebl.2010.0776. Epub 2010 Nov 5.

Linear regression analysis: part 14 of a series on evaluation of scientific publications

Astrid Schneider et al. Dtsch Arztebl Int. 2010 Nov.

Abstract

Background: Regression analysis is an important statistical method for the analysis of medical data. It enables the identification and characterization of relationships among multiple factors. It also enables the identification of prognostically relevant risk factors and the calculation of risk scores for individual prognostication.

Methods: This article is based on selected textbooks of statistics, a selective review of the literature, and our own experience.

Results: After a brief introduction of the uni- and multivariable regression models, illustrative examples are given to explain what the important considerations are before a regression analysis is performed, and how the results should be interpreted. The reader should then be able to judge whether the method has been used correctly and interpret the results appropriately.

Conclusion: The performance and interpretation of linear regression analysis are subject to a variety of pitfalls, which are discussed here in detail. The reader is made aware of common errors of interpretation through practical examples. Both the opportunities for applying linear regression analysis and its limitations are presented.

Figures

Figure 1
A scatter plot showing a linear relationship.
Figure 2
A scatter plot showing an exponential relationship. In this case, it would not be appropriate to compute a coefficient of determination or a regression line.
Figure 3
A scatter plot with the corresponding regression line and regression equation for the relationship between the dependent variable body weight (kg) and the independent variable height (m). r = Pearson's correlation coefficient; R-squared linear = coefficient of determination.
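
The quantities named in the Figure 3 caption (regression line, Pearson's r, and the coefficient of determination) can be illustrated with a minimal sketch. The function name and the height and weight values below are made up for illustration and are not taken from the article; the calculation is the ordinary least-squares fit of one dependent variable on one independent variable, in which R-squared equals the square of r.

```python
from math import sqrt


def fit_simple_linear_regression(x, y):
    """Least-squares fit of y = intercept + slope * x.

    Returns (slope, intercept, r, r_squared), where r is Pearson's
    correlation coefficient and r_squared is the coefficient of
    determination for this one-predictor model.
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")

    mean_x = sum(x) / n
    mean_y = sum(y) / n

    # Sums of squares and cross-products around the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must each contain at least two distinct values")

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / sqrt(sxx * syy)
    return slope, intercept, r, r * r


# Hypothetical example values (NOT data from the article).
heights_m = [1.55, 1.60, 1.65, 1.70, 1.75, 1.80, 1.85]
weights_kg = [52.0, 58.0, 61.0, 66.0, 70.0, 77.0, 80.0]

slope, intercept, r, r_squared = fit_simple_linear_regression(heights_m, weights_kg)
print(f"weight = {intercept:.1f} + {slope:.1f} * height")
print(f"r = {r:.3f}, R-squared = {r_squared:.3f}")
```

As the Figure 2 caption notes, these summaries are only meaningful when the scatter plot suggests a roughly linear relationship, so the data should be plotted before the fit is interpreted.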
