Introduction to statistical modelling 2: categorical variables and interactions in linear regression
- PMID: 23681393
- DOI: 10.1093/rheumatology/ket172
Introduction to statistical modelling 2: categorical variables and interactions in linear regression
Abstract
In the first article in this series we explored the use of linear regression to predict an outcome variable from a number of predictive factors. It assumed that the predictive factors were measured on an interval scale. However, this article shows how categorical variables can also be included in a linear regression model, enabling predictions to be made separately for different groups and allowing for testing the hypothesis that the outcome differs between groups. The use of interaction terms to measure whether the effect of a particular predictor variable differs between groups is also explained. An alternative approach to testing the difference between groups of the effect of a given predictor, which consists of measuring the effect in each group separately and seeing whether the statistical significance differs between the groups, is shown to be misleading.
Keywords: categorical variable; dummy variable; indicator variable; interaction; linear regression.
© The Author 2013. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Comment in
-
Statistical modelling: essentially, all models are wrong, but some are useful. Review series on statistical modelling.Rheumatology (Oxford). 2015 Jul;54(7):1133-4. doi: 10.1093/rheumatology/kev116. Epub 2015 May 13. Rheumatology (Oxford). 2015. PMID: 25972388 No abstract available.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources