Missing observations in regression: a conditional approach
- PMID: 36778961
- PMCID: PMC9905973
- DOI: 10.1098/rsos.220267
Missing observations in regression: a conditional approach
Abstract
This note presents an alternative to multiple imputation and other approaches to regression analysis in the presence of missing covariate data. Our recommendation, based on factorial and fractional factorial arrangements, is more faithful to ancillarity considerations of regression analysis and involves assessing the sensitivity of inference on each regression parameter to missingness in each of the explanatory variables. The ideas are illustrated on a medical example concerned with the success of hematopoietic stem cell transplantation in children, and on a sociological example concerned with socio-economic inequalities in educational attainment.
Keywords: EM algorithm; Hadamard matrix; ancillarity; fractional factorial; missing data; regression.
© 2023 The Authors.
Conflict of interest statement
We declare we have no competing interests.
Figures
References
-
- Fisher RA. 1925. Theory of statistical estimation. Proc. Camb. Philol. Soc. 22, 700-725. ( 10.1017/S0305004100009580) - DOI
-
- Efron BE. 1977. Discussion of ‘Maximum likelihood from incomplete data via the EM algorithm’ by Dempster, Liard and Rubin. J. R. Statist. Soc. B 39, 29. ( 10.1111/j.2517-6161.1977.tb01600.x) - DOI
-
- Efron BE. 1982. Maximum likelihood and decision theory. Ann. Statist. 10, 340-356. ( 10.1214/aos/1176345778) - DOI
-
- Sundberg R. 1974. Maximum likelihood theory for incomplete data from an exponential family. Scand. J. Statist. 1, 49-58.
-
- Dempster AP, Laird NM, Rubin DB. 1977. Maximum likelihood from incomplete data via the EM algorithm. J. R. Statist. Soc. B 39, 1-38. ( 10.1111/j.2517-6161.1977.tb01600.x) - DOI
Associated data
LinkOut - more resources
Full Text Sources
Other Literature Sources