Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 17;15(7):e0236067.
doi: 10.1371/journal.pone.0236067. eCollection 2020.

Variable selection in multivariate multiple regression

Affiliations

Variable selection in multivariate multiple regression

Asokan Mulayath Variyath et al. PLoS One. .

Abstract

Introduction: In many practical situations, we are interested in the effect of covariates on correlated multiple responses. In this paper, we focus on estimation and variable selection in multi-response multiple regression models. Correlation among the response variables must be modeled for valid inference.

Method: We used an extension of the generalized estimating equation (GEE) methodology to simultaneously analyze binary, count, and continuous outcomes with nonlinear functions. Variable selection plays an important role in modeling correlated responses because of the large number of model parameters that must be estimated. We propose a penalized-likelihood approach based on the extended GEEs for simultaneous parameter estimation and variable selection.

Results and conclusions: We conducted a series of Monte Carlo simulations to investigate the performance of our method, considering different sample sizes and numbers of response variables. The results showed that our method works well compared to treating the responses as uncorrelated. We recommend using an unstructured correlation model with the Bayesian information criterion (BIC) to select the tuning parameters. We demonstrated our method using data from a concrete slump test.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Scatter plot indicating the relationship between slump (Y1), flow (Y2), and compressive strength (Y3).

References

    1. Yeh I-C. (2006). Exploring concrete slump model using artificial neural networks. Journal of Computing in Civil Engineering, 20, 217–221. 10.1061/(ASCE)0887-3801(2006)20:3(217) - DOI
    1. Yeh I-C. (2007). Modeling slump flow of concrete using second-order regressions and artificial neural networks. Cement and Concrete Composites, 28, 474–480. 10.1016/j.cemconcomp.2007.02.001 - DOI
    1. Yeh I-C. (2008). Modeling slump of concrete with fly ash and super plasticizer. Computers and Concrete, 5, 559–572. 10.12989/cac.2008.5.6.559 - DOI
    1. Breiman L. and Friedman J. H. (1997). Predicting multivariate responses in multiple regression. Journal of Royal Statistics Society B, 1, 3–54. 10.1111/1467-9868.00054 - DOI
    1. Chen F, Song M and Ma X. (2019) Investigation on the injury severity of serivers in rear-end collisions between cars using a random parameters bivariate ordered probit model, International Journal of Environmental Research and Public Health. 16(14), 2632 10.3390/ijerph16142632 - DOI - PMC - PubMed

Publication types

MeSH terms