Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Aug 1;17(1):39-53.
doi: 10.1515/ijb-2019-0163.

Multivariate quasi-beta regression models for continuous bounded data

Affiliations

Multivariate quasi-beta regression models for continuous bounded data

Ricardo R Petterle et al. Int J Biostat. .

Abstract

We propose a multivariate regression model to deal with multiple continuous bounded data. The proposed model is based on second-moment assumptions, only. We adopted the quasi-score and Pearson estimating functions for estimation of the regression and dispersion parameters, respectively. Thus, the proposed approach does not require a multivariate probability distribution for the variable response vector. The multivariate quasi-beta regression model can easily handle multiple continuous bounded outcomes taking into account the correlation between the response variables. Furthermore, the model allows us to analyze continuous bounded data on the interval [0, 1], including zeros and/or ones. Simulation studies were conducted to investigate the behavior of the NORmal To Anything (NORTA) algorithm and to check the properties of the estimating function estimators to deal with multiple correlated response variables generated from marginal beta distributions. The model was motivated by a data set concerning the body fat percentage, which was measured at five regions of the body and represent the response variables. We analyze each response variable separately and compare it with the fit of the multivariate proposed model. The multivariate quasi-beta regression model provides better fit than its univariate counterparts, as well as allows us to measure the correlation between response variables. Finally, we adapted diagnostic tools to the proposed model. In the supplementary material, we provide the data set and R code.

Keywords: NORTA algorithm; correlated data; estimating functions; multiple continuous bounded outcomes; simulation study; unit interval.

PubMed Disclaimer

References

    1. Ferrari, S, Cribari-Neto, F. Beta regression for modelling rates and proportions. J Appl Stat 2004;31:799–815. https://doi.org/10.1080/0266476042000214501.
    1. Barndorff-Nielsen, OE, Jørgensen, B. Some parametric models on the simplex. J Multivariate Anal 1991;39:106–16. https://doi.org/10.1016/0047-259x(91)90008-p.
    1. Mitnik, PA, Baek, S. The Kumaraswamy distribution: median-dispersion re-parameterizations for regression modeling and simulation-based estimation. Stat Pap 2013;54:177–92. https://doi.org/10.1007/s00362-011-0417-y.
    1. Lemonte, AJ, Bazán, JL. New class of Johnson SB distributions and its associated regression model for rates and proportions. Biom J 2016;58:727–46. https://doi.org/10.1002/bimj.201500030.
    1. Mousa, AM, El-Sheikh, AA, Abdel-Fattah, MA. A gamma regression for bounded continuous variables. Adv Appl Stat 2016;49:305. https://doi.org/10.17654/as049040305.

LinkOut - more resources