Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep;79(3):2298-2310.
doi: 10.1111/biom.13765. Epub 2022 Oct 11.

Boosting distributional copula regression

Affiliations

Boosting distributional copula regression

Nicolai Hans et al. Biometrics. 2023 Sep.

Abstract

Capturing complex dependence structures between outcome variables (e.g., study endpoints) is of high relevance in contemporary biomedical data problems and medical research. Distributional copula regression provides a flexible tool to model the joint distribution of multiple outcome variables by disentangling the marginal response distributions and their dependence structure. In a regression setup, each parameter of the copula model, that is, the marginal distribution parameters and the copula dependence parameters, can be related to covariates via structured additive predictors. We propose a framework to fit distributional copula regression via model-based boosting, which is a modern estimation technique that incorporates useful features like an intrinsic variable selection mechanism, parameter shrinkage and the capability to fit regression models in high-dimensional data setting, that is, situations with more covariates than observations. Thus, model-based boosting does not only complement existing Bayesian and maximum-likelihood based estimation frameworks for this model class but rather enables unique intrinsic mechanisms that can be helpful in many applied problems. The performance of our boosting algorithm for copula regression models with continuous margins is evaluated in simulation studies that cover low- and high-dimensional data settings and situations with and without dependence between the responses. Moreover, distributional copula boosting is used to jointly analyze and predict the length and the weight of newborns conditional on sonographic measurements of the fetus before delivery together with other clinical variables.

Keywords: Archimedean copula; GAMLSS; component-wise gradient boosting; early stopping; tail dependence.

PubMed Disclaimer

References

REFERENCES

    1. Barker, D.J. (1997) The long-term outcome of retarded fetal growth. Clinical Obstetrics and Gynecology, 40(4), 853-863.
    1. Bermingham, M.L., Pong-Wong, R., Spiliopoulou, A. et al. (2015) Application of high-dimensional feature selection: evaluation for genomic prediction in man. Scientific Reports, 5(10312), 1-12.
    1. Boulet, S.L., Alexander, G.R., Salihu, H.M. & Pass, M. (2003) Macrosomic births in the united states: determinants, outcomes, and proposed grades of risk. American Journal of Obstetrics and Gynecology, 188(5), 1372-1378.
    1. Bühlmann, P. & Hothorn, T. (2007) Boosting algorithms: regularization, prediction and model fitting. Statistical Science, 22(4), 477-505.
    1. Bühlmann, P. & Yu, B. (2003) Boosting with the L2 loss: regression and classification. Journal of the American Statistical Association, 98(462), 324-339.

Publication types

LinkOut - more resources