Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct 18;24(4):1066-1084.
doi: 10.1093/biostatistics/kxac023.

Constrained groupwise additive index models

Affiliations

Constrained groupwise additive index models

Pierre Masselot et al. Biostatistics. .

Abstract

In environmental epidemiology, there is wide interest in creating and using comprehensive indices that can summarize information from different environmental exposures while retaining strong predictive power on a target health outcome. In this context, the present article proposes a model called the constrained groupwise additive index model (CGAIM) to create easy-to-interpret indices predictive of a response variable, from a potentially large list of variables. The CGAIM considers groups of predictors that naturally belong together to yield meaningful indices. It also allows the addition of linear constraints on both the index weights and the form of their relationship with the response variable to represent prior assumptions or operational requirements. We propose an efficient algorithm to estimate the CGAIM, along with index selection and inference procedures. A simulation study shows that the proposed algorithm has good estimation performances, with low bias and variance and is applicable in complex situations with many correlated predictors. It also demonstrates important sensitivity and specificity in index selection, but non-negligible coverage error on constructed confidence intervals. The CGAIM is then illustrated in the construction of heat indices in a health warning system context. We believe the CGAIM could become useful in a wide variety of situations, such as warning systems establishment, and multipollutant or exposome studies.

Keywords: Additive index models; Dimension reduction; Index; Linear constraints; Quadratic programming.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Estimated RMSE for different scenarios, varying the sample size (a), the correlation between variables (b), and the noise level (c). Note that the CGAIM and GAIM curves overlap each other at the bottom. Note the log scale.
Fig. 2.
Fig. 2.
Average sensitivity and specificity of index selection computed on the 1000 simulations for various number of true indices and error level.
Fig. 3.
Fig. 3.
Estimated coverage for both inference methods and various noise level. Vertical segments indicate formula image1 standard error of the coverage.
Fig. 4.
Fig. 4.
Average estimated formula image for according to the true value formula image. Segments indicate 2.5th and 97.5th percentile of estimated formula image.
Fig. 5.
Fig. 5.
Resulting indices created in Montreal. Top row: weights formula image for each selected index; bottom row: functions formula image. Indices have been standardized over the range [0–1] for ease of comparison. Each column corresponds to one index. Vertical segments and dotted lines represent block bootstrap 95formula image confidence intervals.

References

    1. Agier, L., Portengen, L., Chadeau-Hyam, M., Basagaña X., Giorgis-Allemand, L., Siroux, V., Robinson, O., Vlaanderen, J., González, J. R., Nieuwenhuijsen, M. J., and others. (2016). A systematic comparison of linear regression-based statistical methods to assess exposome-health associations. Environmental Health Perspectives 124, 1848–1856. - PMC - PubMed
    1. Armstrong, B. G., Gasparrini, A., Tobias, A. and Sera, F. (2020). Sample size issues in time series regressions of counts on environmental exposures. BMC Medical Research Methodology 20, 15. - PMC - PubMed
    1. Armstrong, B., Sera, F., Vicedo-Cabrera, A. M., Abrutzky, R., Åsträm, D. O., Bell, M. L., Chen, B.-Y., de Sousa Zanotti Stagliorio Coelho, M., Patricia Matus, C., Dang Tran, N., and others. (2019). The role of humidity in associations of high temperature with mortality: a multiauthor, multicity study. Environmental Health Perspectives 127, 097007. - PMC - PubMed
    1. Barreca, A. I. (2012). Climate change, humidity, and mortality in the United States. Journal of Environmental Economics and Management 63, 19–34. - PMC - PubMed
    1. Bates, D. M. and Watts, D. G. (1981). A relative off set orthogonality convergence criterion for nonlinear least squares. Technometrics 23, 179–183.

Publication types