Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jun 8;31(8):10.1002/env.2642.
doi: 10.1002/env.2642.

Bayesian Nonparametric Monotone Regression

Affiliations

Bayesian Nonparametric Monotone Regression

Ander Wilson et al. Environmetrics. .

Abstract

In many applications there is interest in estimating the relation between a predictor and an outcome when the relation is known to be monotone or otherwise constrained due to the physical processes involved. We consider one such application-inferring time-resolved aerosol concentration from a low-cost differential pressure sensor. The objective is to estimate a monotone function and make inference on the scaled first derivative of the function. We proposed Bayesian nonparametric monotone regression which uses a Bernstein polynomial basis to construct the regression function and puts a Dirichlet process prior on the regression coefficients. The base measure of the Dirichlet process is a finite mixture of a mass point at zero and a truncated normal. This construction imposes monotonicity while clustering the basis functions. Clustering the basis functions reduces the parameter space and allows the estimated regression function to be linear. With the proposed approach we can make closed-formed inference on the derivative of the estimated function including full quantification of uncertainty. In a simulation study the proposed method performs similar to other monotone regression approaches when the true function is wavy but performs better when the true function is linear. We apply the method to estimate time-resolved aerosol concentration with a newly-developed portable aerosol monitor. The R package bnmr is made available to implement the method.

Keywords: Aerosol monitors; Bernstein polynomials; Dirichlet process; Fine particulate matter; monotone regression.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Various representations of the Bernstein polynomial (BP) basis functions. Panel 1a shows the 51 BP basis functions of order M = 50 (Ψ(x, M)). Panel 1b shows the transformed BP basis represented as Ψ(x, M)A−1 as described in 2.1. This transformation is used for both BNMR and BISOREG. Panel 1c shows the posterior mode group of basis functions selected to be included into the model with BISOREG. This is a subset of the transformed basis functions shown in Panel 1a. Panel 1d shows the posterior model combination of basis functions included with BNMR. This includes the intercept and three basis functions which are each a linear combination of one to three of the basis functions shown in panel 1b and subsequently linear combinations of the basis functions shown in Panel 1a. Results from all 12 runs are shown in the supplemental material.
Figure 2:
Figure 2:
Simulation results for the number of non-zero regression coefficients (dashed line) and the number of unique values of the non-zero regression coefficient (solid line) for BISOREG (triangle) and BNMR (×). The number of unique non-zero values is always equal to the total number of non-zero values in BISOREG.
Figure 3:
Figure 3:
Estimated pressure drop from the MARS data for one run. Each panel shows the estimates and 95% intervals for each method separately. Results from all 12 runs are shown in the supplemental material.
Figure 4:
Figure 4:
Estimated PM2.5 concentration from the MARS data. Panel 4a shows the posterior mean and 95% interval from BNMR and Panel 4b shows the posterior mean and 95% interval from BISOREG. The dashed line in each panel is the PM2.5 concentration measured with the TEOM. Results from all 12 runs are shown in the supplemental material. Results with other methods are also shown in the supplemental material.

References

    1. Blackwell D and MacQueen JB (1973). Ferguson Distributions Via Polya Urn Schemes Author. The Annals of Statistics, 1(2):353–355.
    1. Brunk HD (1955). Maximum Likelihood Estimates of Monotone Parameters. The Annals of Mathematical Statistics, 26(4):607–616.
    1. Bush CA and MacEachern SN (1996). A Semiparametric Bayesian Model for Randomised Block Designs. Biometrika, 83(2):275–285.
    1. Calder CA (2008). A dynamic process convolution approach to modeling ambient particulate matter concentrations. Environmetrics, 19(1):39–48.
    1. Canale A, Durante D, and Dunson DB (2018). Convex mixture regression for quantitative risk assessment. Biometrics, 74(4):1331–1340. - PMC - PubMed

LinkOut - more resources