Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013:7:323-341.
doi: 10.1214/13-EJS774.

Deconvolution estimation of mixture distributions with boundaries

Affiliations

Deconvolution estimation of mixture distributions with boundaries

Mihee Lee et al. Electron J Stat. 2013.

Abstract

In this paper, motivated by an important problem in evolutionary biology, we develop two sieve type estimators for distributions that are mixtures of a finite number of discrete atoms and continuous distributions under the framework of measurement error models. While there is a large literature on deconvolution problems, only two articles have previously addressed the problem taken up in our article, and they use relatively standard Fourier deconvolution. As a result the estimators suggested in those two articles are degraded seriously by boundary effects and negativity. A major contribution of our article is correct handling of boundary effects; our method is asymptotically unbiased at the boundaries, and also is guaranteed to be nonnegative. We use roughness penalization to improve the smoothness of the resulting estimator and reduce the estimation variance. We illustrate the performance of the proposed estimators via our real driving application in evolutionary biology and two simulation studies. Furthermore, we establish asymptotic properties of the proposed estimators.

Keywords: Boundary effect; maximum likelihood; measurement error; mixture distribution; penalization; sieve method.

PubMed Disclaimer

Figures

Fig 1
Fig 1
Simulation Study I. Density plot of the continuous component in Model (4.1).
Fig 2
Fig 2
The motivating virus lineage application. Panel (a) shows various density estimators of the deleterious mutation effects. Panel (b) shows the selection for the penalty parameter λ.
Fig 3
Fig 3
Validation of the exponential assumption: the density-envelope plots for various values of the roughness penalty parameter λ. The light grey curves form the envelope to show the natural estimation variation, the black dash-dotted curve shows the exponential density which generates the simulation samples, the black solid curve is the penalized sieve density estimator obtained from the data, and the dark grey solid curve is the average of the 100 grey curves.

References

    1. Akaike H. A new look at the statistical model identification. IEEE transactions on automatic control. 1974;19:716–723. MR0423716.
    1. Bertsekas DP. Nonlinear Programming. second edition Athena Scientific; 2005.
    1. Brown L, Greenshtein E. Non parametric empirical Bayes and compound decision approaches to estimation of a high dimensional vector of normal means. Ann. Statist. 2009;37:1685–1704. MR2533468.
    1. Burch CL, Chao L. Epistasis and its relationship to canalization in the RNA virus φ6. Genetics. 2004;167:559–567. - PMC - PubMed
    1. Burch CL, Guyader S, Samarov D, Shen H. Experimental estimate of the abundance and effects of nearly neutral mutations in the RNA virus φ6. Genetics. 2007;176:467–476. - PMC - PubMed

LinkOut - more resources