Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep;76(3):811-820.
doi: 10.1111/biom.13204. Epub 2019 Dec 31.

A fast score test for generalized mixture models

Affiliations

A fast score test for generalized mixture models

Rui Duan et al. Biometrics. 2020 Sep.

Abstract

In biomedical studies, testing for homogeneity between two groups, where one group is modeled by mixture models, is often of great interest. This paper considers the semiparametric exponential family mixture model proposed by Hong et al. (2017) and studies the score test for homogeneity under this model. The score test is nonregular in the sense that nuisance parameters disappear under the null hypothesis. To address this difficulty, we propose a modification of the score test, so that the resulting test enjoys the Wilks phenomenon. In finite samples, we show that with fixed nuisance parameters the score test is locally most powerful. In large samples, we establish the asymptotic power functions under two types of local alternative hypotheses. Our simulation studies illustrate that the proposed score test is powerful and computationally fast. We apply the proposed score test to an UK ovarian cancer DNA methylation data for identification of differentially methylated CpG sites.

Keywords: DNA methylation; asymptotics; conditional likelihood; nonregular problem; semiparametric mixture model.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Power curves of the proposed score test, PLEMT test by Hong et al. (2017), MPRT proposed by Ning and Chen (2015), PELT proposed by Liu et al. (2012), EST proposed by Qin and Liang (2011), t-test, and Wilcoxon test. In the normal model, we took (μ1, σ1) = (0, 1) and (μ2, σ2) = (1, 1.5) (M&V scenario),(μ1, σ1) = (0, 1) and (μ2, σ2) = (1, 1) (M only scenario), and (μ1, σ1) = (0, 1) and (μ2, σ2) = (0, 2.5) (V only scenario). In the Beta model, we took (α1, β1) = (3, 3) and (α2, β2) = (2, 3) (M & V scenario), (α1, β1) = (2, 2) and (α2, β2) = (1, 2.256) (M only scenario), and (α1, β1) = (3, 3) and (α2, β2) = (1, 1) (V only scenario).
Figure 2.
Figure 2.
Receiver operating characteristics (ROC) curves for the proposed test, the PLEMT test and t-test using logistic regression as the classification method.

References

    1. Ansel J, Bottin H, Rodriguez-Beltran C, Damon C, Nagarajan M, Fehrmann S, François J, and Yvert G. Cell-to-cell stochastic variation in gene expression is a complex genetic trait. PLoS genetics, 4(4):e1000049, 2008. - PMC - PubMed
    1. Aschard H, Zaitlen N, Tamimi RM, Lindström S, and Kraft P. A nonparametric test to detect quantitative trait loci where the phenotypic distribution differs by genotypes. Genetic Epidemiology, 2013. - PMC - PubMed
    1. Barndorff-Nielsen O. Exponential families. Encyclopedia of Statistical Sciences, 2006.
    1. Chen H, Chen J, and Kalbfleisch J. A modified likelihood ratio test for homogeneity in finite mixture models. Journal of the Royal Statistical Society: Series B, 63(1):19–29, 2001.
    1. Cox DR. Regression models and life-tables. Journal of the Royal Statistical Society. Series B, 34(2):187–220, 1972.

Publication types