Sparse quadratic classification rules via linear dimension reduction

Irina Gaynanova¹, Tianying Wang¹

Affiliations

PMID: 31105355
PMCID: PMC6516858
DOI: 10.1016/j.jmva.2018.09.011

Sparse quadratic classification rules via linear dimension reduction

Irina Gaynanova et al. J Multivar Anal. 2019 Jan.

. 2019 Jan:169:278-299.

doi: 10.1016/j.jmva.2018.09.011. Epub 2018 Oct 3.

Authors

Irina Gaynanova¹, Tianying Wang¹

Affiliation

¹ Department of Statistics, Texas A&M University, 3143 TAMU, College Station, TX 77843, USA.

PMID: 31105355
PMCID: PMC6516858
DOI: 10.1016/j.jmva.2018.09.011

Abstract

We consider the problem of high-dimensional classification between two groups with unequal covariance matrices. Rather than estimating the full quadratic discriminant rule, we propose to perform simultaneous variable selection and linear dimension reduction on the original data, with the subsequent application of quadratic discriminant analysis on the reduced space. In contrast to quadratic discriminant analysis, the proposed framework doesn't require the estimation of precision matrices; it scales linearly with the number of measurements, making it especially attractive for the use on high-dimensional datasets. We support the methodology with theoretical guarantees on variable selection consistency, and empirical comparisons with competing approaches. We apply the method to gene expression data of breast cancer patients, and confirm the crucial importance of the ESR1 gene in differentiating estrogen receptor status.

Keywords: Convex optimization; Discriminant analysis; High-dimensional statistics; Variable selection.

PubMed Disclaimer

Figures

**Figure 1:**
Two-group classification problem with p = 2 and unequal covariance matrices. Left: Projection using Fisher’s discriminant vector. Middle: Projection using the covariance structure from the 1st group (circles). Right: Projection using the covariance structure from the 2nd group (triangles).

**Figure 2:**
Misclassification error rates over 100 replications, the horizontal lines show the median errors of the proposed DAP, discriminant analysis via projections. SLDA: Sparse linear discriminant analysis; SLOG: Sparse logistic regression with interactions; SQDA_LH: Sparse QDA of Le and Hastie [30]; SQDA_LS: Sparse QDA of Li and Shao [31]; SQDA_RF: Sparse QDA via ridge fusion; RDA: Regularized discriminant analysis.

**Figure 3:**
Number of selected variables over 100 replications, the horizontal lines indicate the median model sizes of proposed DAP, discriminant analysis via projections. RDA, SQDA_RF and SQDA_LH use all p variables, not shown. SLDA: Sparse linear discriminant analysis; SLOG: Sparse logistic regression with interactions; SQDA_LH: Sparse QDA of Le and Hastie [30]; SQDA_LS: Sparse QDA of Li and Shao [31]; SQDA_RF: Sparse QDA via ridge fusion; RDA: Regularized discriminant analysis.

**Figure 4:**
Left: Misclassification error rates over 100 splits. Right: Number of variables used in corresponding classification rules. DAP consistently selects the smallest model. SQDA_LS, SQDA_LH and RDA always use all p = 1000 variables, not shown. DAP: Discriminant analysis via projections, proposed method; SQDA_LS: Sparse QDA of Li and Shao [31]; SQDA_LH: Sparse QDA of Le and Hastie [30]; SLDA: Sparse linear discriminant analysis; RDA: Regularized discriminant analysis.

See this image and copyright information in PMC

Cited by

Interpretable discriminant analysis for functional data supported on random nonlinear domains with an application to Alzheimer's disease.
Lila E, Zhang W, Rane Levendovszky S; Alzheimer’s Disease Neuroimaging Initiative. Lila E, et al. J R Stat Soc Series B Stat Methodol. 2024 Mar 22;86(4):1013-1044. doi: 10.1093/jrsssb/qkae023. eCollection 2024 Sep. J R Stat Soc Series B Stat Methodol. 2024. PMID: 39279915 Free PMC article.
Simultaneous differential network analysis and classification for matrix-variate data with application to brain connectivity.
Chen H, Guo Y, He Y, Ji J, Liu L, Shi Y, Wang Y, Yu L, Zhang X; Alzheimers Disease Neuroimaging Initiative. Chen H, et al. Biostatistics. 2022 Jul 18;23(3):967-989. doi: 10.1093/biostatistics/kxab007. Biostatistics. 2022. PMID: 33769450 Free PMC article.
Identification of resistance in Escherichia coli and Klebsiella pneumoniae using excitation-emission matrix fluorescence spectroscopy and multivariate analysis.
Costa FSL, Bezerra CCR, Neto RM, Morais CLM, Lima KMG. Costa FSL, et al. Sci Rep. 2020 Aug 3;10(1):12994. doi: 10.1038/s41598-020-70033-x. Sci Rep. 2020. PMID: 32747745 Free PMC article.
Deep learning-based immunohistochemical estimation of breast cancer via ultrasound image applications.
Yan D, Zhao Z, Duan J, Qu J, Shi L, Wang Q, Zhang H. Yan D, et al. Front Oncol. 2024 Jan 9;13:1263685. doi: 10.3389/fonc.2023.1263685. eCollection 2023. Front Oncol. 2024. PMID: 38264739 Free PMC article.

References

1. Bach FR, Consistency of the group Lasso and multiple kernel learning, J. Mach. Learn. Res 9 (2008) 1179–1225.
1. Barber RF, Drton M, Exact block-wise optimization in group lasso and sparse group lasso for linear regression, arXiv.org (2010).
1. Boyd SP, Vandenberghe L, Convex Optimization, Cambridge Univ Press, Cambridge, 2004.
1. Breheny P, Huang J, Group descent algorithms for nonconvex penalized linear and logistic regression models with grouped predictors, Statistics and Computing 25 (2015) 173–187. - PMC - PubMed
1. Cai TT, Liu W, A direct estimation approach to sparse linear discriminant analysis, J. Amer. Statist. Assoc. 106 (2011) 1566–1577.

Grants and funding

U01 CA057030/CA/NCI NIH HHS/United States

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Other Literature Sources
- The Lens - Patent Citations Database
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Sparse quadratic classification rules via linear dimension reduction

Affiliation

Sparse quadratic classification rules via linear dimension reduction

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Miscellaneous