Comment

Brief Bioinform. 2022 Jan 17;23(1):bbab532.

doi: 10.1093/bib/bbab532.

Letter to the Editor: on the stability and internal consistency of component-wise sparse mixture regression-based clustering

Bo Zhang¹, Jianghua He¹, Jinxiang Hu¹, Devin C Koestler¹, Prabhakar Chalise¹


PMID: 34953466
PMCID: PMC8769908
DOI: 10.1093/bib/bbab532


Bo Zhang et al. Brief Bioinform. 2022.


Affiliation

¹ Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, KS 66160, USA.


Abstract

Understanding the relationship between molecular markers and a phenotype of interest is often obscured by patient-level heterogeneity. To address this challenge, Chang et al. recently published a novel method called Component-wise Sparse Mixture Regression (CSMR), a regression-based clustering method that promises to detect heterogeneous relationships between molecular markers and a phenotype of interest in high-dimensional settings. In this Letter to the Editor, we raise awareness of several issues concerning the evaluation of CSMR in Chang et al., particularly in settings where the number of features, P, exceeds the study sample size, N, and we advocate for additional metrics and approaches when assessing the performance of regression-based clustering methodologies.

Keywords: disease heterogeneity; mixture modeling; supervised learning.


Figures

**Figure 1**
Performance of CSMR on the Cancer Cell Line Encyclopedia (CCLE) data set. (A) Boxplot of clustering internal consistency (IC), calculated as the adjusted Rand index (ARI) of cluster memberships between each pair of runs, from 100 separate applications of CSMR to the CCLE data set using the same tuning parameters for each run. (B) Bar plot of how often each feature was selected by CSMR across the same 100 applications. Of the 500 features considered, 425 were selected by CSMR at least once. On average, CSMR selected 43.6 features per run (standard deviation = 11.7).
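The two summaries in Figure 1 can be illustrated with a minimal Python sketch. It is not the authors' code and does not run CSMR itself: it assumes that each run has already produced a vector of cluster labels and a list of selected feature names. The function names (`adjusted_rand_index`, `internal_consistency`, `selection_frequency`) are hypothetical and introduced here only for illustration.

```python
from collections import Counter
from itertools import combinations
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two labelings of the same samples."""
    if len(labels_a) != len(labels_b):
        raise ValueError("labelings must cover the same samples")
    n = len(labels_a)
    if n < 2:
        raise ValueError("need at least two samples")
    # Pair counts from the contingency table and from its two margins.
    sum_cells = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    expected = sum_a * sum_b / comb(n, 2)
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:
        # Degenerate case, e.g. both labelings put every sample in one cluster.
        return 1.0
    return (sum_cells - expected) / (max_index - expected)


def internal_consistency(runs):
    """ARI for every unordered pair of runs (assumed: one label list per run)."""
    return [adjusted_rand_index(a, b) for a, b in combinations(runs, 2)]


def selection_frequency(selected_per_run):
    """Number of runs in which each feature was selected."""
    counts = Counter()
    for selected in selected_per_run:
        counts.update(set(selected))  # count a feature at most once per run
    return counts


# Toy inputs, invented for illustration only (not CCLE results).
toy_runs = [[0, 0, 1, 1], [1, 1, 0, 0], [0, 1, 0, 1]]
toy_selected = [["g1", "g2"], ["g2"], ["g2", "g3"]]
toy_ic = internal_consistency(toy_runs)
toy_freq = selection_frequency(toy_selected)
```

With 100 runs, `internal_consistency` would return 4950 pairwise ARI values, which is the kind of distribution a boxplot such as panel (A) summarizes. The ARI is invariant to label permutation, so two runs that recover the same partition under different cluster numbering still score 1.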

**Figure 2**
CSMR model performance in various simulation settings. Boxplots showing performance metric values when […]. The x-axis shows […] at 100, 200 and 300, and the colors indicate […]. As […] and P increase, both accuracy and IC decline.


Comment on

Supervised clustering of high-dimensional data using regularized mixture modeling.
Chang W, Wan C, Zang Y, Zhang C, Cao S. Brief Bioinform. 2021 Jul 20;22(4):bbaa291. doi: 10.1093/bib/bbaa291. PMID: 34293851.

References

1. Chang W, Wan C, Zang Y, et al. Supervised clustering of high-dimensional data using regularized mixture modeling. Brief Bioinform 2020;22(4):1–11.
2. Li Q, Shi R, Liang F. Drug sensitivity prediction with high-dimensional mixture regression. PLoS One 2019;14(2):e0212108.
3. Khalili A, Chen J. Variable selection in finite mixture of regression models. J Am Stat Assoc 2007;102(479):1025–38.
4. Wang H, Leng C. Unified LASSO estimation by least squares approximation. J Am Stat Assoc 2007;102(479):1039–48.
5. Barretina J, Caponigro G, Stransky N, et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 2012;483(7391):603–7.
