Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Feb 27;14(2):e0212108.
doi: 10.1371/journal.pone.0212108. eCollection 2019.

Drug sensitivity prediction with high-dimensional mixture regression

Affiliations

Drug sensitivity prediction with high-dimensional mixture regression

Qianyun Li et al. PLoS One. .

Abstract

This paper proposes a mixture regression model-based method for drug sensitivity prediction. The proposed method explicitly addresses two fundamental issues in drug sensitivity prediction, namely, population heterogeneity and feature selection pertaining to each of the subpopulations. The mixture regression model is estimated using the imputation-conditional consistency algorithm, and the resulting estimator is consistent. This paper also proposes an average-BIC criterion for determining the number of components for the mixture regression model. The proposed method is applied to the CCLE dataset, and the numerical results indicate that the proposed method can make a drastic improvement over the existing ones, such as random forest, support vector regression, and regularized linear regression, in both drug sensitivity prediction and feature selection. The p-values for the comparisons in drug sensitivity prediction can reach the order O(10-8) or lower for the drugs with heterogeneous populations.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. BIC paths and cluster dendrograms produced by ICC.
The BIC paths and cluster dendrograms produced by ICC with K = 2, 3, 4, where each cluster dendrogram was produced using a hierarchical clustering procedure (with the average link) based on the dissimilarity matrix calculated along the corresponding BIC path after discarding the first 100 iterations as the burn-in process.
Fig 2
Fig 2. Comparison of the single component and mixture regression models for training data fitting.
The left column is for the single regression model, the middle column is for the mixture regression model, and the right column is the cluster dendrogram produced by the mixture regression model; the top, middle and lower panels are for the drugs AZD0530, L-685458 and Lapatinib, respectively.
Fig 3
Fig 3. Comparison of the single component and mixture regression models for test data prediction.
The left column is for the single component regression model, and the right column is for the mixture regression model; the top, middle and lower panels are for the drugs AZD0530, L-685458 and Lapatinib, respectively.
Fig 4
Fig 4. Prediction performance comparison.
Comparison of the prediction performance (measured by corr(Ytest, Y^test)) of the mixture regression model with support vector regression, random forest, ridge regression, elastic net, and SIS-MCP regression (i.e., single model regression).

References

    1. de Niz C., Rahman R., Zhao X. and Pal R. (2016). Algorithms for drug sensitivity prediction. Algorithms, 9, 77 10.3390/a9040077 - DOI
    1. Geeleher P., Cox N.J. and Huang R.S. (2014). Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines. Genome Biol., 15, R47 10.1186/gb-2014-15-3-r47 - DOI - PMC - PubMed
    1. Barretina J., Caponigro G., Stransky N., Venkatesan K., Margolin A.A., Kim S., et al. (2012). The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 483, 603–607. 10.1038/nature11003 - DOI - PMC - PubMed
    1. Zou H. and Hastie T. (2005). Regularization and Variable Selection via the Elastic Net. J. R. Statist. Soc. Ser. B, 67, 301–320. 10.1111/j.1467-9868.2005.00503.x - DOI
    1. Jang I.S., Neto E.C., Guinney J., Friend S.H., and Margolin A.A. (2014). Systematic assessment of analytical methods for drug sensitivity prediction from cancer cell line data. Pac. Symp. Biocomput., 63–74. - PMC - PubMed

Publication types