Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017:2017:6576840.
doi: 10.1155/2017/6576840. Epub 2017 Feb 8.

Integrating Biological Covariates into Gene Expression-Based Predictors of Radiation Sensitivity

Affiliations

Integrating Biological Covariates into Gene Expression-Based Predictors of Radiation Sensitivity

Vidya P Kamath et al. Int J Genomics. 2017.

Abstract

The use of gene expression-based classifiers has resulted in a number of promising potential signatures of patient diagnosis, prognosis, and response to therapy. However, these approaches have also created difficulties in trying to use gene expression alone to predict a complex trait. A practical approach to this problem is to integrate existing biological knowledge with gene expression to build a composite predictor. We studied the problem of predicting radiation sensitivity within human cancer cell lines from gene expression. First, we present evidence for the need to integrate known biological conditions (tissue of origin, RAS, and p53 mutational status) into a gene expression prediction problem involving radiation sensitivity. Next, we demonstrate using linear regression, a technique for incorporating this knowledge. The resulting correlations between gene expression and radiation sensitivity improved through the use of this technique (best-fit adjusted R2 increased from 0.3 to 0.84). Overfitting of data was examined through the use of simulation. The results reinforce the concept that radiation sensitivity is not driven solely by gene expression, but rather by a combination of distinct parameters. We show that accounting for biological heterogeneity significantly improves the ability of the model to identify genes that are associated with radiosensitivity.

PubMed Disclaimer

Conflict of interest statement

Javier F. Torres-Roca and Steven A. Eschrich hold patents and are cofounders of Cvergenx, Inc.

Figures

Figure 1
Figure 1
Investigation of building predictors for radiation sensitivity in 48 cell lines. (a) Classification accuracy of radiation sensitivity predictor built from 48 cell lines, using different numbers of features in the regression model. (b) Classification accuracy of radiation sensitivity predictor built from 48 cell lines, using different types of normalization. MAS5.0 and MAS4.0 algorithms generated the most accurate predictors. (c) Classification accuracy of radiation sensitivity predictor built from 48 cell lines, using different types of classification algorithms, including linear regression, least median, and SMO.
Figure 2
Figure 2
Biological characteristics differ when considering 35 cell lines versus an expanded set of 48 cell lines. (a) The proportion of RAS wild-type cell lines increased (60% to 69%). (b) The proportion of p53 wild-type cell lines increased (26% to 35%). (c) Tissue of origin of cell lines did not change significantly. (d) Venn diagram showing the lack of concordance in correlation when using a test for correlation (p < 0.05) using only RAS mutant or RAS wt cell lines in the 35-cell line set. Only 16 probesets were found correlated in both sets.
Figure 3
Figure 3
Adj-R2 values for linear equations fitting SF2 on 48 cell lines. Adj-R2 values increase systematically as more covariates are included in the linear model.
Figure 4
Figure 4
Change in Adj-R2 values obtained by incorporating interaction terms in linear model for either biological indicators or random variables. Linear models were created for each gene, incorporating the three biological indicators. Differences in Adj-R2 values were computed for each experiment between using an additive model and an interaction-based model. The mean difference in R2 was recorded when adding interactions terms to the individual linear models.

References

    1. Dalton W. S., Friend S. H. Cancer biomarkers—an invitation to the table. Science. 2006;312(5777):1165–1168. doi: 10.1126/science.1125948. - DOI - PubMed
    1. Yeatman T. J. Predictive biomarkers: identification and verification. Journal of Clinical Oncology. 2009;27(17):2743–2744. doi: 10.1200/jco.2008.21.5087. - DOI - PubMed
    1. Ioannidis J. P. A. Is molecular profiling ready for use in clinical decision making? Oncologist. 2007;12(3):301–311. doi: 10.1634/theoncologist.12-3-301. - DOI - PubMed
    1. Malumbres M., Barbacid M. RAS oncogenes: the first 30 years. Nature Reviews Cancer. 2003;3(6):459–465. doi: 10.1038/nrc1097. - DOI - PubMed
    1. Goodsell D. S. The molecular perspective: the ras oncogene. Oncologist. 1999;4(3):263–264. - PubMed

LinkOut - more resources