Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2017 Sep 1;18(5):820-829.
doi: 10.1093/bib/bbw065.

Computational models for predicting drug responses in cancer research

Review

Computational models for predicting drug responses in cancer research

Francisco Azuaje. Brief Bioinform. .

Abstract

The computational prediction of drug responses based on the analysis of multiple types of genome-wide molecular data is vital for accomplishing the promise of precision medicine in oncology. This will benefit cancer patients by matching their tumor characteristics to the most effective therapy available. As larger and more diverse layers of patient-related data become available, further demands for new bioinformatics approaches and expertise will arise. This article reviews key strategies, resources and techniques for the prediction of drug sensitivity in cell lines and patient-derived samples. It discusses major advances and challenges associated with the different model development steps. This review highlights major trends in this area, and will assist researchers in the assessment of recent progress and in the selection of approaches to emerging applications in oncology.

Keywords: cancer; computational prediction models; drug sensitivity; precision medicine; translational bioinformatics.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Key steps in the development of computational models for predicting drug response. Data obtained from cell lines, animals or humans are stored in different data repositories, including public databases. These resources also include drug response information. Data sets are obtained to be subsequently used as training data sets, and may contain one or more types of ‘omics’ data, e.g. transcriptomics and DNA sequence. Such data are used as inputs to statistical or machine learning techniques. The prediction problem may be defined as either a classification or a regression problem, and a variety of techniques may be applied. The predictive performance of the models is assessed with cross-validation sampling techniques. The most-promising models are selected and evaluated using testing data sets, which were not used during the training phase. The model and its predictions undergo human expert interpretation and their reporting to stakeholders follows. Further independent validations using clinically relevant data are required to continue bridging the gap between the laboratory and the clinic.
Figure 2
Figure 2
A graphical synthesis of the diversity of computational models available for the prediction of drug responses. (A) List of data types most commonly used. (B) Categorization of models on the basis of the prediction problems addressed. (C) General hierarchy of statistical and machine learning techniques most commonly investigated. (D) Fundamental data sampling strategies for assessing model prediction capability.

Similar articles

Cited by

References

    1. Adams JU. Genetics: big hopes for big data. Nature 2015;527(7578):S108–9. - PubMed
    1. Schmidt C. Cancer: reshaping the cancer clinic. Nature 2015;527(7576):S10–1. - PubMed
    1. Rubin MA. Health: make precision medicine work for cancer care. Nature 2015;520(7547):290–1. - PubMed
    1. Kohane IS. Health care policy. Ten things we have to do to achieve precision medicine. Science 2015;349(6243):37–8. - PubMed
    1. Baselga J, Bhardwaj N, Cantley LC, et al.AACR cancer progress report 2015. Clin Cancer Res 2015;21(Suppl 19):S1–128. - PMC - PubMed