Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 May 15;31(10):1526-35.
doi: 10.1093/bioinformatics/btu858. Epub 2015 Jan 7.

MADGiC: a model-based approach for identifying driver genes in cancer

Affiliations

MADGiC: a model-based approach for identifying driver genes in cancer

Keegan D Korthauer et al. Bioinformatics. .

Abstract

Motivation: Identifying and prioritizing somatic mutations is an important and challenging area of cancer research that can provide new insights into gene function as well as new targets for drug development. Most methods for prioritizing mutations rely primarily on frequency-based criteria, where a gene is identified as having a driver mutation if it is altered in significantly more samples than expected according to a background model. Although useful, frequency-based methods are limited in that all mutations are treated equally. It is well known, however, that some mutations have no functional consequence, while others may have a major deleterious impact. The spatial pattern of mutations within a gene provides further insight into their functional consequence. Properly accounting for these factors improves both the power and accuracy of inference. Also important is an accurate background model.

Results: Here, we develop a Model-based Approach for identifying Driver Genes in Cancer (termed MADGiC) that incorporates both frequency and functional impact criteria and accommodates a number of factors to improve the background model. Simulation studies demonstrate advantages of the approach, including a substantial increase in power over competing methods. Further advantages are illustrated in an analysis of ovarian and lung cancer data from The Cancer Genome Atlas (TCGA) project.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Counts of samples with mutation by position and type for TCGA ovarian and COSMIC (Catalogue Of Somatic Mutations In Cancer) datasets. The left panel displays the ten genes with the lowest entropy in COSMIC (putative oncogenes) that have at least one mutation in TCGA ovarian. The right panel displays the ten genes with the highest proportion of truncating mutations (putative tumor-suppressor genes) that have at least one mutation in TCGA ovarian. Blue represents missense mutations and red represents a location with at least one truncating mutation. Each vertical bar spans five amino acids and darker colors correspond to more mutations. For genes with more than 500 mutations, a random sample of 500 was plotted, and positions with more than 25 mutations are given the same color intensity as those with 25 mutations
Fig. 2.
Fig. 2.
Mutation rate is shown to depend significantly on replication timing region and expression level. Specifically, mutation rate is shown for three replication timing regions (top) and for three levels of expression (bottom) for four types of mutations in TCGA ovarian data. Within each mutation type, Chi-Square tests of mutation counts stratified by replication timing or expression level categories were found to be significant (P < 0.05)
Fig. 3.
Fig. 3.
The proportion of driver genes identified by each method in each replication timing (top) and expression level (bottom) category for the TCGA ovarian case study. Refer to Supplementary Section S5 for similar results from the lung case study

References

    1. Adzhubei I.A., et al. . (2010). A method and server for predicting damaging missense mutations. Nat. Methods , 7, 248–249. - PMC - PubMed
    1. Barretina J., et al. . (2012). The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature , 483, 603607. - PMC - PubMed
    1. Birkbak N.J., et al. . (2013). Tumor mutation burden forecasts outcome in ovarian cancer with brca1 or brca2 mutations. PLoS One , 8, e80023. - PMC - PubMed
    1. Bozic I., et al. . (2010). Accumulation of driver and passenger mutations during tumor progression. Proc. Natl Acad. Sci. USA, 107, 18545–18550. - PMC - PubMed
    1. Chapman M.A., et al. . (2011). Initial genome sequencing and analysis of multiple myeloma. Nature , 471, 476–472. - PMC - PubMed

Publication types

Substances