Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Nov 26;4(11):89.
doi: 10.1186/gm390. eCollection 2012.

Improving the prediction of the functional impact of cancer mutations by baseline tolerance transformation

Affiliations

Improving the prediction of the functional impact of cancer mutations by baseline tolerance transformation

Abel Gonzalez-Perez et al. Genome Med. .

Abstract

High-throughput prioritization of cancer-causing mutations (drivers) is a key challenge of cancer genome projects, due to the number of somatic variants detected in tumors. One important step in this task is to assess the functional impact of tumor somatic mutations. A number of computational methods have been employed for that purpose, although most were originally developed to distinguish disease-related nonsynonymous single nucleotide variants (nsSNVs) from polymorphisms. Our new method, transformed Functional Impact score for Cancer (transFIC), improves the assessment of the functional impact of tumor nsSNVs by taking into account the baseline tolerance of genes to functional variants.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The distribution of MutationAssessor functional impact scores of nonsynonymous single nucleotide variants differs significantly in proteins belonging to different functional groups. (a) Candlestick representation of the distributions of MutationAssessor (MA) scores of germline single nucleotide variants (SNVs) in genes in all Gene Ontology Molecular Function (GOMF) categories, ordered from higher to lower mean. (b,c) Thirty least-tolerant and 30 most-tolerant GOMF groups of nsSNVs ordered by their mean MA scores. Groups in the lower end of the tolerance scale (less tolerant) correspond to essential GOMF categories, involved in signal transduction, transcription, and translation. On the other hand, the most tolerant molecular functions correspond mainly to metabolic-related activities.
Figure 2
Figure 2
Outline of the method to transform the scores. (a) Functional impact scores (FISs) of all germline single nucleotide variants (SNVs) from the 1000 Genomes Project are computed. (b) SNVs are partitioned into subsets according to the category of the genes that harbor them (for example, Gene Ontology Molecular Function). (c) FISs of a given cancer somatic mutation are computed and transformed using the distribution of the scores of SNVs in the same category as the protein where the mutation under analysis occurs. We give these transformed scores the generic name transFIC (transformed Functional Impact scores in Cancer).
Figure 3
Figure 3
Transformed Functional Impact for Cancer (transFIC) systematically outperforms original scores in the task of differentiating cancer driver mutations from neutral variants. (a) Performance of GOMF transFIC is compared to the three original functional impact scores (FISs) classifying the nine proxy datasets, using as cutoff the value of FIS (or transFIC) that maximizes the Mathews correlation coefficient (MCC) in each case. (b) Performance of GOMF transFIC is compared to the original score of CHASM (q-value cutoff <0.05) in two proxy datasets after removal of mutations within the training set of CHASM.
Figure 4
Figure 4
Complementary cumulative distribution of the three transFIC of subsets of nonsynonymous single nucleotide variants from COSMIC. (a-c) Complementary cumulative distribution of transFIC SIFT (a), transFIC PPH2 (b) and transFIC MA (c) of nonrecurrent (blue), recurrent (orange) and highly recurrent (red) COSMIC mutations.

Similar articles

Cited by

References

    1. Ng PC, Henikoff S. SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;4:3812–3814. doi: 10.1093/nar/gkg509. - DOI - PMC - PubMed
    1. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. A method and server for predicting damaging missense mutations. Nat Methods. 2010;4:248–249. doi: 10.1038/nmeth0410-248. - DOI - PMC - PubMed
    1. Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 2011;4:e11810. - PMC - PubMed
    1. Kaminker JS, Zhang Y, Watanabe C, Zhang Z. CanPredict: a computational tool for predicting cancer-associated missense mutations. Nucleic Acids Res. 2007;4:W595–598. doi: 10.1093/nar/gkm405. - DOI - PMC - PubMed
    1. Carter H, Chen S, Isik L, Tyekucheva S, Velculescu VE, Kinzler KW, Vogelstein B, Karchin R. Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations. Cancer Res. 2009;4:6660–6667. doi: 10.1158/0008-5472.CAN-09-1133. - DOI - PMC - PubMed

LinkOut - more resources