Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb 3;16(2):421-432.
doi: 10.1021/acs.jproteome.6b00505. Epub 2016 Dec 15.

CanProVar 2.0: An Updated Database of Human Cancer Proteome Variation

Affiliations

CanProVar 2.0: An Updated Database of Human Cancer Proteome Variation

Menghuan Zhang et al. J Proteome Res. .

Abstract

Identification and annotation of the mutations involved in oncogenesis and tumor progression are crucial for both cancer biology and clinical applications. Previously, we developed a public resource CanProVar, a human cancer proteome variation database for storing and querying single amino acid alterations in the human cancers. Since the publication of CanProVar, extensive cancer genomics efforts have revealed the enormous genomic complexity of various types of human cancers. Thus, there is an overwhelming need for comprehensive annotation of the genomic alterations at the protein level and making such knowledge easily accessible. Here, we describe CanProVar 2.0, a significantly expanded version of CanProVar, in which the amount of cancer-related variations and noncancer specific variations was increased by about 10-fold as compared to the previous version. To facilitate the interpretation of the variations, we added to the database functional data on potential impact of the cancer-related variations on 3D protein interaction and on the differential expression of the variant-bearing proteins between cancer and normal samples. The web interface allows for flexible queries based on gene or protein IDs, cancer types, chromosome locations, or pathways. An integrated protein sequence database containing variations that can be directly used for proteomics database searching can be downloaded.

Keywords: cancer; database; proteome; variation.

PubMed Disclaimer

Figures

Figure 1
Figure 1. The system architecture of CanProVar 2.0
Five query methods are provided, and the output information is shown through different background colors.
Figure 2
Figure 2. Variation content
(A) Comparison of crVARs among different data sources between CanProVar version 1.0 and 2.0. (B) The frequency statistics for cancer types in CanProVar 2.0 show that much more crVARs have been identified in ovarian, breast, skin, and head and neck cancers than other cancer types.
Figure 3
Figure 3. Example of differential expression of a crPRO and its partners
Protein with differential expression, cancer-related variation, and its interaction partners are displayed in graph.
Figure 4
Figure 4. New searching methods
CanProVar 2.0 introduced three additional searching methods based on protein sets defined by protein lists (A), chromosome locations (B) and biological pathways (C).
Figure 5
Figure 5. Example results of a protein list-based query
(A) The amounts of crVARs in the query proteins in different cancer types are displayed in a heatmap, in which darker colors correspond to more variations. (B) The detailed information about these variations can be found when the user selects the number of variations in a specific cancer type.
Figure 6
Figure 6. Example of a chromosome location-based query
In the searching results, the accumulated mutation numbers (crVARs) are plotted by the chromosome position and the data cross cancer types are illustrated in different colors. Therefore, a hot chromosome band with a significantly higher number of crVARs can be spotted easily and clearly. For example, we found a peak of crVARs in chromosome band ‘chr17 p13.1’.
Figure 7
Figure 7. Example of results of a biological pathway-based query
By entering a KEGG pathway ID, e.g. has00010, or selecting a name from the menu of pathways implemented in CanProVar 2.0, the user can see the crPROs and crDEPs as highlighted in different colors on the graph of the given pathway. For instance, in the p53 signaling pathway, most of members have crVARs and half of these crPROs also showed differential expression between cancer and normal samples.
Figure 8
Figure 8. Significantly mutated proteins in cancers
(A) The crVARs: ncrVARs ratio analysis revealed 167 significantly mutated proteins across all cancer types, of which the ratios are higher than 3.0. (B) The crVARs: ncrVARs ratios in ovarian cancer. (C) The crVARs: ncrVARs ratios in breast cancer. (D) The crVARs: ncrVARs ratios in skin cancer.
Figure 9
Figure 9. Association network of cancers
This network was constructed based on shared crVARs. Two cancer types were linked if they had common crVARs. The network has 217 edges, and the edge width represents the number of shared crVARs between the two connected cancer types. The node size corresponds to the degree in the network.

Similar articles

Cited by

References

    1. Garraway LA, Lander ES. Lessons from the cancer genome. Cell. 2013;153(1):17–37. - PubMed
    1. Forbes SA, Bindal N, Bamford S, Cole C, Kok CY, Beare D, Jia M, Shepherd R, Leung K, Menzies A. COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic acids research. 2011;39(suppl 1):D945–D950. - PMC - PubMed
    1. Weinstein JN, Collisson EA, Mills GB, Shaw KRM, Ozenberger BA, Ellrott K, Shmulevich I, Sander C, Stuart JM, Network CGAR. The cancer genome atlas pan-cancer analysis project. Nature genetics. 2013;45(10):1113–1120. - PMC - PubMed
    1. Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, Yu F, Bonnen PE, De Bakker P, Deloukas P, Gabriel SB. Integrating common and rare genetic variation in diverse human populations. Nature. 2010;467(7311):52–58. - PMC - PubMed
    1. Clarke L, Zheng-Bradley X, Smith R, Kulesha E, Xiao C, Toneva I, Vaughan B, Preuss D, Leinonen R, Shumway M. The 1000 Genomes Project: data management and community access. Nature methods. 2012;9(5):459–462. - PMC - PubMed

Publication types

Substances

LinkOut - more resources