Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Jan;33(1):3-18.
doi: 10.3892/or.2014.3579. Epub 2014 Oct 31.

Human cancer databases (review)

Affiliations
Review

Human cancer databases (review)

Athanasia Pavlopoulou et al. Oncol Rep. 2015 Jan.

Abstract

Cancer is one of the four major non‑communicable diseases (NCD), responsible for ~14.6% of all human deaths. Currently, there are >100 different known types of cancer and >500 genes involved in cancer. Ongoing research efforts have been focused on cancer etiology and therapy. As a result, there is an exponential growth of cancer‑associated data from diverse resources, such as scientific publications, genome‑wide association studies, gene expression experiments, gene‑gene or protein‑protein interaction data, enzymatic assays, epigenomics, immunomics and cytogenetics, stored in relevant repositories. These data are complex and heterogeneous, ranging from unprocessed, unstructured data in the form of raw sequences and polymorphisms to well‑annotated, structured data. Consequently, the storage, mining, retrieval and analysis of these data in an efficient and meaningful manner pose a major challenge to biomedical investigators. In the current review, we present the central, publicly accessible databases that contain data pertinent to cancer, the resources available for delivering and analyzing information from these databases, as well as databases dedicated to specific types of cancer. Examples for this wealth of cancer‑related information and bioinformatic tools have also been provided.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example of searching for and downloading data from The Cancer Genome Atlas (TCGA). (A) Attributes are selected and (B) the results are presented on a different page. The user can select samples by clicking on them (shadowed). The data can be downloaded by clicking on the ‘Build Archive’ button.
Figure 2
Figure 2
Example of querying COSMIC database. (A) Cancer attributes are selected. (B) A list of the top 20 genes involved in the particular type of cancer. (C) A list of the mutations along with links and pertinent information and (D) distribution of the mutations present in the KRAS gene.
Figure 2
Figure 2
Example of querying COSMIC database. (A) Cancer attributes are selected. (B) A list of the top 20 genes involved in the particular type of cancer. (C) A list of the mutations along with links and pertinent information and (D) distribution of the mutations present in the KRAS gene.
Figure 3
Figure 3
Example of querying Integrated Tumor Transcriptome Array and Clinical data Analysis (ITTACA) database. (A) A study was selected and (B) a new set of patient groups from clinical parameters was added. In a new page, (C) the values ‘alive’ and ‘dead of disease’ under the clinical parameter ‘Patient status’ were chosen. (D) Two different groups corresponding to the two different patient statuses were evident. The patient groups were subsequently analyzed using a survival curve. (E) A Kaplan-Meier curve based on overall survival was generated.
Figure 4
Figure 4
Screenshots showing (A) the results of selecting Breast Cancer (BC) as query. (B) A table is provided where the differentially expressed proteins (DEPs) in BC are shown, including pertinent information and links. By selecting two experiments, (C) the results are returned in a tabulated form where the DEPs in the two experiments are shown. There are links to Universal Protein Resource (UniProt) and back to the experiments.
Figure 5
Figure 5
Example of querying IntOGen for the genes lost in a type of bladder cancer. Dataset, Filters and Attributes were selected. By clicking on the ‘Results’ button, the results are returned and can be viewed and exported in several formats.
Figure 6
Figure 6
Results of a gene-centered search of Cervical Cancer gene DataBase (CCDB) using BCL2 as query. Information is provided regarding gene ID, gene description, synonyms, chromosomal location and the molecules with which the gene interacts. There are also links to mRNA/CCDS/Protein sequence entries, to homologous genes from various species and gene ontology information.

References

    1. International Cancer Genome Consortium. Hudson TJ, Anderson W, Artez A, et al. International network of cancer genome projects. Nature. 2010;464:993–998. doi: 10.1038/nature08987. - DOI - PMC - PubMed
    1. Cancer Genome Atlas Research Network. Weinstein JN, Collisson EA, Mills GB, et al. The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet. 2013;45:1113–1120. doi: 10.1038/ng.2764. - DOI - PMC - PubMed
    1. Zhang J, Baran J, Cros A, et al. International Cancer Genome Consortium Data Portal - a one-stop shop for cancer genomics data. Database (Oxford) 2011. 2011:bar026. - PMC - PubMed
    1. Parsons DW, Jones S, Zhang X, et al. An integrated genomic analysis of human glioblastoma multiforme. Science. 2008;321:1807–1812. doi: 10.1126/science.1164382. - DOI - PMC - PubMed
    1. Jones S, Zhang X, Parsons DW, et al. Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321:1801–1806. doi: 10.1126/science.1164368. - DOI - PMC - PubMed