Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 23:5:11966.
doi: 10.1038/srep11966.

Identification of a multi-cancer gene expression biomarker for cancer clinical outcomes using a network-based algorithm

Affiliations

Identification of a multi-cancer gene expression biomarker for cancer clinical outcomes using a network-based algorithm

Emmanuel Martinez-Ledesma et al. Sci Rep. .

Abstract

Cancer types are commonly classified by histopathology and more recently through molecular characteristics such as gene expression, mutations, copy number variations, and epigenetic alterations. These molecular characterizations have led to the proposal of prognostic biomarkers for many cancer types. Nevertheless, most of these biomarkers have been proposed for a specific cancer type or even specific subtypes. Although more challenging, it is useful to identify biomarkers that can be applied for multiple types of cancer. Here, we have used a network-based exploration approach to identify a multi-cancer gene expression biomarker highly connected by ESR1, PRKACA, LRP1, JUN and SMAD2 that can be predictive of clinical outcome in 12 types of cancer from The Cancer Genome Atlas (TCGA) repository. The gene signature of this biomarker is highly supported by cancer literature, biological terms, and prognostic power in other cancer types. Additionally, the signature does not seem to be highly associated with specific mutations or copy number alterations. Comparisons with cancer-type specific and other multi-cancer biomarkers in TCGA and other datasets showed that the performance of the proposed multi-cancer biomarker is superior, making the proposed approach and multi-cancer biomarker potentially useful in research and clinical settings.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interests.

Figures

Figure 1
Figure 1. Schematic representation of the network clinical association algorithm (NCA).
Starting from a single seed gene (black), the first cycle generates modules that include the seed gene and each of the connected genes (blue). The 6 modules of 2 genes are then evaluated by their goodness of fit in a Cox survival model. Only those grown modules that improve (filled blue circles) the evaluation are considered for the next grow cycle. Only a proportion of the best improved modules are further explored in the next cycle (represented by a percentage of the distribution of all modules, shown in green, evaluated in the 4th cycle). This procedure continues until no improvement is observed. The NCA algorithm was run for each cancer type and for all cancer datasets (multi-NCA).
Figure 2
Figure 2. Comparison of biomarkers generated by the network clinical association (NCA) algorithm.
Panel A shows the number of genes that were included in any two biomarkers. Underlined numbers represent the number of genes per biomarker. Red indicates high overlaps and blue indicates no overlap. The “Sum” row shows the total number of overlaps with other biomarkers while the “Unique” row shows the number of unique genes that overlap. Panel B shows the C-index evaluation of NCA biomarkers (rows) across cancer datasets (columns). Underlined numbers represent the biomarkers evaluated within the cancer dataset. Red indicates high values within the cancer dataset (column) and blue indicates low values. Boldface and framed values represent significant predictions using 10,000 random models of the same length. The “Average” row shows the average C-index per cancer type and the “Average” column shows the mean C-index per biomarker. Panel C shows the NCA biomarkers (horizontal) evaluated in all datasets using C-index (vertical axis). The mean is shown as a horizontal line. Panel D shows cancer types (horizontal) evaluated with all biomarkers using C-index (vertical axis).
Figure 3
Figure 3. The multi-NCA biomarker identified when all databases were combined.
Panel A shows the genes and network identified. The connections correspond to data from the PPI database used. The most connected genes were PRKACA, ESR1, LRP1, SMAD2 and JUN. Panel B shows the risk group prediction (splitting the prognostic index by the median) of the multi-NCA biomarker across cancer datasets. Panel C depict the color-coded differential expression of genes between risk groups. Darker red indicates more significant differences. The scales were estimated in -Log10 of the t test p value. Only p values <0.01 are highlighted. Darker purple indicates more significant hazard ratio associations within the Cox model. The scales were expressed in -Log10 of the Z p value. Only p values <0.05 are highlighted. Panel D shows, in the top, the curated biological terms and pathways associated with the genes composing the biomarker. The associations of genes with specific cancers based on the literature are shown at the bottom.
Figure 4
Figure 4. Evaluation of all biomarkers in SurvExpress using C-index.
PGC biomarker derived from other authors is not shown (0.74 for all datasets and 0.81 for per tissue) to emphasize biomarkers with higher C-index values.

References

    1. Helpap B. & Egevad L. Modified Gleason grading. An updated review. Histol Histopathol 24, 661–666 (2009). - PubMed
    1. Astler V. B. & Coller F. A. The prognostic significance of direct extension of carcinoma of the colon and rectum. Ann Surg 139, 846–852 (1954). - PMC - PubMed
    1. Mutch D. M., Berger A., Mansourian R., Rytz A. & Roberts M. A. Microarray data analysis: a practical approach for selecting differentially expressed genes. Genome Biol 2, PREPRINT0009 (2001). - PubMed
    1. Edge S. B. & Compton C. C. The American Joint Committee on Cancer: the 7th edition of the AJCC cancer staging manual and the future of TNM. Ann Surg Oncol 17, 1471–1474, 10.1245/s10434-010-0985-4 (2010). - DOI - PubMed
    1. Brennan C. W. et al. The somatic genomic landscape of glioblastoma. Cell 155, 462–477, 10.1016/j.cell.2013.09.034 (2013). - DOI - PMC - PubMed

Publication types

Substances