Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Sep 16;10(9):e0138141.
doi: 10.1371/journal.pone.0138141. eCollection 2015.

High Prevalence and Clinical Relevance of Genes Affected by Chromosomal Breaks in Colorectal Cancer

Affiliations

High Prevalence and Clinical Relevance of Genes Affected by Chromosomal Breaks in Colorectal Cancer

Evert van den Broek et al. PLoS One. .

Abstract

Background: Cancer is caused by somatic DNA alterations such as gene point mutations, DNA copy number aberrations (CNA) and structural variants (SVs). Genome-wide analyses of SVs in large sample series with well-documented clinical information are still scarce. Consequently, the impact of SVs on carcinogenesis and patient outcome remains poorly understood. This study aimed to perform a systematic analysis of genes that are affected by CNA-associated chromosomal breaks in colorectal cancer (CRC) and to determine the clinical relevance of recurrent breakpoint genes.

Methods: Primary CRC samples of patients with metastatic disease from CAIRO and CAIRO2 clinical trials were previously characterized by array-comparative genomic hybridization. These data were now used to determine the prevalence of CNA-associated chromosomal breaks within genes across 352 CRC samples. In addition, mutation status of the commonly affected APC, TP53, KRAS, PIK3CA, FBXW7, SMAD4, BRAF and NRAS genes was determined for 204 CRC samples by targeted massive parallel sequencing. Clinical relevance was assessed upon stratification of patients based on gene mutations and gene breakpoints that were observed in >3% of CRC cases.

Results: In total, 748 genes were identified that were recurrently affected by chromosomal breaks (FDR <0.1). MACROD2 was affected in 41% of CRC samples and another 169 genes showed breakpoints in >3% of cases, indicating that prevalence of gene breakpoints is comparable to the prevalence of well-known gene point mutations. Patient stratification based on gene breakpoints and point mutations revealed one CRC subtype with very poor prognosis.

Conclusions: We conclude that CNA-associated chromosomal breaks within genes represent a highly prevalent and clinically relevant subset of SVs in CRC.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. CNA-associated gene breakpoint detection.
(A) Array-CGH DNA copy number profile of one CRC sample. The X-axis depicts chromosomes 1–22 and X (numbered 23) with chromosome boundaries indicated by vertical dotted lines. The Y-axis depicts the log2 ratio of the amount of tumor DNA versus patient-matched normal DNA. Black dots represent individual array-CGH probe datapoints. The blue horizontal lines represent DNA segments, indicative for tumor DNA copy number aberrations when deviating from 0. The green arrow and bar indicate the region of chromosome 6 that is highlighted in Fig 1B. (B) Enlargement of Fig 1A (green bar). Vertical dotted red lines indicate genomic locations of CNA-associated chromosomal breakpoints, i.e. the genomic positions where log2 ratios of DNA segments change. (C) Frequency plot of CNA-associated chromosomal breakpoints on the q-arm of chromosome 13. The X-axis depicts the genomic position in Mb. The Y-axis depicts the chromosomal breakpoint frequencies across the cohort of 352 CRC samples. Breakpoint frequencies are indicated on array-CGH probe-level (vertical black bars) and on gene-level (horizontal red bars). Recurrent breakpoint genes (FDR<0.1) are named. The green arrow and bar indicate the PIBF1 region of chromosome 13q that is highlighted in Fig 1D. (D) Enlargement of Fig 1C (green bar), which illustrates that PIBF1 gene breakpoints are concentrated at the distal part of the gene. Neighbouring genes that do not harbor significant breakpoint recurrence rates are indicated in blue.
Fig 2
Fig 2. Gene breakpoint and gene mutation frequencies of the 25 most frequently affected genes in CRC.
Gene breakpoint frequencies (red bars) were based on the analysis of 352 CRC samples and gene mutation frequencies (blue bars) on the analysis of 204 samples. Genes marked with a “*” indicate a pool of genes that share probe(s) associated with chromosomal breakpoints: the PCMTD2* pool also includes LINC00266-1; PARK2* also includes PACRG; ZNF337* also includes NCOR1P1, FAM182A, FAM182B, FRG1B, MIR663A, MLLT10P1; CD99* also includes XG; PARP8* also includes EMB.
Fig 3
Fig 3. Clustering of 203 CRC patients by NBS based on gene breakpoints and gene mutations revealed four CRC subtypes.
(A) Co-clustering matrix of CRC samples generated by NBS analysis. The matrix color intensity represents the similarity score. The color bar on top indicates the groups of patients related to the four CRC subtypes (k = 4) as determined by hierarchical clustering after NBS analysis. (B) Kaplan-Meier plot for overall survival (in days) of CRC subtype 1 (n = 80 patients), subtype 2 (n = 45 patients), subtype 3 (n = 27 patients) and subtype 4 (n = 51 patients). There are significant differences in OS among the four CRC subtypes (log-rank P = 0.001), with poorest OS for subtype 3 CRC patients. (C) Kaplan-Meier plot for OS of CRC subtype 3 patients versus patients in other CRC subtypes, showing a hazard ratio (HR) of 2.17 and a median OS of 392 days versus 610 days, respectively (log-rank P = 0.0002).
Fig 4
Fig 4. Four distinct core modules of putative CRC driver genes were retrieved by Multi-Dendrix analysis.
The nodes comprise both gene breakpoints (red outline) and gene mutations (blue outline). Edges (grey lines) connect genes that are mutually exclusively affected. The thickness of the grey lines and the corresponding number reflect the robustness score. The strongest mutual exclusivity is observed between PIBF1 and TP53. Genes marked with a “*” indicate a pool of genes that share probe(s) associated with chromosomal breakpoints: the ZNF337* pool also includes NCOR1P1, FAM182A, FAM182B, FRG1B, MIR663A, MLLT10P1; ZNF519* also includes ANKRD20A5P, ANKRD30B.

References

    1. Stratton MR, Campbell PJ, Futreal PA. The cancer genome. Nature. 2009;458: 719–724. 10.1038/nature07943 - DOI - PMC - PubMed
    1. Edwards PAW. Fusion genes and chromosome translocations in the common epithelial cancers. J Pathol. 2010;220: 244–254. 10.1002/path.2632 - DOI - PubMed
    1. Bass AJ, Lawrence MS, Brace LE, Ramos AH, Drier Y, Cibulskis K, et al. Genomic sequencing of colorectal adenocarcinomas identifies a recurrent VTI1A-TCF7L2 fusion. Nat Genet. 2011;43: 964–968. 10.1038/ng.936 - DOI - PMC - PubMed
    1. Cancer Genome Atlas Network. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487: 330–337. 10.1038/nature11252 - DOI - PMC - PubMed
    1. Seshagiri S, Stawiski EW, Durinck S, Modrusan Z, Storm EE, Conboy CB, et al. Recurrent R-spondin fusions in colon cancer. Nature. 2012;488: 660–664. 10.1038/nature11282 - DOI - PMC - PubMed

Publication types

Substances

Associated data