Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Apr 3;23(1):227-238.e3.
doi: 10.1016/j.celrep.2018.03.050.

Driver Fusions and Their Implications in the Development and Treatment of Human Cancers

Collaborators, Affiliations

Driver Fusions and Their Implications in the Development and Treatment of Human Cancers

Qingsong Gao et al. Cell Rep. .

Abstract

Gene fusions represent an important class of somatic alterations in cancer. We systematically investigated fusions in 9,624 tumors across 33 cancer types using multiple fusion calling tools. We identified a total of 25,664 fusions, with a 63% validation rate. Integration of gene expression, copy number, and fusion annotation data revealed that fusions involving oncogenes tend to exhibit increased expression, whereas fusions involving tumor suppressors have the opposite effect. For fusions involving kinases, we found 1,275 with an intact kinase domain, the proportion of which varied significantly across cancer types. Our study suggests that fusions drive the development of 16.5% of cancer cases and function as the sole driver in more than 1% of them. Finally, we identified druggable fusions involving genes such as TMPRSS2, RET, FGFR3, ALK, and ESR1 in 6.0% of cases, and we predicted immunogenic peptides, suggesting that fusions may provide leads for targeted drug and immune therapy.

Keywords: RNA; cancer; fusion; gene fusions; translocation.

PubMed Disclaimer

Conflict of interest statement

DECLARATION OF INTERESTS

Michael Seiler, Peter G. Smith, Ping Zhu, Silvia Buonamici, and Lihua Yu are employees of H3 Biomedicine, Inc. Parts of this work are the subject of a patent application: WO2017040526 titled “Splice variants associated with neomorphic sf3b1 mutants.” Shouyoung Peng, Anant A. Agrawal, James Palacino, and Teng Teng are employees of H3 Biomedicine, Inc. Andrew D. Cherniack, Ashton C. Berger, and Galen F. Gao receive research support from Bayer Pharmaceuticals. Gordon B. Mills serves on the External Scientific Review Board of Astrazeneca. Anil Sood is on the Scientific Advisory Board for Kiyatec and is a shareholder in BioPath. Jonathan S. Serody receives funding from Merck, Inc. Kyle R. Covington is an employee of Castle Biosciences, Inc. Preethi H. Gunaratne is founder, CSO, and shareholder of NextmiRNA Therapeutics. Christina Yau is a part-time employee/consultant at NantOmics. Franz X. Schaub is an employee and shareholder of SEngine Precision Medicine, Inc. Carla Grandori is an employee, founder, and shareholder of SEngine Precision Medicine, Inc. Robert N. Eisenman is a member of the Scientific Advisory Boards and shareholder of Shenogen Pharma and Kronos Bio. Daniel J. Weisenberger is a consultant for Zymo Research Corporation. Joshua M. Stuart is the founder of Five3 Genomics and shareholder of NantOmics. Marc T. Goodman receives research support from Merck, Inc. Andrew J. Gentles is a consultant for Cibermed. Charles M. Perou is an equity stock holder, consultant, and Board of Directors member of BioClassifier and GeneCentric Diagnostics and is also listed as an inventor on patent applications on the Breast PAM50 and Lung Cancer Subtyping assays. Matthew Meyerson receives research support from Bayer Pharmaceuticals; is an equity holder in, consultant for, and Scientific Advisory Board chair for OrigiMed; and is an inventor of a patent for EGFR mutation diagnosis in lung cancer, licensed to LabCorp. Eduard Porta-Pardo is an inventor of a patent for domainXplorer. Han Liang is a shareholder and scientific advisor of Precision Scientific and Eagle Nebula. Da Yang is an inventor on a pending patent application describing the use of antisense oligonucleotides against specific lncRNA sequence as diagnostic and therapeutic tools. Yonghong Xiao was an employee and shareholder of TESARO, Inc. Bin Feng is an employee and shareholder of TESARO, Inc. Carter Van Waes received research funding for the study of IAP inhibitor ASTX660 through a Cooperative Agreement between NIDCD, NIH, and Astex Pharmaceuticals. Raunaq Malhotra is an employee and shareholder of Seven Bridges, Inc. Peter W. Laird serves on the Scientific Advisory Board for AnchorDx. Joel Tepper is a consultant at EMD Serono. Kenneth Wang serves on the Advisory Board for Boston Scientific, Microtech, and Olympus. Andrea Califano is a founder, shareholder, and advisory board member of DarwinHealth, Inc. and a shareholder and advisory board member of Tempus, Inc. Toni K. Choueiri serves as needed on advisory boards for Bristol-Myers Squibb, Merck, and Roche. Lawrence Kwong receives research support from Array BioPharma. Sharon E. Plon is a member of the Scientific Advisory Board for Baylor Genetics Laboratory. Beth Y. Karlan serves on the Advisory Board of Invitae.

Figures

Figure 1
Figure 1. Fusion Detection and Landscape in Cancer
(A) Fusion calling and filtering pipeline. (B) Cartoon overview of fusion gene partner breakpoints. Purple indicates the 5′ gene partner, and green indicates the 3′ gene partner. For both the 5′ and 3′ gene partners, fusion gene breakpoints can occur in the following genomic regions: 5′ UTR (triangle), coding sequence (CDS; rectangle), 3′ UTR (circle), and non-coding region (rounded rectangle). For each fusion event, a dotted line connects the breakpoints in the 5′ and 3′ gene partners to create the predicted fusion and the circle size, while number represents the total fusion events classified into the associated fusion category. (C) The dot plot shows the frequency of recurrent fusions found in each cancer type. The most recurrent fusion in each cancer type is labeled. Cancer types without recurrent fusions are not shown.
Figure 2
Figure 2. Fusion Expression Outliers
(A) The dot plot indicates the percentage of fusions called in which one of the partner genes is an expression outlier (overexpression or underexpression). The size of the dot corresponds to the number of fusions called in each cancer type. Color corresponds to genes of interest coming from lists of oncogenes, protein kinases, and tumor suppressor genes. (B) The dot plot shows the relative expression level of samples with fusions compared with those without fusions. Each sample has a particular expression percentile at a given gene, and color indicates the median percentile of samples with a fusion in that gene. Genes are the 15 most recurrent oncogenes and tumor suppressor genes. Size corresponds to the number of samples in each cancer type with a fusion at that gene. (C and D) Expression of samples at RET and CBFB in thyroid carcinoma (THCA) (C) and acute myeloid leukemia (LAML) (D), respectively. Color indicates a categorical copy number ranging from deep deletion to high amplification.
Figure 3
Figure 3. Protein Kinase Fusions
(A) The bar chart indicates the number of protein kinase fusions with the kinase at the 5′ or 3′ end, inframe or frameshift, and kinase domain intact or disrupted. (B) The left bar plot shows the percentage of samples with kinase fusions across different cancer types. The number of samples with a kinase fusion is also indicated at the end of each bar. Light green and blue denote 5′ kinase and 3′ kinase fusions, respectively. The right bar plot shows the normalized percentage of kinase fusions broken down by kinase groups. (C) The dot plot shows the numbers of samples for recurrent fusions across different cancer types. Light green and blue denote 5′ kinase and 3′ kinase fusions, respectively.
Figure 4
Figure 4. Kinase Gene Expression Regulated by Fusion
(A) The scatterplot shows the gene expression quantile (y axis) for the 5′-kinase without copy number variation (between one and three copies; x axis). All genes are classified among three categories: kinase expression higher, equal, and lower, compared with partner expression, marked in blue, gray, and red, respectively. The density plot for expression quantile is also shown on the right. (B) The scatterplot shows the gene expression quantile (y axis) for the 3′-kinase without copy number variation (between one and three copies; x axis). The colors represent the same three categories as (A). The density plot for expression quantile is also shown. (C) Boxplot comparing the distribution of kinase gene expression quantile between the three groups defined in (A) for 5′-kinase and 3′-kinase, respectively. (D) Schematic of TBABD–DDR2 fusion gene structure in an HNSC sample and scatterplot of DDR2 copy number versus mRNA expression in HNSC. The samples with and without this fusion are marked in red and blue, respectively.
Figure 5
Figure 5. Mutual Exclusivity between Driver Mutations and Driver Fusions
(A) The bar plot shows the percentages of samples with driver mutations only (green), mutations only (orange), driver mutation and fusion (blue), mutation and fusion (pink), or fusion only (light green) events in 299 cancer driver genes. (B) Distribution of mutation burden across each alteration group designated in all figures. (C) All samples with fusions or mutations in any of the genes indicated on the left are displayed on the x axis. For each gene, samples are clustered by the alteration group. Bottom bar indicates cancer type.
Figure 6
Figure 6. Druggable Fusion Targets
(A) The bar chart indicates the number of samples potentially treatable on the basis of their fusion status. (B) Percentages of LUAD samples with known smoking status. (C) ESR1 domains kept in ESR1 fusions across cancer types. (D) ALK expression across cancer types indicating ALK fusion status.

References

    1. Andreatta M, Nielsen M. Gapped sequence alignment using artificial neural networks: application to the MHC class I system. Bioinformatics. 2016;32:511–517. - PMC - PubMed
    1. Babiceanu M, Qin F, Xie Z, Jia Y, Lopez K, Janus N, Facemire L, Kumar S, Pang Y, Qi Y, et al. Recurrent chimeric fusion RNAs in non-cancer tissues and cells. Nucleic Acids Res. 2016;44:2859–2872. - PMC - PubMed
    1. Bailey MH, Tokheim C, Porta-Pardo E, Sengupta S, Bertrand D, Weerasinghe A, Colaprico A, Wendl MC, Kim J, Reardon B, et al. Comprehensive Characterization of Cancer Driver Genes and Mutations. Cell. 2018;173 https://doi.org/10.1016/j.cell.2018.02.060. - DOI - PMC - PubMed
    1. Bass AJ, Lawrence MS, Brace LE, Ramos AH, Drier Y, Cibulskis K, Sougnez C, Voet D, Saksena G, Sivachenko A, et al. Genomic sequencing of colorectal adenocarcinomas identifies a recurrent VTI1A-TCF7L2 fusion. Nat Genet. 2011;43:964–968. - PMC - PubMed
    1. Benelli M, Pescucci C, Marseglia G, Severgnini M, Torricelli F, Magi A. Discovering chimeric transcripts in paired-end RNA-seq data by using EricScript. Bioinformatics. 2012;28:3232–3239. - PubMed

Publication types

MeSH terms

Substances