Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 4;22(1):872.
doi: 10.1186/s12864-021-08094-z.

Discovery of clinically relevant fusions in pediatric cancer

Affiliations

Discovery of clinically relevant fusions in pediatric cancer

Stephanie LaHaye et al. BMC Genomics. .

Abstract

Background: Pediatric cancers typically have a distinct genomic landscape when compared to adult cancers and frequently carry somatic gene fusion events that alter gene expression and drive tumorigenesis. Sensitive and specific detection of gene fusions through the analysis of next-generation-based RNA sequencing (RNA-Seq) data is computationally challenging and may be confounded by low tumor cellularity or underlying genomic complexity. Furthermore, numerous computational tools are available to identify fusions from supporting RNA-Seq reads, yet each algorithm demonstrates unique variability in sensitivity and precision, and no clearly superior approach currently exists. To overcome these challenges, we have developed an ensemble fusion calling approach to increase the accuracy of identifying fusions.

Results: Our Ensemble Fusion (EnFusion) approach utilizes seven fusion calling algorithms: Arriba, CICERO, FusionMap, FusionCatcher, JAFFA, MapSplice, and STAR-Fusion, which are packaged as a fully automated pipeline using Docker and Amazon Web Services (AWS) serverless technology. This method uses paired end RNA-Seq sequence reads as input, and the output from each algorithm is examined to identify fusions detected by a consensus of at least three algorithms. These consensus fusion results are filtered by comparison to an internal database to remove likely artifactual fusions occurring at high frequencies in our internal cohort, while a "known fusion list" prevents failure to report known pathogenic events. We have employed the EnFusion pipeline on RNA-Seq data from 229 patients with pediatric cancer or blood disorders studied under an IRB-approved protocol. The samples consist of 138 central nervous system tumors, 73 solid tumors, and 18 hematologic malignancies or disorders. The combination of an ensemble fusion-calling pipeline and a knowledge-based filtering strategy identified 67 clinically relevant fusions among our cohort (diagnostic yield of 29.3%), including RBPMS-MET, BCAN-NTRK1, and TRIM22-BRAF fusions. Following clinical confirmation and reporting in the patient's medical record, both known and novel fusions provided medically meaningful information.

Conclusions: The EnFusion pipeline offers a streamlined approach to discover fusions in cancer, at higher levels of sensitivity and accuracy than single algorithm methods. Furthermore, this method accurately identifies driver fusions in pediatric cancer, providing clinical impact by contributing evidence to diagnosis and, when appropriate, indicating targeted therapies.

Keywords: Cancer; Gene fusions; Genomics; Pediatric neoplasms; RNA-Seq; Transcriptomics.

PubMed Disclaimer

Conflict of interest statement

No Competing interests: Stephanie LaHaye, James Fitch, Kyle Voytovich, Adam Herman, Benjamin Kelly, Grant Lammi, Jeremy Arbesfeld, Saranga Wijeratne, Kathleen Schieffer, Natalie Bir, Sean McGrath, Anthony Miller, Amy Wetzel, Katherine Miller, Tracy Bedrosian, Kristen Leraas, Elizabeth Varga, Ajay Gupta, Bhuvana Setty, Jeffrey Leonard, Jonathan Finlay, Mohamed Abdelbaki, Diana Osorio, Selene Koo, Daniel Koboldt, Alex Wagner, Ann-Kathrin Eisfeld, Krzysztof Mrozek, Vincent Magrini, Catherine Cottrell, Richard Wilson and Peter White.

Elaine Mardis: Qiagen N.V., supervisory board member, honorarium and stock-based compensation.

Daniel Boué: Illumina (ILMN) share holder.

Figures

Fig. 1
Fig. 1
The EnFusion pipeline identifies true positive fusions. A The EnFusion approach identifies fusions in RNA-Seq data by overlapping results from Arriba, CICERO, FusionCatcher, FusionMap, JAFFA, MapSplice, and STAR-Fusion. It hierarchically prioritizes and filters the fusions utilizing an in-house PostgreSQL database and knowledge-base, prior to producing an output list of predicted fusions. In many cases, detected fusions were orthogonally tested by clinical confirmation in order to return a medically meaningful result. B The EnFusion pipeline was tested on a dilution series of a reference control reagent (SeraCare) to determine sensitivity and limit of detection. We optimized the pipeline using the undiluted reference control reagent, identifying that by requiring ≥3 callers to have overlap for a detected fusion, and by utilizing filtering of known false positive fusion calls and cross-referencing a list of known fusions, all 14 fusions were identified. Colors representing different fusions present in the Seraseq v2 reagent are ordered by their absolute proportions. We then applied the optimized pipeline to the dilution series, showing that the numbers of identified fusions were reduced in serial dilutions, and no fusions were identified in the negative control. All images depicted in the associated figure are the authors’ own and not taken from another source
Fig. 2
Fig. 2
Clinically relevant fusions identified by the EnFusion approach in a pediatric cancer and hematologic disease cohort. A The EnFusion approach, with automated filtering, identifies significantly fewer fusions compared to individual callers. The number of fusions is plotted as log10(x + 1) to account for 0 fusions identified in some cases. Callers are sorted by the lowest median number of fusions identified to the highest. B 67 Clinically relevant fusions were identified, represented as a bar graph with decreasing fusions per individual algorithm, highlighting the sensitivity of the ensemble approach compared to individual algorithms. No individual algorithm was able to identify all 67 fusions. C Of the 67 clinically relevant fusions identified, 30 were interchromosomal chimeric (blue), 29 were intrachromosomal chimeric (orange), 3 were loss of function (green), and 5 were promoter swapping (yellow) fusions. D Of the 67 clinically relevant fusions identified, 7 are novel events (red asterisk), while the remaining 60 fusion partners had been described previously in the literature. E A stacked bar graph represents the individual fusion callers that contributed to each clinically relevant fusion. EnFusion 3 refers to the EnFusion approach with ≥3 callers and EnFusion 2 refers to the EnFusion approach with ≥2 callers. All images depicted in the associated figure are the authors’ own and not taken from another source
Fig. 3
Fig. 3
An RBPMS-MET fusion identified in a patient with an infantile fibrosarcoma-like tumor. A RBPMS-MET fusion was identified by all seven fusion callers in the filtered overlap results. The number of fusions identified by each caller is in the outer VENN diagram sections, while internal numbers indicate overlapping fusions found post-filtering (0 overlaps between callers are not shown). B The RBPMS-MET fusion is an interchromosomal event, occurring between 8p12 and 7q31.2 and joining exon 5 of RBPMS (blue) to exon 15 of MET (red). C The fusion protein product includes the RNA recognition motif domain of RBPMS and the tyrosine kinase catalytic domain of MET. D The RBPMS-MET fusion is predicted to cause constitutive phosphorylation and activation of MET, targetable using cabozantinib. All images depicted in the associated figure are the authors’ own and not taken from another source
Fig. 4
Fig. 4
Targetable NTRK1 fusion identified in an infiltrating glioma. A The BCAN-NTRK1 fusion was identified by 5 of 7 fusion callers, and was the only fusion returned by the filtered overlap results. Total fusions identified by each caller are shown, FusionMap and MapSplice identified no overlapping fusions that passed filtering (0 overlaps between callers are not shown). B The BCAN-NTRK1 fusion is an intrachromosomal event occurring on 1q23.1, joining exon 6 of BCAN (blue) and exon 8 of NTRK1 (red). C This fusion results in the juxtaposition of the tyrosine kinase catalytic domain of the NTRK1 gene to the 5′ end of the BCAN gene. D NTRK1 is highly expressed in this patient (red) compared to CNS tumors (black) in the NCH cohort (CNS tumors: n = 138), with a normalized read count that is 7.70 standard deviations above the mean (131.2). E The BCAN-NTRK1 fusion is predicted to increase expression and activation of the tyrosine kinase NTRK1, which may be inhibited by TRK inhibitor therapy (green). All images depicted in the associated figure are the authors’ own and not taken from another source
Fig. 5
Fig. 5
Identification of a novel BRAF fusion in a mixed neuronal-glial tumor. A The TRIM22-BRAF fusion was identified by all seven fusion callers and in the filtered overlap results, total fusions identified by each caller and overlapping fusions are shown (0 overlaps between callers are not shown). B The TRIM22-BRAF fusion is an interchromosomal event between 11p15.4 and 7q34, joining exon 2 of TRIM22 (blue) to exon 9 of BRAF (red). C The resulting fusion product contains the 5′ TRIM22 zinc finger binding domains and BRAF tyrosine kinase catalytic domain. D Single sample gene set enrichment analysis (ssGSEA) indicates a trend toward an enrichment of the MEK (above the 75th percentile, 0.68 standard deviations above the mean of 22,756.87), RAF (above the 75th percentile, 0.60 standard deviations above the mean of 22,635.74), and mTOR (above the 75th percentile, 0.72 standard deviations above the mean of 22,191.50) upregulated gene sets in the TRIM22-BRAF sample (red) compared to the pan-cancer NCH cohort (black) (pan-cancer cohort: n = 229). E The TRIM22-BRAF fusion is predicted to cause constitutive dimerization and activation of the BRAF kinase domain, shown in D, which could be targeted by RAF, MEK, and mTOR inhibitors (green). All images depicted in the associated figure are the authors’ own and not taken from another source

References

    1. Steliarova-Foucher E, Colombet M, Ries LAG, Moreno F, Dolya A, Bray F, Hesseling P, Shin HY, Stiller CA. IICC-3 contributors: international incidence of childhood cancer, 2001-10: a population-based registry study. Lancet Oncol. 2017;18(6):719–731. doi: 10.1016/S1470-2045(17)30186-9. - DOI - PMC - PubMed
    1. Amatu A, Sartore-Bianchi A, Siena S. NTRK gene fusions as novel targets of cancer therapy across multiple tumour types. ESMO Open. 2016;1(2):e000023. doi: 10.1136/esmoopen-2015-000023. - DOI - PMC - PubMed
    1. Pui CH, Gajjar AJ, Kane JR, Qaddoumi IA, Pappo AS. Challenging issues in pediatric oncology. Nat Rev Clin Oncol. 2011;8(9):540–549. doi: 10.1038/nrclinonc.2011.95. - DOI - PMC - PubMed
    1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2016. CA Cancer J Clin. 2016;66(1):7–30. doi: 10.3322/caac.21332. - DOI - PubMed
    1. Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA, Jr, Kinzler KW. Cancer genome landscapes. Science. 2013;339(6127):1546–1558. doi: 10.1126/science.1235122. - DOI - PMC - PubMed