Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Feb 16:13:820493.
doi: 10.3389/fgene.2022.820493. eCollection 2022.

Fusion Gene Detection Using Whole-Exome Sequencing Data in Cancer Patients

Affiliations

Fusion Gene Detection Using Whole-Exome Sequencing Data in Cancer Patients

Wenjiang Deng et al. Front Genet. .

Abstract

Several fusion genes are directly involved in the initiation and progression of cancers. Numerous bioinformatics tools have been developed to detect fusion events, but they are mainly based on RNA-seq data. The whole-exome sequencing (WES) represents a powerful technology that is widely used for disease-related DNA variant detection. In this study, we build a novel analysis pipeline called Fuseq-WES to detect fusion genes at DNA level based on the WES data. The same method applies also for targeted panel sequencing data. We assess the method to real datasets of acute myeloid leukemia (AML) and prostate cancer patients. The result shows that two of the main AML fusion genes discovered in RNA-seq data, PML-RARA and CBFB-MYH11, are detected in the WES data in 36 and 63% of the available samples, respectively. For the targeted deep-sequencing of prostate cancer patients, detection of the TMPRSS2-ERG fusion, which is the most frequent chimeric alteration in prostate cancer, is 91% concordant with a manually curated procedure based on four other methods. In summary, the overall results indicate that it is challenging to detect fusion genes in WES data with a standard coverage of ∼ 15-30x, where fusion candidates discovered in the RNA-seq data are often not detected in the WES data and vice versa. A subsampling study of the prostate data suggests that a coverage of at least 75x is necessary to achieve high accuracy.

Keywords: acute myeloid leukemia; discordant read; fusion gene; prostate cancer; split read; whole exome sequencing.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Workflow of Fuseq-WES to detect fusion genes from whole-exome sequencing data.
FIGURE 2
FIGURE 2
Construction of fusion equivalence class and fusion transcripts; prediction of fusion genes.

References

    1. Bao R., Huang L., Andrade J., Tan W., Kibbe W. A., Jiang H., et al. (2014). Review of Current Methods, Applications, and Data Management for the Bioinformatics Analysis of Whole Exome Sequencing. Cancer Inform. 13, 67–82. 10.4137/CIN.S13779 - DOI - PMC - PubMed
    1. Breese M. R., Liu Y. (2013). Ngsutils: a Software Suite for Analyzing and Manipulating Next-Generation Sequencing Datasets. Bioinformatics 29, 494–496. 10.1093/bioinformatics/bts731 - DOI - PMC - PubMed
    1. Brennan C. W., Verhaak R. G., McKenna A., Campos B., Noushmehr H., Salama S. R., et al. (2013). The Somatic Genomic Landscape of Glioblastoma. Cell 155, 462–477. 10.1016/j.cell.2013.09.034 - DOI - PMC - PubMed
    1. Bruno R., Fontanini G. (2020). Next Generation Sequencing for Gene Fusion Analysis in Lung Cancer: a Literature Review. Diagnostics 10, 521. 10.3390/diagnostics10080521 - DOI - PMC - PubMed
    1. Cameron D. L., Schröder J., Penington J. S., Do H., Molania R., Dobrovic A., et al. (2017). Gridss: sensitive and specific genomic rearrangement detection using positional de bruijn graph assembly. Genome Res. 27, 2050–2060. 10.1101/gr.222109.117 - DOI - PMC - PubMed

LinkOut - more resources