Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 20;22(3):bbaa065.
doi: 10.1093/bib/bbaa065.

iWhale: a computational pipeline based on Docker and SCons for detection and annotation of somatic variants in cancer WES data

iWhale: a computational pipeline based on Docker and SCons for detection and annotation of somatic variants in cancer WES data

Andrea Binatti et al. Brief Bioinform. .

Abstract

Whole exome sequencing (WES) is a powerful approach for discovering sequence variants in cancer cells but its time effectiveness is limited by the complexity and issues of WES data analysis. Here we present iWhale, a customizable pipeline based on Docker and SCons, reliably detecting somatic variants by three complementary callers (MuTect2, Strelka2 and VarScan2). The results are combined to obtain a single variant call format file for each sample and variants are annotated by integrating a wide range of information extracted from several reference databases, ultimately allowing variant and gene prioritization according to different criteria. iWhale allows users to conduct a complex series of WES analyses with a powerful yet customizable and easy-to-use tool, running on most operating systems (macOs, GNU/Linux and Windows). iWhale code is freely available at https://github.com/alexcoppe/iWhale and the docker image is downloadable from https://hub.docker.com/r/alexcoppe/iwhale.

Keywords: Bioinformatics; Cancer; Docker; Pipeline; Whole exome sequencing.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart diagram of the iWhale pipeline. The software runs under Docker and all the steps are managed by SCons.
Figure 2
Figure 2
(A) Venn diagram displaying the comparison of the detected variants found by the three variant callers used by iWhale. (B) Performance evaluation of iWhale, using simulated paired tumor and control WES data.

References

    1. Vogelstein B, Papadopoulos N, Velculescu VE, et al. Cancer genome landscapes. Science 2013;339:1546–58. - PMC - PubMed
    1. Grandori C, Kemp CJ. Personalized cancer models for target discovery and precision medicine. Trends Cancer Res 2018;4:634–42. - PMC - PubMed
    1. Bailey MH, Tokheim C, Porta-Pardo E, et al. Comprehensive characterization of cancer driver genes and mutations. Cell 2018;174:1034–5. - PMC - PubMed
    1. Brown A-L, Li M, Goncearenco A, et al. Finding driver mutations in cancer: elucidating the role of background mutational processes. PLoS Comput Biol 2019;15:e1006981. - PMC - PubMed
    1. Jin J, Wu X, Yin J, et al. Identification of genetic mutations in cancer: challenge and opportunity in the new era of targeted therapy. Front Oncol 2019;9:263. - PMC - PubMed