Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015:2015:456479.
doi: 10.1155/2015/456479. Epub 2015 Oct 11.

A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference

Affiliations

A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference

Adam Cornish et al. Biomed Res Int. 2015.

Abstract

High-throughput sequencing, especially of exomes, is a popular diagnostic tool, but it is difficult to determine which tools are the best at analyzing this data. In this study, we use the NIST Genome in a Bottle results as a novel resource for validation of our exome analysis pipeline. We use six different aligners and five different variant callers to determine which pipeline, of the 30 total, performs the best on a human exome that was used to help generate the list of variants detected by the Genome in a Bottle Consortium. Of these 30 pipelines, we found that Novoalign in conjunction with GATK UnifiedGenotyper exhibited the highest sensitivity while maintaining a low number of false positives for SNVs. However, it is apparent that indels are still difficult for any pipeline to handle with none of the tools achieving an average sensitivity higher than 33% or a Positive Predictive Value (PPV) higher than 53%. Lastly, as expected, it was found that aligners can play as vital a role in variant detection as variant callers themselves.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic of the data analysis pipeline used. To ensure that the highest quality alignments are created, reads are first filtered and then aligned to the human reference genome, hg19. Next, PCR duplicates are removed, reads are aligned around putative indels, and base quality scores are recalibrated. Finally, variants are called and validated against the NIST-GiaB list of variants.
Figure 2
Figure 2
Sensitivity as a function of depth for the top five pipelines. The top five pipelines are shown here with the depth of every SNV plotted against sensitivity.
Figure 3
Figure 3
Sensitivity as a function of depth for the top aligner and top variant caller. (a) Results for the depth of every SNV plotted against sensitivity for the top aligner, BWA mem, paired with every variant caller. (b) Results for the depth of every SNV plotted against sensitivity for the top variant caller, GATK UnifiedGenotyper, paired with every aligner.
Figure 4
Figure 4
The intersection of the SNVs identified by the top five pipelines.

References

    1. Paria N., Copley L. A., Herring J. A., et al. Whole-exome sequencing: discovering genetic causes of orthopaedic disorders. Journal of Bone and Joint Surgery—Series A. 2013;95(23):e1851–e1858. doi: 10.2106/jbjs.l.01620. - DOI - PubMed
    1. Biesecker L. G., Green R. C. Diagnostic clinical genome and exome sequencing. The New England Journal of Medicine. 2014;370(25):2418–2425. doi: 10.1056/nejmra1312543. - DOI - PubMed
    1. Delanty N., Goldstein D. B. Diagnostic exome sequencing: a new paradigm in neurology. Neuron. 2013;80(4):841–843. doi: 10.1016/j.neuron.2013.09.011. - DOI - PubMed
    1. Shanks M. E., Downes S. M., Copley R. R., et al. Next-generation sequencing (NGS) as a diagnostic tool for retinal degeneration reveals a much higher detection rate in early-onset disease. European Journal of Human Genetics. 2013;21(3):274–280. doi: 10.1038/ejhg.2012.172. - DOI - PMC - PubMed
    1. Buermans H. P., den Dunnen J. T. Next generation sequencing technology: advances and applications. Biochimica et Biophysica Acta (BBA)—Molecular Basis of Disease. 2014;1842(10):1932–1941. doi: 10.1016/j.bbadis.2014.06.015. - DOI - PubMed

Publication types