Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Feb 1;28(3):311-7.
doi: 10.1093/bioinformatics/btr665. Epub 2011 Dec 6.

SomaticSniper: identification of somatic point mutations in whole genome sequencing data

Affiliations

SomaticSniper: identification of somatic point mutations in whole genome sequencing data

David E Larson et al. Bioinformatics. .

Abstract

Motivation: The sequencing of tumors and their matched normals is frequently used to study the genetic composition of cancer. Despite this fact, there remains a dearth of available software tools designed to compare sequences in pairs of samples and identify sites that are likely to be unique to one sample.

Results: In this article, we describe the mathematical basis of our SomaticSniper software for comparing tumor and normal pairs. We estimate its sensitivity and precision, and present several common sources of error resulting in miscalls.

Availability and implementation: Binaries are freely available for download at http://gmt.genome.wustl.edu/somatic-sniper/current/, implemented in C and supported on Linux and Mac OS X.

Contact: delarson@wustl.edu; lding@wustl.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Simulations of power across various variant allele frequencies and across different depths in both samples: 30 reads (A), 60 reads (B) and 90 reads (C). Contours indicate the power for each combination of variant allele frequencies. The dashed line indicates simulations where the allele frequencies in tumor and normal are equal and thus, the contours intersecting this line indicate the expected FDR for those allele frequencies.
Fig. 2.
Fig. 2.
Association of erroneous calls with the effective 3 end. Kernel density estimates comparing the distribution of two different measures of variant location across three different validation results. The bandwidth (bw) for each kernel density is shown in the legend. (A) Variants that validated reference show a clear bias toward association with the effective 3 end. (B) All three classes of validated mutations show no clear bias in association with the end of the sequenced read.

References

    1. Carvalho B.S., et al. Quantifying uncertainty in genotype calls. Bioinformatics. 2010;26:242–249. - PMC - PubMed
    1. Ding L., et al. Genome remodelling in a basal-like breast cancer metastasis and xenograft. Nature. 2010;464:999–1005. - PMC - PubMed
    1. Forbes S.A., et al. COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic Acids Res. 2011;39:D945–D950. - PMC - PubMed
    1. Goya R., et al. SNVMix: predicting single nucleotide variants from next-generation sequencing of tumors. Bioinformatics. 2010;26:730–736. - PMC - PubMed
    1. Greenman C., et al. Patterns of somatic mutation in human cancer genomes. Nature. 2007;446:153–158. - PMC - PubMed

Publication types