SNPdetector: a software tool for sensitive and accurate SNP detection
- PMID: 16261194
- PMCID: PMC1274293
- DOI: 10.1371/journal.pcbi.0010053
Abstract
Identification of single nucleotide polymorphisms (SNPs) and mutations is important for the discovery of genetic predisposition to complex diseases. PCR resequencing is the method of choice for de novo SNP discovery. However, manual curation of putative SNPs has been a major bottleneck in the application of this method to high-throughput screening. It is therefore critical to develop a more sensitive and accurate computational method for automated SNP detection. We developed a software tool, SNPdetector, for automated identification of SNPs and mutations in fluorescence-based resequencing reads. SNPdetector was designed to model the process of human visual inspection and has very low false-positive and false-negative rates. We demonstrate the superior performance of SNPdetector in SNP and mutation analysis by comparing its results with those derived by human inspection, PolyPhred (a popular SNP detection tool), and independent genotype assays in three large-scale investigations. The first study identified and validated inter- and intra-subspecies variations in 4,650 traces of 25 inbred mouse strains belonging to either Mus musculus or M. spretus. Unexpected heterozygosity in the CAST/Ei strain was observed in two out of 1,167 mouse SNPs. The second study identified 11,241 candidate SNPs in five ENCODE regions of the human genome covering 2.5 Mb of genomic sequence. Approximately 50% of the candidate SNPs were selected for experimental genotyping; the validation rate exceeded 95%. The third study detected ENU-induced mutations (at 0.04% allele frequency) in 64,896 traces of 1,236 zebrafish. Our analysis of three large and diverse test datasets demonstrated that SNPdetector is an effective tool for genome-scale research and for large-sample clinical studies. SNPdetector runs on Unix/Linux platforms and is publicly available (http://lpg.nci.nih.gov).
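The 0.04% allele frequency quoted for the third study can be related to its sample size with a short calculation. The sketch below is an illustration only, not part of SNPdetector: it assumes the figure describes a mutation carried on a single chromosome copy among 1,236 diploid fish, an interpretation the abstract does not state explicitly. The function name `singleton_allele_frequency` is hypothetical.

```python
def singleton_allele_frequency(n_individuals: int, ploidy: int = 2) -> float:
    """Frequency of an allele present on exactly one chromosome copy in a sample.

    Assumption (not stated in the abstract): the mutation occurs once,
    in a single heterozygous carrier, among n_individuals organisms.
    """
    if n_individuals <= 0 or ploidy <= 0:
        raise ValueError("n_individuals and ploidy must be positive")
    # Total chromosome copies sampled = individuals x ploidy.
    return 1 / (n_individuals * ploidy)


if __name__ == "__main__":
    # 1,236 zebrafish, as reported for the third study; diploidy is assumed.
    freq = singleton_allele_frequency(1236)
    print(f"{freq:.4%}")
```

Under these assumptions, one mutant allele among 2,472 chromosome copies comes to roughly 0.04%, consistent with the figure reported above.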