Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA
- PMID: 19497932
- PMCID: PMC2722999
- DOI: 10.1093/bioinformatics/btp344
Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNA
Abstract
Summary: Here, we present a method for estimating the frequencies of SNP alleles present within pooled samples of DNA using high-throughput short-read sequencing. The method was tested on real data from six strains of the highly monomorphic pathogen Salmonella Paratyphi A, sequenced individually and in a pool. A variety of read mapping and quality-weighting procedures were tested to determine the optimal parameters, which afforded > or =80% sensitivity of SNP detection and strong correlation with true SNP frequency at poolwide read depth of 40x, declining only slightly at read depths 20-40x.
Availability: The method was implemented in Perl and relies on the opensource software Maq for read mapping and SNP calling. The Perl script is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/pools/.
References
-
- Falush D, Bowden R. Genome-wide association mapping in bacteria? Trends Microbiol. 2006;14:353–355. - PubMed
-
- Keim P, et al. Anthrax molecular epidemiology and forensics: using the appropriate marker for different evolutionary scales. Infect. Genet. Evol. 2004;4:205–213. - PubMed
-
- Kidgell C, et al. Salmonella typhi the causative agent of typhoid fever, is approximately 50 000 years old. Infect. Genet. Evol. 2002;2:39–45. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
