SeqMap: mapping massive amount of oligonucleotides to the genome
- PMID: 18697769
- PMCID: PMC2562015
- DOI: 10.1093/bioinformatics/btn429
SeqMap: mapping massive amount of oligonucleotides to the genome
Abstract
SeqMap is a tool for mapping large amount of short sequences to the genome. It is designed for finding all the places in a reference genome where each sequence may come from. This task is essential to the analysis of data from ultra high-throughput sequencing machines. With a carefully designed index-filtering algorithm and an efficient implementation, SeqMap can map tens of millions of short sequences to a genome of several billions of nucleotides. Multiple substitutions and insertions/deletions of the nucleotide bases in the sequences can be tolerated and therefore detected. SeqMap supports FASTA input format and various output formats, and provides command line options for tuning almost every aspect of the mapping process. A typical mapping can be done in a few hours on a desktop PC. Parallel use of SeqMap on a cluster is also very straightforward.
References
-
- Li R, et al. SOAP: short oligonucleotide alignment program. Bioinformatics. 2008;24:713–714. - PubMed
-
- Manku GS, et al. Proceedings of the 16th international conference on World Wide Web. ACM, New York, NY, USA: 2007. Detecting near-duplicates for web crawling; pp. 141–150.
-
- Mortazavi A, et al. Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat. Methods. 2008;5:621–628. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources