Selecting signature oligonucleotides to identify organisms using DNA arrays
- PMID: 12376378
- DOI: 10.1093/bioinformatics/18.10.1340
Selecting signature oligonucleotides to identify organisms using DNA arrays
Abstract
Motivation: DNA arrays are a very useful tool to quickly identify biological agents present in some given sample, e.g. to identify viruses causing disease, for quality control in the food industry, or to determine bacteria contaminating drinking water. The selection of specific oligos to attach to the array surface is a relevant problem in the experiment design process. Given a set S of genomic sequences (the target sequences), the task is to find at least one oligonucleotide, called probe, for each sequence in S. This probe will be attached to the array surface, and must be chosen in a way that it will not hybridize to any other sequence but the intended target. Furthermore, all probes on the array must hybridize to their intended targets under the same reaction conditions, most importantly at the temperature T at which the experiment is conducted.
Results: We present an efficient algorithm for the probe design problem. Melting temperatures are calculated for all possible probe-target interactions using an extended nearest-neighbor model, allowing for both non-Watson-Crick base-pairing and unpaired bases within a duplex. To compute temperatures efficiently, a combination of suffix trees and dynamic programming based alignment algorithms is introduced. Additional filtering steps during preprocessing increase the speed of the computation. The practicability of the algorithms is demonstrated by two case studies: The identification of HIV-1 subtypes, and of 28S rDNA sequences from >or=400 organisms.
Similar articles
-
Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA.Bioinformatics. 2003 Aug 12;19(12):1461-8. doi: 10.1093/bioinformatics/btg200. Bioinformatics. 2003. PMID: 12912825
-
Optimal robust non-unique probe selection using Integer Linear Programming.Bioinformatics. 2004 Aug 4;20 Suppl 1:i186-93. doi: 10.1093/bioinformatics/bth936. Bioinformatics. 2004. PMID: 15262798
-
PRIMEGENS: robust and efficient design of gene-specific probes for microarray analysis.Bioinformatics. 2002 Nov;18(11):1432-7. doi: 10.1093/bioinformatics/18.11.1432. Bioinformatics. 2002. PMID: 12424113
-
Sequencing by hybridization (SBH): advantages, achievements, and opportunities.Adv Biochem Eng Biotechnol. 2002;77:75-101. doi: 10.1007/3-540-45713-5_5. Adv Biochem Eng Biotechnol. 2002. PMID: 12227738 Review.
-
High-density genechip oligonucleotide probe arrays.Adv Biochem Eng Biotechnol. 2002;77:21-42. doi: 10.1007/3-540-45713-5_2. Adv Biochem Eng Biotechnol. 2002. PMID: 12227735 Review.
Cited by
-
BOND: Basic OligoNucleotide Design.BMC Bioinformatics. 2013 Feb 27;14:69. doi: 10.1186/1471-2105-14-69. BMC Bioinformatics. 2013. PMID: 23444904 Free PMC article.
-
Genome-wide identification of specific oligonucleotides using artificial neural network and computational genomic analysis.BMC Bioinformatics. 2007 May 22;8:164. doi: 10.1186/1471-2105-8-164. BMC Bioinformatics. 2007. PMID: 17518996 Free PMC article.
-
An evaluation of custom microarray applications: the oligonucleotide design challenge.Nucleic Acids Res. 2009 Apr;37(6):1726-39. doi: 10.1093/nar/gkp053. Epub 2009 Feb 10. Nucleic Acids Res. 2009. PMID: 19208645 Free PMC article.
-
A parallel and incremental algorithm for efficient unique signature discovery on DNA databases.BMC Bioinformatics. 2010 Mar 16;11:132. doi: 10.1186/1471-2105-11-132. BMC Bioinformatics. 2010. PMID: 20230647 Free PMC article.
-
Genome-wide selection of unique and valid oligonucleotides.Nucleic Acids Res. 2005 Jul 26;33(13):e115. doi: 10.1093/nar/gni110. Nucleic Acids Res. 2005. PMID: 16049019 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources