The diagonal-traverse homology search algorithm for locating similarities between two sequences
- PMID: 6320108
- PMCID: PMC321090
- DOI: 10.1093/nar/12.1part2.751
Abstract
We present a fast computer algorithm for finding homology between two DNA sequences. It generates a two-dimensional display in which a diagonal string of dots represents a stretch of homology between the two sequences. Our algorithm performs the search very rapidly, and has no internal data storage requirement except for the sequences themselves. These characteristics make it particularly well suited for execution on microcomputers. Without slowing execution, the matching criterion can be that a specified fraction of contiguous bases must be identical. Even with gapped sequences, we have found large search windows to be surprisingly good for detecting poor homologies with nearly complete background suppression. A diagonal search pattern is used that reports the finds in a compact and logically ordered form. A simple and rapid plotting algorithm for unsophisticated printers is also reported.
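The abstract does not give the implementation, so the following is only a minimal sketch of how a diagonal traverse with a fractional matching criterion might look. It assumes a fixed window length and a running match count that is updated as the window slides down each diagonal, which is one way the criterion could be applied without slowing the search or storing anything beyond the two sequences. The names `diagonal_traverse`, `window`, and `min_matches` are hypothetical and do not come from the paper.

```python
def diagonal_traverse(seq_a, seq_b, window=10, min_matches=8):
    """Yield (i, j) window starts where at least `min_matches` of the
    `window` aligned bases seq_a[i:i+window] and seq_b[j:j+window] agree.

    Hypothetical sketch: hits are reported diagonal by diagonal, and the
    only state kept besides the sequences is a single running counter.
    """
    n, m = len(seq_a), len(seq_b)
    # A diagonal is identified by d = j - i. Only diagonals long enough
    # to hold one full window are visited.
    for d in range(-(n - window), m - window + 1):
        i = max(0, -d)          # starting row on this diagonal
        j = i + d               # starting column on this diagonal
        length = min(n - i, m - j)
        if length < window:
            continue
        # Count matches in the first window on the diagonal.
        matches = sum(seq_a[i + k] == seq_b[j + k] for k in range(window))
        if matches >= min_matches:
            yield (i, j)
        # Slide the window one step at a time: drop the base pair that
        # leaves the window and add the one that enters it.
        for k in range(1, length - window + 1):
            matches -= seq_a[i + k - 1] == seq_b[j + k - 1]
            matches += seq_a[i + k + window - 1] == seq_b[j + k + window - 1]
            if matches >= min_matches:
                yield (i + k, j + k)


if __name__ == "__main__":
    # Each reported pair would be one dot in the two-dimensional display.
    for hit in diagonal_traverse("ACGTACGT", "TTACGTAA", window=4, min_matches=4):
        print(hit)
```

Because each step down a diagonal costs two comparisons regardless of the window length, a large window with a relaxed `min_matches` is no more expensive than a small exact one under these assumptions.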