Motif recognition and alignment for many sequences by comparison of dot-matrices
- PMID: 1900535
- DOI: 10.1016/0022-2836(91)90871-3
Motif recognition and alignment for many sequences by comparison of dot-matrices
Abstract
Calculation of dot-matrices is a widespread tool in the search for sequence similarities. When sequences are distant, even this approach may fail to point out common regions. If several plots calculated for all members of a sequence set consistently displayed a similarity between them, this would increase its credibility. We present an algorithm to delineate dot-plot agreement. A novel procedure based on matrix multiplication is developed to identify common patterns and reliably aligned regions in a set of distantly related sequences. The algorithm finds motifs independent of input sequence lengths and reduces the dependence on gap penalties. When sequences share greater similarity, the same approach converts to a multiple sequence alignment procedure.
Similar articles
-
Profile analysis: detection of distantly related proteins.Proc Natl Acad Sci U S A. 1987 Jul;84(13):4355-8. doi: 10.1073/pnas.84.13.4355. Proc Natl Acad Sci U S A. 1987. PMID: 3474607 Free PMC article.
-
A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis.Gene. 1995 Dec 29;167(1-2):GC1-10. doi: 10.1016/0378-1119(95)00714-8. Gene. 1995. PMID: 8566757
-
Recognition of distantly related protein sequences using conserved motifs and neural networks.J Mol Biol. 1992 Dec 5;228(3):951-62. doi: 10.1016/0022-2836(92)90877-m. J Mol Biol. 1992. PMID: 1469726
-
Improved tools for biological sequence comparison.Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444-8. doi: 10.1073/pnas.85.8.2444. Proc Natl Acad Sci U S A. 1988. PMID: 3162770 Free PMC article.
-
Sequence alignment and penalty choice. Review of concepts, case studies and implications.J Mol Biol. 1994 Jan 7;235(1):1-12. doi: 10.1016/s0022-2836(05)80006-3. J Mol Biol. 1994. PMID: 8289235 Review.
Cited by
-
Multiple alignment using simulated annealing: branch point definition in human mRNA splicing.Nucleic Acids Res. 1992 May 25;20(10):2511-6. doi: 10.1093/nar/20.10.2511. Nucleic Acids Res. 1992. PMID: 1598209 Free PMC article.
-
The 78,000 M(r) intermediate chain of Chlamydomonas outer arm dynein isa WD-repeat protein required for arm assembly.J Cell Biol. 1995 Apr;129(1):169-78. doi: 10.1083/jcb.129.1.169. J Cell Biol. 1995. PMID: 7698982 Free PMC article.
-
Suppression subtractive hybridization identifies distinctive expression markers for coronary and internal mammary arteries.Arterioscler Thromb Vasc Biol. 2003 Mar 1;23(3):425-33. doi: 10.1161/01.ATV.0000059303.94760.5C. Epub 2003 Jan 30. Arterioscler Thromb Vasc Biol. 2003. PMID: 12615697 Free PMC article.
-
The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools.Nucleic Acids Res. 1997 Dec 15;25(24):4876-82. doi: 10.1093/nar/25.24.4876. Nucleic Acids Res. 1997. PMID: 9396791 Free PMC article.
-
ProbCons: Probabilistic consistency-based multiple sequence alignment.Genome Res. 2005 Feb;15(2):330-40. doi: 10.1101/gr.2821705. Genome Res. 2005. PMID: 15687296 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous