SSMAL: similarity searching with alignment graphs
- PMID: 9694989
- DOI: 10.1093/bioinformatics/14.6.508
Abstract
Motivation: We want to provide biologists with a fast and sensitive scanning tool for finding local alignments of a protein query sequence against databases of protein multiple alignments, such as ProDom. Conversely, we want to provide a tool for locally aligning a protein multiple alignment query against a protein database such as SWISSPROT.
Results: We developed the program SSMAL (Shuffling Similarities with Multiple Alignments), which uses features of the Blast algorithm (Altschul et al., J. Mol. Biol., 215, 403-410, 1990) and part of the Blast code. Our software allows both scanning of multiple alignments and searching with a multiple alignment. Only deletions in the multiple alignment are handled, and an SSMAL search may miss some similarities found by a profile search. However, an SSMAL scan of a database such as ProDom would be 20-30 times faster than a profile scan. In the worst case, an SSMAL search is approximately 9 times faster than a profile search.
Availability: See http://www.dkfz-heidelberg.de/tbi/people/nicodeme and follow the SSMAL hyperlink.
Contact: p.nicodeme@DKFZ-Heidelberg.de
Similar articles
- Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment. Bioinformatics. 1998;14(2):164-73. doi: 10.1093/bioinformatics/14.2.164. PMID: 9545449
- A set-theoretic approach to database searching and clustering. Bioinformatics. 1998 Jun;14(5):430-8. doi: 10.1093/bioinformatics/14.5.430. PMID: 9682056
- SALSA: improved protein database searching by a new algorithm for assembly of sequence fragments into gapped alignments. Bioinformatics. 1998;14(10):839-45. doi: 10.1093/bioinformatics/14.10.839. PMID: 9927712
- Sequence Similarity Searching. Curr Protoc Protein Sci. 2019 Feb;95(1):e71. doi: 10.1002/cpps.71. Epub 2018 Aug 13. PMID: 30102464. Review.
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389. PMID: 9254694. Free PMC article. Review.
Cited by
- The SYSTERS protein sequence cluster set. Nucleic Acids Res. 2000 Jan 1;28(1):270-2. doi: 10.1093/nar/28.1.270. PMID: 10592244. Free PMC article.
- Automated search of natively folded protein fragments for high-throughput structure determination in structural genomics. Protein Sci. 2000 Dec;9(12):2313-21. doi: 10.1110/ps.9.12.2313. PMID: 11206052. Free PMC article.
- SYSTERS, GeneNest, SpliceNest: exploring sequence space from genome to protein. Nucleic Acids Res. 2002 Jan 1;30(1):299-300. doi: 10.1093/nar/30.1.299. PMID: 11752319. Free PMC article.