Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1983 Feb;80(3):726-30.
doi: 10.1073/pnas.80.3.726.

Rapid similarity searches of nucleic acid and protein data banks

Rapid similarity searches of nucleic acid and protein data banks

W J Wilbur et al. Proc Natl Acad Sci U S A. 1983 Feb.

Abstract

With the development of large data banks of protein and nucleic acid sequences, the need for efficient methods of searching such banks for sequences similar to a given sequence has become evident. We present an algorithm for the global comparison of sequences based on matching k-tuples of sequence elements for a fixed k. The method results in substantial reduction in the time required to search a data bank when compared with prior techniques of similarity analysis, with minimal loss in sensitivity. The algorithm has also been adapted, in a separate implementation, to produce rigorous sequence alignments. Currently, using the DEC KL-10 system, we can compare all sequences in the entire Protein Data Bank of the National Biomedical Research Foundation with a 350-residue query sequence in less than 3 min and carry out a similar analysis with a 500-base query sequence against all eukaryotic sequences in the Los Alamos Nucleic Acid Data Base in less than 2 min.

PubMed Disclaimer

Similar articles

Cited by

References

    1. J Mol Evol. 1981;18(1):38-46 - PubMed
    1. Proc Natl Acad Sci U S A. 1979 Jul;76(7):3041 - PubMed
    1. Nucleic Acids Res. 1982 Jan 11;10(1):197-206 - PubMed
    1. Proc Natl Acad Sci U S A. 1972 Jan;69(1):4-6 - PubMed
    1. Proc Natl Acad Sci U S A. 1981 Dec;78(12):7665-9 - PubMed