Principle of codification for quick comparisons with the entire biomolecule databanks and associated programs in FORTRAN 77
- PMID: 3753764
- PMCID: PMC339374
- DOI: 10.1093/nar/14.1.197
Principle of codification for quick comparisons with the entire biomolecule databanks and associated programs in FORTRAN 77
Abstract
We propose a new method for homology search of nucleic acids or proteins in databanks. All the possible subsequences of a specific length in a sequence are converted into a code and stored in an indexed file (hash-coding). This preliminary work of codifying an entire bank is rather long but it enables an immediate access to all the sequence fragments of a given type. With our method a strict homology pattern of twenty nucleotides can be found for example in the Los Alamos bank (GENBANK) in less than 2 seconds. We can also use this data storage to considerably speed up the non-strict homology search programs and to write a program to help in the selection of nucleic acid hybridization probes.
Similar articles
-
A rapid access motif database (RAMdb) with a search algorithm for the retrieval patterns in nucleic acids or protein databanks.Comput Appl Biosci. 1995 Jun;11(3):273-9. doi: 10.1093/bioinformatics/11.3.273. Comput Appl Biosci. 1995. PMID: 7583695
-
A common philosophy and FORTRAN 77 software package for implementing and searching sequence databases.Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):397-407. doi: 10.1093/nar/12.1part1.397. Nucleic Acids Res. 1984. PMID: 6546424 Free PMC article.
-
A collection of programs for nucleic acid and protein analysis, written in FORTRAN 77 for IBM-PC compatible microcomputers.Nucleic Acids Res. 1986 Jan 10;14(1):455-65. doi: 10.1093/nar/14.1.455. Nucleic Acids Res. 1986. PMID: 3753781 Free PMC article.
-
Computational analysis of genetic sequences.Annu Rev Biophys Biophys Chem. 1986;15:79-95. doi: 10.1146/annurev.bb.15.060186.000455. Annu Rev Biophys Biophys Chem. 1986. PMID: 3521662 Review. No abstract available.
-
[Biosequence analysis system on a VAX computer].Tanpakushitsu Kakusan Koso. 1986 Jun;(29 Suppl):177-86. Tanpakushitsu Kakusan Koso. 1986. PMID: 3534953 Review. Japanese. No abstract available.
Cited by
-
Apple Macintosh programs for nucleic and protein sequence analyses.Nucleic Acids Res. 1988 Mar 11;16(5):1837-46. doi: 10.1093/nar/16.5.1837. Nucleic Acids Res. 1988. PMID: 2832832 Free PMC article.
-
Rhizobium meliloti fixGHI sequence predicts involvement of a specific cation pump in symbiotic nitrogen fixation.J Bacteriol. 1989 Feb;171(2):929-39. doi: 10.1128/jb.171.2.929-939.1989. J Bacteriol. 1989. PMID: 2536685 Free PMC article.
-
Approaching the function of new genes by detection of their potential upstream activation sequences in Saccharomyces cerevisiae: application to chromosome III.Curr Genet. 1994 May;25(5):396-406. doi: 10.1007/BF00351777. Curr Genet. 1994. PMID: 8082184
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources