Detection of protein similarities using nucleotide sequence databases
- PMID: 3135536
- PMCID: PMC336856
- DOI: 10.1093/nar/16.13.6191
Abstract
A simple procedure is described for finding similarities between proteins using nucleotide sequence databases. The approach is illustrated by several examples of previously unknown correspondences with important biological implications: Drosophila elongation factor Tu is shown to be encoded by two genes that are differently expressed during development; a cluster of three Drosophila genes likely encode maltases; a flesh-fly fat body protein resembles the hypothesized Drosophila alcohol dehydrogenase ancestral protein; an unknown protein encoded at the multifunctional E. coli hisT locus resembles aspartate beta-semialdehyde dehydrogenase; and the E. coli tyrR protein is related to nitrogen regulatory proteins. These and other matches were discovered using a personal computer of the type available in most laboratories collecting DNA sequence data. As relatively few sequences were sampled to find these matches, it is likely that much of the existing data has not been adequately examined.
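The abstract does not spell out the procedure itself. As a rough illustration of the general idea of comparing a protein against nucleotide data, the sketch below translates a DNA sequence in all six reading frames and looks for an exact occurrence of a short peptide. Everything here is an assumption made for illustration, not the authors' method: the function names (`translate`, `six_frame_translations`, `find_peptide`) are hypothetical, the standard genetic code is assumed, and exact substring matching stands in for the scored similarity search a real tool would use.

```python
# Illustrative sketch only: six-frame translation plus exact peptide lookup.
# Assumes the standard genetic code; all function names are hypothetical.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each codon (in TCAG order) to its one-letter amino acid; '*' is a stop.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate DNA from its first base; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna: str) -> dict[int, str]:
    """Translate both strands in three offsets each.

    Keys +1..+3 are forward frames, -1..-3 are reverse-strand frames.
    """
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


def find_peptide(dna: str, peptide: str) -> list[tuple[int, int]]:
    """Return (frame, residue index) for each frame containing the peptide.

    Only the first exact occurrence per frame is reported; this is a
    simplification of a real similarity search.
    """
    hits = []
    for frame, protein in six_frame_translations(dna).items():
        index = protein.find(peptide)
        if index != -1:
            hits.append((frame, index))
    return hits


if __name__ == "__main__":
    # The peptide MAK is encoded on the reverse strand of this toy sequence.
    print(find_peptide("TTACTTGGCCAT", "MAK"))
```

Translating the database side rather than the query means a protein can be matched against coding regions that were never annotated or conceptually translated, at the cost of examining six candidate translations per nucleotide entry.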