Identification of protein coding regions by database similarity search
- PMID: 8485583
- DOI: 10.1038/ng0393-266
Abstract
Sequence similarity between a translated nucleotide sequence and a known biological protein can provide strong evidence for the presence of a homologous coding region, even between distantly related genes. The computer program BLASTX performs conceptual translation of a nucleotide query sequence followed by a protein database search in one programmatic step. We characterized the sensitivity of BLASTX recognition to the presence of substitution, insertion and deletion errors in the query sequence and to sequence divergence. Reading frames were reliably identified in the presence of 1% query errors, a rate that is typical for primary sequence data. BLASTX is appropriate for use in moderate and large scale sequencing projects at the earliest opportunity, when the data are most prone to containing errors.
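The conceptual translation step mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the BLASTX implementation: it assumes the standard genetic code, translates all six reading frames (an assumption about how the query is scanned), and leaves out the protein database search entirely. The function names `reverse_complement`, `translate`, and `six_frame_translations` are hypothetical, chosen only for this example.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Map frame labels (+1..+3 forward, -1..-3 reverse) to translations."""
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(seq[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    query = "ATGGCCTAA"
    for frame, protein in sorted(six_frame_translations(query).items()):
        print(f"frame {frame:+d}: {protein}")

    # A single inserted base (a hypothetical sequencing error) shifts the
    # downstream codons into a different reading frame.
    with_insertion = query[:3] + "A" + query[3:]
    for frame, protein in sorted(six_frame_translations(with_insertion).items()):
        print(f"frame {frame:+d} (with insertion): {protein}")
```

Translating every frame is one reason a translated search can tolerate insertion and deletion errors: residues downstream of a frameshift still appear in one of the other frames, so a database match can continue there.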