Numerical classification of coding sequences
- PMID: 1561097
- PMCID: PMC312190
- DOI: 10.1093/nar/20.6.1405
Numerical classification of coding sequences
Abstract
DNA sequences coding for protein may be represented by counts of nucleotides or codons. A complete reading frame may be abbreviated by its base count, e.g. A76C158G121T74, or with the corresponding codon table, e.g. (AAA)0(AAC)1(AAG)9 ... (TTT)0. We propose that these numerical designations be used to augment current methods of sequence annotation. Because base counts and codon tables do not require revision as knowledge of function evolves, they are well-suited to act as cross-references, for example to identify redundant GenBank entries. These descriptors may be compared, in place of DNA sequences, to extract homologous genes from large databases. This approach permits rapid searching with good selectivity.
Similar articles
-
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].Yi Chuan Xue Bao. 2004 May;31(5):431-43. Yi Chuan Xue Bao. 2004. PMID: 15478601 Chinese.
-
FISH: a guide to protein-coding DNA sequences in the GenBank database.Comput Appl Biosci. 1993 Jun;9(3):337-42. doi: 10.1093/bioinformatics/9.3.337. Comput Appl Biosci. 1993. PMID: 8324634
-
CRITICA: coding region identification tool invoking comparative analysis.Mol Biol Evol. 1999 Apr;16(4):512-24. doi: 10.1093/oxfordjournals.molbev.a026133. Mol Biol Evol. 1999. PMID: 10331277
-
A complementary circular code in the protein coding genes.J Theor Biol. 1996 Sep 7;182(1):45-58. doi: 10.1006/jtbi.1996.0142. J Theor Biol. 1996. PMID: 8917736
-
Can codon usage bias explain intron phase distributions and exon symmetry?J Mol Evol. 2005 Jan;60(1):99-104. doi: 10.1007/s00239-004-0032-9. J Mol Evol. 2005. PMID: 15696372
Cited by
-
Relationship between G + C in silent sites of codons and amino acid composition of human proteins.J Mol Evol. 1993 Mar;36(3):201-13. doi: 10.1007/BF00160475. J Mol Evol. 1993. PMID: 8483158
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases