Comparative statistics for DNA and protein sequences: multiple sequence analysis
- PMID: 3929250
- PMCID: PMC391017
- DOI: 10.1073/pnas.82.18.6186
Comparative statistics for DNA and protein sequences: multiple sequence analysis
Abstract
Concepts and methods [Karlin, S. & Ghandour, G. (1985) Proc. Natl. Acad. Sci. USA 82, 5800-5804] for the analysis of patterns and relationships are extended to multiple DNA and protein sequences. Functionals include multiple sequence common word occurrence distributions, characterizations of high frequency shared words, and ascertainment of long block identities. Various comparisons of sequences using natural alphabets obtained from grouping nucleotides or amino acids by their chemical and functional characteristics are described. Specific applications are given to globin genes, mitochondrial genomes, and a variety of mammalian viruses.
Similar articles
-
Comparative statistics for DNA and protein sequences: single sequence analysis.Proc Natl Acad Sci U S A. 1985 Sep;82(17):5800-4. doi: 10.1073/pnas.82.17.5800. Proc Natl Acad Sci U S A. 1985. PMID: 2994049 Free PMC article.
-
DNA sequence comparisons of the human, mouse, and rabbit immunoglobulin kappa gene.Mol Biol Evol. 1985 Jan;2(1):35-52. doi: 10.1093/oxfordjournals.molbev.a040336. Mol Biol Evol. 1985. PMID: 3939702
-
The use of multiple alphabets in kappa-gene immunoglobulin DNA sequence comparisons.EMBO J. 1985 May;4(5):1217-23. doi: 10.1002/j.1460-2075.1985.tb03763.x. EMBO J. 1985. PMID: 3924599 Free PMC article.
-
Use of long sequence alignments to study the evolution and regulation of mammalian globin gene clusters.Mol Biol Evol. 1993 Jan;10(1):73-102. doi: 10.1093/oxfordjournals.molbev.a039991. Mol Biol Evol. 1993. PMID: 8383794 Review.
-
The variable genes of the human immunoglobulin kappa locus.Biol Chem Hoppe Seyler. 1993 Nov;374(11):1001-22. Biol Chem Hoppe Seyler. 1993. PMID: 8292259 Review. No abstract available.
Cited by
-
Multiple-alphabet amino acid sequence comparisons of the immunoglobulin kappa-chain constant domain.Proc Natl Acad Sci U S A. 1985 Dec;82(24):8597-601. doi: 10.1073/pnas.82.24.8597. Proc Natl Acad Sci U S A. 1985. PMID: 3936038 Free PMC article.
-
CLUSS: clustering of protein sequences based on a new similarity measure.BMC Bioinformatics. 2007 Aug 4;8:286. doi: 10.1186/1471-2105-8-286. BMC Bioinformatics. 2007. PMID: 17683581 Free PMC article.
-
Trajectory Modelling Techniques Useful to Epidemiological Research: A Comparative Narrative Review of Approaches.Clin Epidemiol. 2020 Oct 30;12:1205-1222. doi: 10.2147/CLEP.S265287. eCollection 2020. Clin Epidemiol. 2020. PMID: 33154677 Free PMC article. Review.
-
Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators.BMC Res Notes. 2014 Jan 27;7:63. doi: 10.1186/1756-0500-7-63. BMC Res Notes. 2014. PMID: 24468032 Free PMC article.
-
Molecular immunity to mycobacteria: knowledge from the mutation and phenotype spectrum analysis of Mendelian susceptibility to mycobacterial diseases.Int J Infect Dis. 2011 May;15(5):e305-13. doi: 10.1016/j.ijid.2011.01.004. Epub 2011 Feb 16. Int J Infect Dis. 2011. PMID: 21330176 Free PMC article. Review.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources