A flexible method to align large numbers of biological sequences
- PMID: 3148736
- DOI: 10.1007/BF02143508
Abstract
A method for the alignment of two or more biological sequences is described. The method is a direct extension of the method of Taylor (1987), incorporating a consensus sequence approach, and allows considerable freedom in the control of the clustering of the sequences. At one extreme the method is equivalent to the earlier one (Taylor 1987), whereas at the other the clustering approaches the binary method of Feng and Doolittle (1987). This freedom allows the program to be adapted to particular problems, which can save considerable computer time and so allows very large problems to be tackled. In addition to a detailed analysis of the alignment of the cytochrome c superfamily, the clustering and alignment of the PIR sequence data bank (approximately 3500 sequences) are described.
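The abstract says that the clustering of sequences can be controlled, but gives no algorithmic detail. The sketch below is a rough illustration of that general idea only and is not the method of the paper. It assumes a plain Needleman-Wunsch global alignment score with a linear gap penalty and single-linkage grouping under a cutoff. The names `nw_score`, `cluster`, and `threshold`, and the scoring values, are hypothetical choices made for this example.

```python
from itertools import combinations


def nw_score(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment score (Needleman-Wunsch, linear gap penalty).

    The scoring values are illustrative assumptions, not taken from the paper.
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def cluster(seqs, threshold):
    """Single-linkage grouping: two sequences share a cluster when a chain of
    pairwise scores >= threshold connects them (hypothetical stand-in)."""
    parent = list(range(len(seqs)))

    def find(i):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if nw_score(seqs[i], seqs[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i, s in enumerate(seqs):
        groups.setdefault(find(i), []).append(s)
    return sorted(groups.values())


seqs = ["ACGT", "ACGA", "TTTT", "TTTA"]
print(cluster(seqs, threshold=2))   # moderate cutoff: similar sequences grouped
print(cluster(seqs, threshold=3))   # strict cutoff: every sequence on its own
print(cluster(seqs, threshold=-4))  # loose cutoff: a single large cluster
```

Varying `threshold` moves the grouping between many small clusters and one large cluster, which loosely mirrors the kind of tunable clustering the abstract mentions. Computing all pairwise scores is quadratic in the number of sequences, so this toy version would not scale to a data bank of thousands of sequences without further shortcuts.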