Multiple DNA and protein sequence alignment based on segment-to-segment comparison

B Morgenstern¹, A Dress, T Werner

PMID: 8901539
PMCID: PMC37949
DOI: 10.1073/pnas.93.22.12098

Comparative Study

Proc Natl Acad Sci U S A. 1996 Oct 29;93(22):12098-103.

Affiliation

¹ National Research Center for Environment and Health, Institute of Mammalian Genetics, Neuherberg, Germany.

Abstract

In this paper, a new way to think about, and to construct, pairwise as well as multiple alignments of DNA and protein sequences is proposed. Rather than forcing alignments either to align single residues or to introduce gaps, by defining an alignment as a path running from the source to the sink in the associated dot-matrix diagram, we propose to consider alignments as consistent equivalence relations defined on the set of all positions occurring in all sequences under consideration. We also propose constructing alignments from whole segments exhibiting highly significant overall similarity rather than by aligning individual residues. Consequently, we present an alignment algorithm that (i) is based on segment-to-segment comparison instead of the commonly used residue-to-residue comparison and that (ii) avoids the well-known difficulties concerning the choice of appropriate gap penalties: gaps are not treated explicitly, but remain as those parts of the sequences that do not belong to any of the aligned segments. Finally, we discuss the application of our algorithm to two test examples and compare it with commonly used alignment methods. As a first example, we aligned a set of 11 DNA sequences coding for functional helix-loop-helix proteins. Though the sequences show only low overall similarity, our program correctly aligned all 11 functional sites, which was a unique result among the methods tested. As a by-product, the reading frames of the sequences were identified. Next, we aligned a set of ribonuclease H proteins and compared our results with alignments produced by other programs as reported by McClure et al. [McClure, M. A., Vasi, T. K. & Fitch, W. M. (1994) Mol. Biol. Evol. 11, 571-592]. Our program was one of the best scoring programs. However, in contrast to other methods, our protein alignments are independent of user-defined parameters.
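
The segment-to-segment idea can be illustrated with a minimal pairwise sketch. It is not the authors' implementation: the names `Fragment`, `candidate_fragments`, `consistent` and `greedy_alignment` are hypothetical, the plain match count stands in for the significance-based segment score the abstract alludes to, the `min_len` cutoff exists only for this toy version, and the greedy selection is a simplification chosen for brevity. What it does show is that gaps are never scored: they are simply the positions left outside every selected segment pair.

```python
from typing import List, NamedTuple


class Fragment(NamedTuple):
    """A gap-free pairing of a[i:i+length] with b[j:j+length]."""
    i: int
    j: int
    length: int
    score: int  # assumption: number of identical residues, not a significance weight


def candidate_fragments(a: str, b: str, min_len: int = 3) -> List[Fragment]:
    """Collect maximal runs of identical residues along every diagonal."""
    frags = []
    for d in range(-(len(a) - 1), len(b)):  # diagonal offset d = j - i
        i = max(0, -d)
        j = i + d
        run = 0
        while i < len(a) and j < len(b):
            if a[i] == b[j]:
                run += 1
            else:
                if run >= min_len:
                    frags.append(Fragment(i - run, j - run, run, run))
                run = 0
            i += 1
            j += 1
        if run >= min_len:  # run that reaches the end of the diagonal
            frags.append(Fragment(i - run, j - run, run, run))
    return frags


def consistent(f: Fragment, g: Fragment) -> bool:
    """True if one fragment ends before the other starts in both sequences."""
    f_before_g = f.i + f.length <= g.i and f.j + f.length <= g.j
    g_before_f = g.i + g.length <= f.i and g.j + g.length <= f.j
    return f_before_g or g_before_f


def greedy_alignment(a: str, b: str, min_len: int = 3) -> List[Fragment]:
    """Accept fragments in order of decreasing score, skipping inconsistent ones."""
    chosen: List[Fragment] = []
    ranked = sorted(candidate_fragments(a, b, min_len), key=lambda f: -f.score)
    for f in ranked:
        if all(consistent(f, g) for g in chosen):
            chosen.append(f)
    return sorted(chosen)


if __name__ == "__main__":
    for frag in greedy_alignment("ACGTTTGCA", "ACGAAGCA"):
        print(frag)
```

For the two toy sequences in the `__main__` block, the selected fragments pair `ACG` with `ACG` and `GCA` with `GCA`; the unmatched `TTT` and `AA` between them are left unaligned rather than penalized. Extending this to multiple sequences would require checking consistency across all pairwise fragments at once, which this sketch does not attempt.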
