Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Mar;30(3):167-72.
doi: 10.1007/s10930-011-9318-0.

Phylogenetic analysis of protein sequences based on distribution of length about common sub-string

Affiliations

Phylogenetic analysis of protein sequences based on distribution of length about common sub-string

Guisong Chang et al. Protein J. 2011 Mar.

Abstract

Up to now, various approaches for phylogenetic analysis have been developed. Almost all of them put stress on analyzing nucleic acid sequences or protein primary sequences. In this paper, we propose a new sequence distance for efficient reconstruction of phylogenetic trees based on the distribution of length about common sub-sequences between two sequences. We describe some applications of this method, which not only show the validity of the method, but also suggest a number of novel phylogenetic insights.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
The phylogenetic tree is constructed by our method HCS(AB). The proteomic sequence is a concatenation of all the known amino acid sequences for an organism, also with delimiters. Our phylogenetic tree can be obtained at any ionic strength, temperature, time
Fig. 2
Fig. 2
The phylogenetic tree is based on the distance of structural characteristic vector in Dai et al. . The proteomic sequence is a concatenation of all the known amino acid sequences for an organism, also with delimiters. The phylogenetic tree can be obtained at any ionic strength, temperature, time
Fig. 3
Fig. 3
The phylogenetic tree for 26 spike proteins is constructed based on our method HCS(AB). The proteomic sequence is a concatenation of all the known amino acid sequences for an organism, also with delimiters. Our phylogenetic tree can be obtained at any ionic strength, temperature, time

Similar articles

Cited by

References

    1. Cao Z, Liao B, Li R. Int J Quantum Chem. 2008;108:1485–1490. doi: 10.1002/qua.21698. - DOI
    1. Chang G, Wang T. J Biomol Struct Dyn. 2011;4:545–555. - PubMed
    1. Cover TM, Thomas JA (1991) In: Elements of information theory. Wiley, New York
    1. Dai Q, Liu X, Wang T. J Mol Struct. 2007;803:115–122.
    1. Dai Q, Yang Y, Wang T. Bioinformatics. 2008;24:2296–2302. doi: 10.1093/bioinformatics/btn436. - DOI - PubMed

Publication types

LinkOut - more resources