Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Apr 8;105(14):5441-6.
doi: 10.1073/pnas.0704422105. Epub 2008 Apr 2.

Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments

Affiliations

Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments

Lei Xie et al. Proc Natl Acad Sci U S A. .

Abstract

Here, a scalable, accurate, reliable, and robust protein functional site comparison algorithm is presented. The key components of the algorithm consist of a reduced representation of the protein structure and a sequence order-independent profile-profile alignment (SOIPPA). We show that SOIPPA is able to detect distant evolutionary relationships in cases where both a global sequence and structure relationship remains obscure. Results suggest evolutionary relationships across several previously evolutionary distinct protein structure superfamilies. SOIPPA, along with an increased coverage of protein fold space afforded by the structural genomics initiative, can be used to further test the notion that fold space is continuous rather than discrete.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest statement: This work is part of a provisional patent application filed by the University of California San Diego (University of California San Diego Reference No. SD2008-001-1).

Figures

Fig. 1.
Fig. 1.
RMSD distribution of the aligned common fragments of ligands from the 247-benchmark, using maximum size clique with amino acid grouping (Amino Acid Grouping), maximum weight common subgraph with chemical similarity (Chemical Similarity), substitution matrix, SOIPPA, and CE.
Fig. 2.
Fig. 2.
False-positive ratio vs. true-positive ratio for PSI-BLAST, CE, and SOIPPA. The aligned 247-benchmark pair is defined as a true positive if the two proteins are from the same (A) and different (B) SCOP superfamilies.
Fig. 3.
Fig. 3.
False-positive vs. true-positive ratios, using all 247-benchmark pairs, using maximum size clique with amino acid grouping (Amino Acid Group); maximum weight common subgraph with chemical similarity (Chemical Similarity); and substitution matrix, SOIPPA, and CE.
Fig. 4.
Fig. 4.
Multiple functional site alignment of a Rossmann NAD-binding protein (PDB ID code 2C5A) with four other proteins having the Rossmann topology but different SCOP superfamilies. The three conserved motifs are marked motif 1–3. The most conserved residues are labeled as red; and partially conserved residues are labeled as blue. Their corresponding SCOP superfamilies are listed in Table 1. The multiple functional site alignment is generated from pairwise SOIPPA alignments.

Similar articles

Cited by

References

    1. Orengo CA, Thornton JM. Protein families and their evolution—a structural perspective. Annu Rev Biochem. 2005;74:867–900. - PubMed
    1. Whisstock JC, Lesk AM. Prediction of protein function from protein sequence and structure. Q Rev Biophys. 2003;36:307–340. - PubMed
    1. Dobson PD, Cai YD, Stapley BJ, Doig AJ. Prediction of protein function in the absence of significant sequence similarity. Curr Med Chem. 2004;11:2135–2142. - PubMed
    1. Andreeva A, Murzin AG. Evolution of protein fold in the presence of functional constraints. Curr Opin Struct Biol. 2006;16:399–408. - PubMed
    1. Murzin AG. How far divergent evolution goes in protein. Curr Opin Struct Biol. 1998;8:380–387. - PubMed

Publication types

LinkOut - more resources