Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Aug 21:5:13275.
doi: 10.1038/srep13275.

CoMOGrad and PHOG: From Computer Vision to Fast and Accurate Protein Tertiary Structure Retrieval

Affiliations

CoMOGrad and PHOG: From Computer Vision to Fast and Accurate Protein Tertiary Structure Retrieval

Rezaul Karim et al. Sci Rep. .

Abstract

The number of entries in a structural database of proteins is increasing day by day. Methods for retrieving protein tertiary structures from such a large database have turn out to be the key to comparative analysis of structures that plays an important role to understand proteins and their functions. In this paper, we present fast and accurate methods for the retrieval of proteins having tertiary structures similar to a query protein from a large database. Our proposed methods borrow ideas from the field of computer vision. The speed and accuracy of our methods come from the two newly introduced features- the co-occurrence matrix of the oriented gradient and pyramid histogram of oriented gradient- and the use of Euclidean distance as the distance measure. Experimental results clearly indicate the superiority of our approach in both running time and accuracy. Our method is readily available for use from this website: http://research.buet.ac.bd:8080/Comograd/.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Representation of β sheets of domain d1n4ja in α carbon distance matrix gray-scale image.
Figure 2
Figure 2. Representation of α helices of domain d1irqa in α carbon distance matrix gray-scale image.
Figure 3
Figure 3. Level 1 quad tree of α carbon distance matrix image.
Figure 4
Figure 4. Percentage of matches of Class, Fold, Superfamily and Family for up to top 50 retrieval results.
Figure 5
Figure 5. MCC values for binary classification.

References

    1. Anfinsen C. B.. Principles that govern the folding of protein chains. Sci. 181, 223–230 (1973). - PubMed
    1. Tanford C. et al.. Protein denaturation. Adv. Protein Chem. 23, 121–282 (1968). - PubMed
    1. Wass M. N. & Sternberg M. J.. Prediction of ligand binding sites using homologous structures and conservation at CASP8. Proteins: Struct. Funct. Bioinforma. 77, 147–151 (2009). - PMC - PubMed
    1. Illergård K., Ardell D. H. & Elofsson A.. Structure is three to ten times more conserved than sequencea study of structural response in protein cores. Proteins: Struct. Funct. Bioinforma. 77, 499–508 (2009). - PubMed
    1. DeToma A. S., Salamekh S., Ramamoorthy A. & Lim M. H.. Misfolded proteins in alzheimer’s disease and type ii diabetes. Chem. Soc. Rev. 41, 608–621 (2012). - PMC - PubMed

LinkOut - more resources