Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2001 Oct 19;313(2):399-416.
doi: 10.1006/jmbi.2001.5034.

Conservation helps to identify biologically relevant crystal contacts

Affiliations

Conservation helps to identify biologically relevant crystal contacts

W S Valdar et al. J Mol Biol. .

Abstract

Some crystal contacts are biologically relevant, most are not. We assess the utility of combining measures of size and conservation to discriminate between biological and non-biological contacts. Conservation and size information is calculated for crystal contacts in 53 families of homodimers and 65 families of monomers. Biological contacts are shown to be usually conserved and typically the largest contact in the crystal. A range of neural networks accepting different combinations and encodings of this information is used to answer the following questions: (1) is a given crystal contact biological, and (2) given all crystal contacts in a homodimer, which is the biological one? Predictions for (1) are performed on both homodimer and monomer datasets. The best performing neural network combined size and conservation inputs. For the homodimers, it correctly classified 48 out of 53 biological contacts and 364 out of 366 non-biological contacts, giving a combined accuracy of 98.3 %. A more robust performance statistic, the phi-coefficient, which accounts for imbalances in the dataset, gave a value of 0.92. Taking all 535 non-biological contacts from the 65 monomers, this predictor made erroneous classifications only 4.3 % of the time. Predictions for (2) were performed on homodimers only. The best performing network achieved a prediction accuracy of 98.1 % using size information alone. We conclude that in answering question (1) size and conservation combined discriminate biological from non-biological contacts better than either measure alone. For answering question (2), we conclude that in our dataset size is so powerful a discriminant that conservation adds little predictive benefit.

PubMed Disclaimer

Similar articles

Cited by

Publication types

MeSH terms

LinkOut - more resources