Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Apr 1;16(4):a041463.
doi: 10.1101/cshperspect.a041463.

Engineering Proteins Using Statistical Models of Coevolutionary Sequence Information

Affiliations
Review

Engineering Proteins Using Statistical Models of Coevolutionary Sequence Information

Jerry C Dinan et al. Cold Spring Harb Perspect Biol. .

Abstract

Homologous protein sequences are wonderfully diverse, indicating many possible evolutionary "solutions" to the encoding of function. Consequently, one can construct statistical models of protein sequence by analyzing amino acid frequency across a large multiple sequence alignment. A central premise is that covariance between amino acid positions reflects coevolution due to a shared functional or biophysical constraint. In this review, we describe the implementation and discuss the advantages, limitations, and recent progress on two coevolution-based modeling approaches: (1) Potts models of protein sequence (direct coupling analysis [DCA]-like), and (2) the statistical coupling analysis (SCA). Each approach detects interesting features of protein sequence and structure-the former emphasizes local physical contacts throughout the structure, while the latter identifies larger evolutionarily coupled networks of residues. Recent advances in large-scale gene synthesis and high-throughput functional selection now motivate additional work to benchmark model performance across quantitative function prediction and de novo design tasks.

PubMed Disclaimer

Similar articles

Cited by

References

    1. Alexander PA, He Y, Chen Y, Orban J, Bryan PN. 2009. A minimal sequence code for switching protein structure and function. Proc Natl Acad Sci 106: 21149–21154. 10.1073/pnas.0906408106 - DOI - PMC - PubMed
    1. Anfinsen CB. 1973. Principles that govern the folding of protein chains. Science 181: 223–230. 10.1126/science.181.4096.223 - DOI - PubMed
    1. Anfinsen CB, Scheraga HA. 1975. Experimental and theoretical aspects of protein folding. In Advances in protein chemistry (ed. Anfinsen CB, et al.), Vol. 29, pp. 205–300. Academic Press, New York. - PubMed
    1. Anishchenko I, Ovchinnikov S, Kamisetty H, Baker D. 2017. Origins of coevolution between residues distant in protein 3D structures. Proc Natl Acad Sci 114: 9122–9127. 10.1073/pnas.1702664114 - DOI - PMC - PubMed
    1. Baker D. 2014. Centenary award and Sir Frederick Gowland Hopkins memorial lecture. Protein folding, structure prediction and design. Biochem Soc Trans 42: 225–229. 10.1042/BST20130055 - DOI - PubMed

Publication types

LinkOut - more resources