Review
Cold Spring Harb Perspect Biol. 2024 Sep 3;16(9):a041458.
doi: 10.1101/cshperspect.a041458.

Artificial Intelligence Learns Protein Prediction

Michael Heinzinger et al. Cold Spring Harb Perspect Biol.

Abstract

From AlphaGo to Stable Diffusion to ChatGPT, the past decade of exponential advances in artificial intelligence (AI) has been altering life. In parallel, advances in computational biology are beginning to decode the language of life: AlphaFold2 leaped forward in protein structure prediction, and protein language models (pLMs) replaced expertise and evolutionary information from multiple sequence alignments with information learned from recurring patterns in databases of billions of proteins that carry no experimental annotations other than their amino acid sequences. None of those tools could have been developed 10 years ago; all will increase the wealth of experimental data and speed up the cycle from idea to proof. AI is advancing molecular and medical biology in giant steps, and the most important might be the leap toward more powerful protein design.
