Protein Function Analysis through Machine Learning
- PMID: 36139085
- PMCID: PMC9496392
- DOI: 10.3390/biom12091246
Protein Function Analysis through Machine Learning
Abstract
Machine learning (ML) has been an important arsenal in computational biology used to elucidate protein function for decades. With the recent burgeoning of novel ML methods and applications, new ML approaches have been incorporated into many areas of computational biology dealing with protein function. We examine how ML has been integrated into a wide range of computational models to improve prediction accuracy and gain a better understanding of protein function. The applications discussed are protein structure prediction, protein engineering using sequence modifications to achieve stability and druggability characteristics, molecular docking in terms of protein-ligand binding, including allosteric effects, protein-protein interactions and protein-centric drug discovery. To quantify the mechanisms underlying protein function, a holistic approach that takes structure, flexibility, stability, and dynamics into account is required, as these aspects become inseparable through their interdependence. Another key component of protein function is conformational dynamics, which often manifest as protein kinetics. Computational methods that use ML to generate representative conformational ensembles and quantify differences in conformational ensembles important for function are included in this review. Future opportunities are highlighted for each of these topics.
Keywords: allostery; conformational sampling; force fields; machine learning; molecular docking; protein dynamics; protein function; protein structure prediction; protein–protein interactions.
Conflict of interest statement
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Figures
References
-
- Jarvis R.A., Patrick E.A. Clustering Using a Similarity Measure Based on Shared Near Neighbors. IEEE Trans. Comput. 1973;C-22:1025–1034. doi: 10.1109/T-C.1973.223640. - DOI
-
- Sturm B.L., Ben-Tal O., Monaghan Ú., Collins N., Herremans D., Chew E., Hadjeres G., Deruty E., Pachet F. Machine learning research that matters for music creation: A case study. J. New Music Res. 2019;48:36–55. doi: 10.1080/09298215.2018.1515233. - DOI
-
- Rodolfa K.T., Lamba H., Ghani R. Empirical observation of negligible fairness–accuracy trade-offs in machine learning for public policy. Nat. Mach. Intell. 2021;3:896–904. doi: 10.1038/s42256-021-00396-x. - DOI
-
- Brook T. Music, Art, Machine Learning, and Standardization. Leonardo. 2021:1–11. doi: 10.1162/leon_a_02135. - DOI
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
