Protein language models are performant in structure-free virtual screening
- PMID: 39327890
- PMCID: PMC11427677
- DOI: 10.1093/bib/bbae480
Protein language models are performant in structure-free virtual screening
Abstract
Hitherto virtual screening (VS) has been typically performed using a structure-based drug design paradigm. Such methods typically require the use of molecular docking on high-resolution three-dimensional structures of a target protein-a computationally-intensive and time-consuming exercise. This work demonstrates that by employing protein language models and molecular graphs as inputs to a novel graph-to-transformer cross-attention mechanism, a screening power comparable to state-of-the-art structure-based models can be achieved. The implications thereof include highly expedited VS due to the greatly reduced compute required to run this model, and the ability to perform early stages of computer-aided drug design in the complete absence of 3D protein structures.
Keywords: cheminformatics; computer-aided drug design; protein language models; virtual screening.
© The Author(s) 2024. Published by Oxford University Press.
Figures





References
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources