Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan 6;53(D1):D1287-D1294.
doi: 10.1093/nar/gkae1050.

canSAR 2024-an update to the public drug discovery knowledgebase

Affiliations

canSAR 2024-an update to the public drug discovery knowledgebase

Phillip W Gingrich et al. Nucleic Acids Res. .

Abstract

canSAR (https://cansar.ai) continues to serve as the largest publicly available platform for cancer-focused drug discovery and translational research. It integrates multidisciplinary data from disparate and otherwise siloed public data sources as well as data curated uniquely for canSAR. In addition, canSAR deploys a suite of curation and standardization tools together with AI algorithms to generate new knowledge from these integrated data to inform hypothesis generation. Here we report the latest updates to canSAR. As well as increasing available data, we provide enhancements to our algorithms to improve the offering to the user. Notably, our enhancements include a revised ligandability classifier leveraging Positive Unlabeled Learning that finds twice as many ligandable opportunities across the pocketome, and our revised chemical standardization pipeline and hierarchy better enables the aggregation of structurally related molecular records.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
Structural ligandability for KRAS highlights accessibility to ligandable structures and pockets. (A) Color coding of ligandability and structural alignment by sequence and functional domains enables discovery of structures of interest and shows that there are 656 experimentally determined structural instances for KRAS, of which 535 are predicted to have a ligandable cavity (green). The top bar for every protein represents the AlphaFold2 model. (B) Each individual structure can be examined in detail. Ligandable pockets are visualized with present ligands and physicochemical property distributions.
Figure 2.
Figure 2.
The chemical hierarchy for pacritinib illustrates how related structural forms of compounds as well as their associated bioactivities are aggregated. (A) Hierarchical ordering of molecules affords traceability of chemical records according to the performed chemical standardization transformations. (B) Bioactivities are integrated alongside associated chemical structural data, allowing for discovery of related records and activities.
Figure 3.
Figure 3.
Scaffold searching for sorafenib tosylate. Fragments and isotopic labeling are now correctly removed from Bemis-Murcko Scaffolds. This allows for improved discovery of structurally related compounds and, by extension, associated bioactivities.
Figure 4.
Figure 4.
The ‘Mutations’ page for KRAS. This visualization integrates mutational frequency and types, functional annotations, protein–protein interaction sites and ligandability predictions, allowing users to identify opportunities in sequence space with likely functional consequence.

References

    1. Halling-Brown M.D., Bulusu K.C., Patel M., Tym J.E., Al-Lazikani B.. canSAR: an integrated cancer public translational research and drug discovery resource. Nucleic Acids Res. 2012; 40:D947–D956. - PMC - PubMed
    1. Bulusu K.C., Tym J.E., Coker E.A., Schierz A.C., Al-Lazikani B.. canSAR: updated cancer research and drug discovery knowledgebase. Nucleic Acids Res. 2014; 42:D1040–D1047. - PMC - PubMed
    1. Tym J.E., Mitsopoulos C., Coker E.A., Razaz P., Schierz A.C., Antolin A.A., Al-Lazikani B.. canSAR: an updated cancer research and drug discovery knowledgebase. Nucleic Acids Res. 2016; 44:D938–D943. - PMC - PubMed
    1. Coker E.A., Mitsopoulos C., Tym J.E., Komianou A., Kannas C., Di Micco P., Villasclaras Fernandez E., Ozer B., Antolin A.A., Workman P.. canSAR: update to the cancer translational research and drug discovery knowledgebase. Nucleic Acids Res. 2019; 47:D917–D922. - PMC - PubMed
    1. Mitsopoulos C., Di Micco P., Fernandez E.V., Dolciami D., Holt E., Mica I.L., Coker E.A., Tym J.E., Campbell J., Che K.H.. canSAR: update to the cancer translational research and drug discovery knowledgebase. Nucleic Acids Res. 2021; 49:D1074–D1082. - PMC - PubMed