Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 30;26(17):5253.
doi: 10.3390/molecules26175253.

Congenericity of Claimed Compounds in Patent Applications

Affiliations

Congenericity of Claimed Compounds in Patent Applications

Maria J Falaguera et al. Molecules. .

Abstract

A method is presented to analyze quantitatively the degree of congenericity of claimed compounds in patent applications. The approach successfully differentiates patents exemplified with highly congeneric compounds of a structurally compact and well defined chemical series from patents containing a more diverse set of compounds around a more vaguely described patent claim. An application to 750 common patents available in SureChEMBL, SureChEMBLccs and ChEMBL is presented and the congenericity of patent compounds in those different sources discussed.

Keywords: ChEMBL; SureChEMBL; SureChEMBLccs; chemical series; patent compounds; similarity analysis.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Centroid (a) and all pairwise (b) similarity distributions for the set of exemplified molecules in SureChEMBLccs patent US-8598357. Additionally, included (a) is a sample of chemical structures with their SCHEMBL identifiers and similarity values (in parenthesis). The extent and mode are indicated, respectively, by vertically dashed and solid lines.
Figure 2
Figure 2
Centroid (a) and all pairwise (b) similarity distributions for the set of molecules exemplified in SureChEMBLccs patent US-8586617. Additionally, included (a) is a sample of chemical structures with their SCHEMBL identifiers and similarity values (in parenthesis). The extent and mode are indicated, respectively, by vertically dashed and solid lines.
Figure 3
Figure 3
Centroid (a) and all pairwise (b) similarity distributions for the set of molecules exemplified in SureChEMBLccs patent US-8962608. Additionally, included (a) is a sample of chemical structures with their SCHEMBL identifiers and similarity values (in parenthesis). The extent and mode are indicated, respectively, by vertically dashed and solid lines.
Figure 4
Figure 4
Pairwise correlations between the four descriptors obtained from centroid similarity distributions of 851 patents. The correlation coefficient (r2) is provided on top of each graph.
Figure 5
Figure 5
Centroid similarity distributions for four patents with almost identical density values. The mode and the extent are indicated, respectively, by vertically solid and dashed lines.
Figure 6
Figure 6
Principal Component Analysis of 750 patents based on the corresponding molecules available in ChEMBL (white triangles), SureChEMBLccs (dark grey circles) and SureChEMBL (light grey circles). PC1 and PC2 describe 75% and 23% of the total variance, respectively.
Figure 7
Figure 7
Congenericity Score (CScore) distributions for the same 750 patents in SureChEMBL, SureChEMBLccs and ChEMBL.

References

    1. Bemis G.W., Murcko M.A. The properties of known drugs. 1. Molecular frameworks. J. Med. Chem. 1996;39:2887–2893. doi: 10.1021/jm9602928. - DOI - PubMed
    1. Cases M., Garcia-Serna R., Hettne K., Weeber M., van der Lei J., Boyer S., Mestres J. Chemical and biological profiling of an annotated compound library to the nuclear receptor family. Curr. Top. Med. Chem. 2005;5:763–772. doi: 10.2174/1568026054637665. - DOI - PubMed
    1. Schuffenhauer A., Ertl P., Roggo S., Wetzel S., Koch M.A., Waldmann H. The scaffold tree—Visualization of the scaffold universe by hierarchical scaffold classification. J. Chem. Inf. Model. 2007;47:47–58. doi: 10.1021/ci600338x. - DOI - PubMed
    1. Dalke A., Hert J., Kramer C. Mmpdb: An open-source matched molecular pair platform for large multiproperty data sets. J. Chem. Inf. Model. 2018;58:902–910. doi: 10.1021/acs.jcim.8b00173. - DOI - PubMed
    1. Bandyopadhyay D., Kreatsoulas C., Brady P.G., Boyer J., He Z., Scavello G., Jr., Peryea T., Jadhav A., Nguyen D.-T., Guha R. Scaffold-based analytics: Enabling hit-to-lead decisions by visualizing chemical series linked across large datasets. J. Chem. Inf. Model. 2019;59:4880–4892. doi: 10.1021/acs.jcim.9b00243. - DOI - PubMed