Sci Data. 2015 Jul 7:2:150032.
doi: 10.1038/sdata.2015.32. eCollection 2015.

A large-scale crop protection bioassay data set

Anna Gaulton et al. Sci Data. 2015.

Abstract

ChEMBL is a large-scale drug discovery database containing bioactivity information primarily extracted from scientific literature. Due to the medicinal chemistry focus of the journals from which data are extracted, the data are currently of most direct value in the field of human health research. However, many of the scientific use-cases for the current data set are equally applicable in other fields, such as crop protection research: for example, identification of chemical scaffolds active against a particular target or endpoint, the de-convolution of the potential targets of a phenotypic assay, or the potential targets/pathways for safety liabilities. In order to broaden the applicability of the ChEMBL database and allow more widespread use in crop protection research, an extensive data set of bioactivity data of insecticidal, fungicidal and herbicidal compounds and assays was collated and added to the database.

Conflict of interest statement

Syngenta is a commercial organization involved in crop protection research and development.

Figures

Figure 1. Diagram showing the data collection, standardization and integration process.
Details of assays performed, compounds tested and activity measurements were extracted from full text publications. Data were further standardized to normalize compound structures, convert units of measurement and assign target information, before being integrated into the ChEMBL database.
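The unit-conversion step mentioned in this caption can be illustrated with a minimal sketch. The record does not give the actual ChEMBL standardization rules, so everything here is an assumption made for illustration: the conversion table `TO_NANOMOLAR`, the helper names `to_nanomolar` and `standardize`, the record keys, and the choice of nM as the common unit are all hypothetical.

```python
# Illustrative only: these conversion factors and names are assumptions,
# not the rules actually applied when the data were loaded into ChEMBL.
TO_NANOMOLAR = {"M": 1e9, "mM": 1e6, "uM": 1e3, "nM": 1.0}


def to_nanomolar(value, unit):
    """Convert a molar concentration to nM; reject units not in the table."""
    if unit not in TO_NANOMOLAR:
        raise ValueError(f"unsupported unit: {unit!r}")
    return value * TO_NANOMOLAR[unit]


def standardize(record):
    """Return a copy of an activity record with the value expressed in nM.

    The original value and unit are kept so the conversion stays traceable.
    """
    standardized = dict(record)
    standardized["standard_value"] = to_nanomolar(record["value"], record["unit"])
    standardized["standard_units"] = "nM"
    return standardized


if __name__ == "__main__":
    # Hypothetical extracted measurement: an LC50 reported in micromolar.
    print(standardize({"type": "LC50", "value": 2.5, "unit": "uM"}))
```

Mass-based or application-rate units (for example mg/L or g/ha) are deliberately left unsupported in this sketch, since converting them to molar units would need information, such as molecular weight, that the record does not describe.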
Figure 2. Comparison of crop protection and medicinal chemistry data sets.
Pie charts showing a comparison of the features of the extracted crop protection assays with existing ChEMBL data (medicinal chemistry literature): (a) target organism distribution by number of assays, (b) assay format distribution by number of assays, (c) assay type distribution by number of assays.

Dataset use reported in

  • doi: 10.1093/nar/gkt1031

References

Data Citations

    1. Gaulton A. 2014. ChEMBL. http://dx.doi.org/10.6019/CHEMBL.database.19
