Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 May 13;22(10):5176.
doi: 10.3390/ijms22105176.

Functional and Material Properties in Nanocatalyst Design: A Data Handling and Sharing Problem

Affiliations
Review

Functional and Material Properties in Nanocatalyst Design: A Data Handling and Sharing Problem

Daniel Lach et al. Int J Mol Sci. .

Abstract

(1) Background: Properties and descriptors are two forms of molecular in silico representations. Properties can be further divided into functional, e.g., catalyst or drug activity, and material, e.g., X-ray crystal data. Millions of real measured functional property records are available for drugs or drug candidates in online databases. In contrast, there is not a single database that registers a real conversion, TON or TOF data for catalysts. All of the data are molecular descriptors or material properties, which are mainly of a calculation origin. (2) Results: Here, we explain the reason for this. We reviewed the data handling and sharing problems in the design and discovery of catalyst candidates particularly, material informatics and catalyst design, structural coding, data collection and validation, infrastructure for catalyst design and the online databases for catalyst design. (3) Conclusions: Material design requires a property prediction step. This can only be achieved based on the registered real property measurement. In reality, in catalyst design and discovery, we can observe either a severe functional property deficit or even property famine.

Keywords: catalyst property prediction; catalysts informatics; catalytic material database; cheminformatics for material discovery; data collection; data handling in catalyst discovery; data science; data sharing in catalyst discovery; infrastructure for catalyst property prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
The spiral of the optimization of the property design cycles in material or drug discovery.
Figure 2
Figure 2
Catalyst informatics for property production follows cheminformatics in drug design. Reprinted with permission from [22]. Copyright © 2021 John Wiley and Sons.
Figure 3
Figure 3
Simulation matrix for the HTS screening of metal and metal alloys. The ∆GH, which was calculated in silico is coded by the colors. The rows indicate the pure metal values, while the indicate the bimetallic solutes that were embedded in the individual metal surface layers. Reprinted with permission from [30]. Copyright © 2021 Royal Society of Chemistry.
Figure 4
Figure 4
Knowledge extraction in the HTS catalyst forward design. Reprinted with permission from [3]. Copyright © 2021 Elsevier.
Figure 5
Figure 5
The simplex method for generating the SiRMS descriptors for materials. Reprinted with permission from [39]. Copyright © 2021 American Chemical Society.
Figure 6
Figure 6
SiRMS representation of the material fragments that are influential and not-influential to a functional material property. Structural fragments that decrease the superconductivity critical temperatures (CT) are colored in red and those that enhance CT are shown in green. Non-influential fragments are in gray. (a) Ba2Ca2Cu3HgO8; (b) As2Ni2O6Sc2Sr4; (c) Mo6PbS8; (d) Mo6NdS8; (e) Li2Pd3B; (f) Li2Pt3B; (g) FeLaAsO and (h) FeLaPO. Reprinted with permission from [39]. Copyright © 2021 American Chemical Society
Figure 7
Figure 7
Complexity of the sciences from physics to the economy. Inspired from [1].
Figure 8
Figure 8
Schematic of the data structure of a Catalytic Material Database SQL.
Figure 9
Figure 9
Machine learning methods.
Figure 10
Figure 10
CMD home page.
Figure 11
Figure 11
The example of the catalyst view mode of CMC.
Figure 12
Figure 12
The example of the reaction view mode of CMC.
Figure 13
Figure 13
The data upload form in CMD.

References

    1. Polanski J., Gasteiger J. Computer Representation of Chemical Compounds. In: Leszczynski J., Kaczmarek-Kedziera A., Puzyn T., Papadopoulos M.G., Reis H., Shukla M.K.K., editors. Handbook of Computational Chemistry. Springer International Publishing; Cham, Germany: 2017. pp. 1997–2039.
    1. Polanski J., Duszkiewicz R. Property Representations and Molecular Fragmentation of Chemical Compounds in QSAR Modeling. Chemom. Intell. Lab. Syst. 2020;206:104146. doi: 10.1016/j.chemolab.2020.104146. - DOI
    1. Caruthers J.M., Lauterbach J.A., Thomson K.T., Venkatasubramanian V., Snively C.M., Bhan A., Katare S., Oskarsdottir G. Catalyst Design: Knowledge Extraction from High-Throughput Experimentation. J. Catal. 2003;216:98–109. doi: 10.1016/S0021-9517(02)00036-2. - DOI
    1. Polanski J. Encyclopedia of Bioinformatics and Computational Biology. Elsevier; Amsterdam, The Netherlands: 2019. Chemoinformatics: From Chemical Art to Chemistry in Silico; pp. 601–618.
    1. Goracci L., Tortorella S., Tiberi P., Pellegrino R.M., Di Veroli A., Valeri A., Cruciani G. Lipostar, a Comprehensive Platform-Neutral Cheminformatics Tool for Lipidomics. Anal. Chem. 2017;89:6257–6264. doi: 10.1021/acs.analchem.7b01259. - DOI - PubMed

Substances