Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 16;15(45):18903-18919.
doi: 10.1039/d4sc03609c. Online ahead of print.

Diversity-driven, efficient exploration of a MOF design space to optimize MOF properties

Affiliations

Diversity-driven, efficient exploration of a MOF design space to optimize MOF properties

Tsung-Wei Liu et al. Chem Sci. .

Abstract

Metal-organic frameworks (MOFs) promise to engender technology-enabling properties for numerous applications. However, one significant challenge in MOF development is their overwhelmingly large design space, which is intractable to fully explore even computationally. To find diverse optimal MOF designs without exploring the full design space, we develop Vendi Bayesian optimization (VBO), a new algorithm that combines traditional Bayesian optimization with the Vendi score, a recently introduced interpretable diversity measure. Both Bayesian optimization and the Vendi score require a kernel similarity function, we therefore also introduce a novel similarity function in the space of MOFs that accounts for both chemical and structural features. This new similarity metric enables VBO to find optimal MOFs with properties that may depend on both chemistry and structure. We statistically assessed VBO by its ability to optimize three NH3-adsorption dependent performance metrics that depend, to different degrees, on MOF chemistry and structure. With ten simulated campaigns done for each metric, VBO consistently outperformed random search to find high-performing designs within a 1000-MOF subset for (i) NH3 storage, (ii) NH3 removal from membrane plasma reactors, and (iii) NH3 capture from air. Then, with one campaign dedicated to finding optimal MOFs for NH3 storage in a "hybrid" ∼10 000-MOF database, we identify twelve extant and eight hypothesized MOF designs with potentially record-breaking working capacity ΔN NH3 between 300 K and 400 K at 1 bar. Specifically, the best MOF designs are predicted to (i) achieve ΔN NH3 values between 23.6 and 29.3 mmol g-1, potentially surpassing those that MOFs previously experimentally tested for NH3 adsorption would have at the proposed operation conditions, (ii) be thermally stable at the operation conditions and (iii) require only ca. 10% of the energy content in NH3 to release the stored molecule from the MOF. Finally, the analysis of the generated simulation data during the search indicates that a pore size of around 10 Å, a heat of adsorption around 33 kJ mol-1, and the presence of Ca could be part of MOF design rules that could help optimize NH3 working capacity at the proposed operation conditions.

PubMed Disclaimer

Conflict of interest statement

There are no conflicts to declare.

Figures

Fig. 1
Fig. 1. Applications for which NH3 adsorption-based MOF performance metrics were optimized to test the efficacy of our Vendi Bayesian optimization (VBO) framework. (a) Adsorptive NH3 storage at ambient conditions with release at 400 K. (b) Membrane-based NH3 removal from plasma reactors during NH3 synthesis at 400 K and 1 bar. (c) Dilute NH3 capture from air in adsorbent traps at ambient conditions. Gas-phase composition relevant to each application indicated at the top. The three chosen metrics present different levels of dependence on MOF chemistry and structure.
Fig. 2
Fig. 2. Workflow for our VBO framework. an initial GP, trained with data for two randomly chosen MOFs, is used to predict the performance metric in the starting database. k + 1 MOFs are selected for molecular simulation evaluation based on the upper confidence bound (UCB) acquisition function. One MOF is chosen as the MOF scoring the highest UCB just as in standard Bayes optimization. The remaining k MOFs are selected based on UCB but only after 10% of the database is pruned. The MOFs pruned from the database are the MOFs that would increase the least the Vendi score of the cumulative set of MOFs evaluated by molecular simulation. The top k + 1 MOFs selected are then evaluated using molecular simulations. To perform a new iteration, the molecular simulation data for the newly evaluated k + 1 MOFs are added to the data for training the GP, and the MOF selection process is repeated.
Fig. 3
Fig. 3. Schematic representation of methods to calculate kernel similarity between MOFs. (a) Chemical similarity (Knode and Klinker kernels) obtained by decomposing two MOFs into their building blocks, and calculating the Tanimoto index between the Morgan fingerprints of their building blocks. (b) Global textural properties similarity (Kglobal kernel) obtained by calculating the radial basis function kernel of the Euclidean distance between the property vectors of two MOFs. (c) Detailed pore structure similarity (KPSD kernel) obtained by calculating the difference between one and the Jensen–Shannon divergence between the pore size distributions (PSDs) of two MOFs. The different kernels cover different aspects of MOFs, and by tuning the weights of each kernel, the representation is adaptable to prediction of properties with different level of dependence on MOF chemistry and structure.
Fig. 4
Fig. 4. MOF mapping onto two-dimensional plots by using multidimensional scaling (MDS) representations. (a) All MOFs in the hybrid database colored by their origin (either the ToBaCCo database or the CoRE database). (b–d) 1000 random MOF subset, colored by range of ΔNNH3 (b), MATS (c), and MATSTH (d) performance metrics. The extent of segregation observed is a harbinger of the efficacy of our MOF kernel similarity as input to train the GP.
Fig. 5
Fig. 5. Prediction performance of GP models (trained on a subset of 1000 random MOFs extracted from the hybrid database) to predict (a) ΔNNH3, (b) MATS, and (c) MATSTH. GP predictions appear on the vertical axis, while the ground truth (from molecular simulation) appears on the horizontal axis. The parity line is presented in red. Each point represents the prediction for a MOF, with the corresponding error bar representing the uncertainty of the predictions based on the prediction standard deviation. The observed prediction performance was found on subsequent statistical testing to be sufficient to make VBO effective.
Fig. 6
Fig. 6. Efficacy of VBO (blue) applied on a 1000 subset of random MOFs compared to Bayesian optimization (green) and random search (orange). Top row presents the evolution of the highest value of the performance metric as the number of MOF evaluations increases for (a) ΔNNH3 for ammonia storage, (b) MATS for ammonia removal from plasma reactor, and (c) MATSTH for ammonia capture from air. Bottom row presents the evolution of the Vendi score for the set of evaluated MOFs as the number of MOF evaluations increases for (d) ΔNNH3 for ammonia storage, (e) MATS for ammonia removal from plasma reactor, and (f) MATSTH for ammonia capture from air. Results in (a)–(f) are averaged across 10 repeat runs, the average value is indicated by the solid line, whereas the standard deviation is indicated by the shaded area. Both VBO and Bayesian optimization outperformed random search, but VBO provided higher diversity of MOF “solutions.”
Fig. 7
Fig. 7. Evolution of VBO campaign (blue) in the ∼10 000 MOF database, when searching for MOFs for NH3 storage, compared to the evolution of the random search (orange). (a) Evolution of the highest ΔNNH3 found among evaluated MOF at a given point in the campaign. (b) Evolution of the average ΔNNH3 among the top-20 evaluated MOFs at a given point in the campaign. (c) Evolution of the Vendi score of evaluated MOFs at a given point in the campaign. Note that the VBO campaign was ended early due to negligible changes in the highest ΔNNH3 since the 80th evaluation. Once again VBO greatly outperformed random search.
Fig. 8
Fig. 8. Plots of structure–performance relationships for NH3 storage. Each square bin corresponds to a combination of the ΔNNH3 performance metric and MOF property, where the transparency of each square bin is indicative of the number of MOFs in the bin, and the color of each bin reflects the average value of the property in the side color scale across all MOFs in the bin. (a) ΔNNH3versus MOF average pore diameter (APD), with each bin colored by MOF void fraction. (b) ΔNNH3versus heat of adsorption Qst, with each bin colored by gravimetric surface area. Optimal APD and Qst appears to be 10 Å and 33 kJ mol−1, respectively.
Fig. 9
Fig. 9. Statistical significance for comparison of elemental compositions between the top-14 MOFs and the entire database based on the p-values derived from the t-test. The dashed line represents our chosen critical value for the one-sided t-test. Bars that fall below this threshold indicate elements that are statistically significantly more abundant in the top-performing MOFs. Ca is a metal that appears significantly more frequently in the top-14 MOFs than in the full database.
Fig. 10
Fig. 10. Top-6 MOFs ranked by ΔNNH3 value. Hypothesized (a–c) and extant MOFs (d–f) are in the top and bottom rows, respectively. VF, GSA, and APD represent, respectively, the void fraction, gravimetric surface areas, and average pore diameter from pore size distribution. The CSD refcode and corresponding publication can be found in Table S5 with (a) n = 1, (b) n = 2, (c) n = 4, (d) n = 3, (e) n = 5, (f) n = 6, where n is the MOF ranking. The three hypothesized MOFs are potentially synthesizable per the free energy criterion by Anderson and Gómez-Gualdrón (Table S6†). Ca MOFs appear in the top-14 but not in top-6 presumably due to suboptimal textural properties.
Fig. 11
Fig. 11. (a) Thermal stability in top-20 MOFs from VBO campaign. Blue diamonds indicate ΔNNH3 (left-axis) and red circles indicate predicted thermal decomposition temperature (right-axis). Top-20 MOFs appear likely to withstand operation conditions. (b) Estimated energy penalty to release stored NH3 as percentage of the hydrogen-based energy content of NH3 (22.5 MJ kgNH3−1) in the top-20 MOFs. Penalty hovers around 8 to 12 percent on NH3 energy content.

References

    1. Gomez-Gualdron D. A. Gutov O. V. Krungleviciute V. Borah B. Mondloch J. E. Hupp J. T. Yildirim T. Farha O. K. Snurr R. Q. Computational design of metal–organic frameworks based on stable zirconium building units for storage and delivery of methane. Chem. Mater. 2014;26:5632–5639. doi: 10.1021/cm502304e. - DOI
    1. Zheng J. Tian J. Wu D. Gu M. Xu W. Wang C. Gao F. Engelhard M. H. Zhang J.-G. Liu J. et al., Lewis acid-base interactions between polysulfides and metal organic framework in lithium sulfur batteries. Nano Lett. 2014;14:2345–2352. doi: 10.1021/nl404721h. - DOI - PubMed
    1. Gómez-Gualdrón D. A. Colón Y. J. Zhang X. Wang T. C. Chen Y.-S. Hupp J. T. Yildirim T. Farha O. K. Zhang J. Snurr R. Q. Evaluating topologically diverse metal–organic frameworks for cryo-adsorbed hydrogen storage. Energy Environ. Sci. 2016;9:3279–3289. doi: 10.1039/C6EE02104B. - DOI
    1. Zhao X. Liu S. Tang Z. Niu H. Cai Y. Meng W. Wu F. Giesy J. P. Synthesis of magnetic metal-organic framework (MOF) for efficient removal of organic dyes from water. Sci. Rep. 2015;5:11849. doi: 10.1038/srep11849. - DOI - PMC - PubMed
    1. Kirchon A. Feng L. Drake H. F. Joseph E. A. Zhou H.-C. From fundamentals to applications: a toolbox for robust and multifunctional MOF materials. Chem. Soc. Rev. 2018;47:8611–8638. doi: 10.1039/C8CS00688A. - DOI - PubMed

LinkOut - more resources