Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec;11(45):e2403548.
doi: 10.1002/advs.202403548. Epub 2024 Oct 4.

From Small Data Modeling to Large Language Model Screening: A Dual-Strategy Framework for Materials Intelligent Design

Affiliations

From Small Data Modeling to Large Language Model Screening: A Dual-Strategy Framework for Materials Intelligent Design

Yeyong Yu et al. Adv Sci (Weinh). 2024 Dec.

Abstract

Small data in materials present significant challenges to constructing highly accurate machine learning models, severely hindering the widespread implementation of data-driven materials intelligent design. In this study, the Dual-Strategy Materials Intelligent Design Framework (DSMID) is introduced, which integrates two innovative methods. The Adversarial domain Adaptive Embedding Generative network (AAEG) transfers data between related property datasets, even with only 90 data points, enhancing material composition characterization and improving property prediction. Additionally, to address the challenge of screening and evaluating numerous alloy designs, the Automated Material Screening and Evaluation Pipeline (AMSEP) is implemented. This pipeline utilizes large language models with extensive domain knowledge to efficiently identify promising experimental candidates through self-retrieval and self-summarization. Experimental findings demonstrate that this approach effectively identifies and prepares new eutectic High Entropy Alloy (EHEA), notably Al14(CoCrFe)19Ni28, achieving an ultimate tensile strength of 1085 MPa and 24% elongation without heat treatment or extra processing. This demonstrates significantly greater plasticity and equivalent strength compared to the typical as-cast eutectic HEA AlCoCrFeNi2.1. The DSMID framework, combining AAEG and AMSEP, addresses the challenges of small data modeling and extensive candidate screening, contributing to cost reduction and enhanced efficiency of material design. This framework offers a promising avenue for intelligent material design, particularly in scenarios constrained by limited data availability.

Keywords: adversarial domain adaptation; experimental candidates screening; material intelligent design; small data modeling.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Overview of the Dual‐Strategy Materials Intelligent Design (DSMID) Framework. Adversarial domain Adaptive Embedding Generative network (AAEG) enhances material property prediction by aligning feature representations across domains using domain‐adaptive techniques. Automated Material Screening and Evaluation Pipeline (AMSEP) utilizes NSGA‐II and large language models to generate and identify materials with superior mechanical properties. Synthesis and Validate confirms the model's predictive accuracy and supports the reverse design of materials.
Figure 2
Figure 2
Adversarial domain adaptive embedding generative network structure. F: Feature representation network, whose output of the last hidden layer is used as the generated embedding of materials composition. C: Source domain bucketing network, used to provide gradients for F to perform source domain bucketing task. G: Generation network, used to provide gradients for F to confuse the source and target domain. D: Discriminator network is designed to perform two critical tasks: discriminate between the source and target domains for classification and source domain target properties bucketing. The network is adversarially trained with G and F and is directly transferred to F utilizing residual connections to minimize domain shift between the source and target domains.
Figure 3
Figure 3
Automated Screening and Evaluation of Inverse Design Candidates via Joint Data and Rule Stream Approach.
Figure 4
Figure 4
(Top row) Enhanced prediction accuracy in HEAs through AAEG: a tenfold validation analysis. (Bottom row) Comparative analysis of predictive and actual mechanical properties in OOD HEAs with the AAEG.
Figure 5
Figure 5
Overview of the Inverse Design Results Evaluated Using AMSEP. a) Comparison of experimental candidates with existing data points in the dataset. b) Evaluation of experimental schemes without additional processing. The mean UTS of this design outcome is 1071.36 MPa, while the mean EL is 16.36%.
Figure 6
Figure 6
The microstructure and true stress versus true strain curves of as‐cast design alloy: a) Electron backscatter diffraction (EBSD) phase image, b) X‐ray diffractometer (XRD) pattern. c) True stress versus true strain curves of the design HEA with a strain rate of 10−3 s −1, the inset is the geometrically necessary dislocations (GND) density of design HEA before and after tension. Detailed microstructural changes in Al14(CoCrFe)19Ni28 EHEA under tension can be found in Section S7 (Supporting Information).

Similar articles

References

    1. Zhao L., Li Y., Yu M., Peng Y., Ran F., Adv. Sci. 2023, 10, 2300283. - PMC - PubMed
    1. Marzari N., Ferretti A., Wolverton C., Nat. Mater. 2021, 20, 736. - PubMed
    1. He D., Liu Q., Mi Y., Meng Q., Xu L., Hou C., Wang J., Li N., Liu Y., Chai H., et al., Adv. Sci. 2024, 2307245. - PMC - PubMed
    1. Sendek A. D., Ransom B., Cubuk E. D., Pellouchoud L. A., Nanda J., Reed E. J., Adv. Energy Mater. 2022, 12, 2200553.
    1. Liu Y., Guo B., Zou X., Li Y., Shi S., Energy Storage Mater. 2020, 31, 434.

LinkOut - more resources