From Small Data Modeling to Large Language Model Screening: A Dual-Strategy Framework for Materials Intelligent Design
- PMID: 39364764
- PMCID: PMC11615768
- DOI: 10.1002/advs.202403548
From Small Data Modeling to Large Language Model Screening: A Dual-Strategy Framework for Materials Intelligent Design
Abstract
Small data in materials present significant challenges to constructing highly accurate machine learning models, severely hindering the widespread implementation of data-driven materials intelligent design. In this study, the Dual-Strategy Materials Intelligent Design Framework (DSMID) is introduced, which integrates two innovative methods. The Adversarial domain Adaptive Embedding Generative network (AAEG) transfers data between related property datasets, even with only 90 data points, enhancing material composition characterization and improving property prediction. Additionally, to address the challenge of screening and evaluating numerous alloy designs, the Automated Material Screening and Evaluation Pipeline (AMSEP) is implemented. This pipeline utilizes large language models with extensive domain knowledge to efficiently identify promising experimental candidates through self-retrieval and self-summarization. Experimental findings demonstrate that this approach effectively identifies and prepares new eutectic High Entropy Alloy (EHEA), notably Al14(CoCrFe)19Ni28, achieving an ultimate tensile strength of 1085 MPa and 24% elongation without heat treatment or extra processing. This demonstrates significantly greater plasticity and equivalent strength compared to the typical as-cast eutectic HEA AlCoCrFeNi2.1. The DSMID framework, combining AAEG and AMSEP, addresses the challenges of small data modeling and extensive candidate screening, contributing to cost reduction and enhanced efficiency of material design. This framework offers a promising avenue for intelligent material design, particularly in scenarios constrained by limited data availability.
Keywords: adversarial domain adaptation; experimental candidates screening; material intelligent design; small data modeling.
© 2024 The Author(s). Advanced Science published by Wiley‐VCH GmbH.
Conflict of interest statement
The authors declare no conflict of interest.
Figures






Similar articles
-
Simultaneous Strength-Ductility Enhancement of a Nano-Lamellar AlCoCrFeNi2.1 Eutectic High Entropy Alloy by Cryo-Rolling and Annealing.Sci Rep. 2018 Feb 19;8(1):3276. doi: 10.1038/s41598-018-21385-y. Sci Rep. 2018. PMID: 29459746 Free PMC article.
-
High entropy alloy property predictions using a transformer-based language model.Sci Rep. 2025 Apr 7;15(1):11861. doi: 10.1038/s41598-025-95170-z. Sci Rep. 2025. PMID: 40195458 Free PMC article.
-
A promising new class of high-temperature alloys: eutectic high-entropy alloys.Sci Rep. 2014 Aug 27;4:6200. doi: 10.1038/srep06200. Sci Rep. 2014. PMID: 25160691 Free PMC article.
-
[Application scenario design and prospect of generative artificial intelligence (AI) in intelligent manufacturing and supply chain of traditional Chinese medicine].Zhongguo Zhong Yao Za Zhi. 2024 Jul;49(14):3963-3970. doi: 10.19540/j.cnki.cjcmm.20240402.301. Zhongguo Zhong Yao Za Zhi. 2024. PMID: 39099369 Review. Chinese.
-
Intelligent Systems for Inorganic Nanomaterial Synthesis.Nanomaterials (Basel). 2025 Apr 21;15(8):631. doi: 10.3390/nano15080631. Nanomaterials (Basel). 2025. PMID: 40278497 Free PMC article. Review.
References
-
- Marzari N., Ferretti A., Wolverton C., Nat. Mater. 2021, 20, 736. - PubMed
-
- Sendek A. D., Ransom B., Cubuk E. D., Pellouchoud L. A., Nanda J., Reed E. J., Adv. Energy Mater. 2022, 12, 2200553.
-
- Liu Y., Guo B., Zou X., Li Y., Shi S., Energy Storage Mater. 2020, 31, 434.
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous